Stars
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Apache Amoro (incubating) is a Lakehouse management system built on open data lake formats.
This repository contains a curated collection of 300+ case studies from over 80 companies, detailing practical applications and insights into machine learning (ML) system design. The contents are o…
Repo to accompany my mastering LLM engineering course
Hands-on LLM Engineering including Agentic AI Project
Cline Rule Bank for data engineering frameworks / Query engines / Table Formats.
A curated list of awesome Jupyter projects, libraries and resources
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and open-source models.
Learn how to design systems at scale and prepare for system design interviews
A curated list of awesome open source tools and commercial products for ML Experiment Tracking and Management
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
Build modular and scalable LLM Applications in Rust
A lightweight, comprehensive solution for managing Delta tables, built on polars and deltalake
The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Efficient Inference of Transformer models
LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing internal workings of Transformer-based language models. *Check out demo at* https://huggingface.co/spaces/facebook/l…
Open-source search and retrieval database for AI applications.
Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website …
[ACL24] Official repo for "Synthesizing Text-to-SQL Data from Weak and Strong LLMs"
Framework for enhancing LLMs for RAG tasks using fine-tuning.
Large Language Models (LLMs)
Chronos: Pretrained Models for Time Series Forecasting
Perform data science on data that remains in someone else's server
"rsync for cloud storage" - Google Drive, S3, Dropbox, Backblaze B2, One Drive, Swift, Hubic, Wasabi, Google Cloud Storage, Azure Blob, Azure Files, Yandex Files
A modular graph-based Retrieval-Augmented Generation (RAG) system

