SGLang is a fast serving framework for large language models
An AI personal assistant for your digital brain
Efficient Triton Kernels for LLM Training
Deep learning library
The behavior guidance framework for customer-facing LLM agents
Renderer for the harmony response format to be used with gpt-oss
A modular graph-based Retrieval-Augmented Generation (RAG) system
A Python library powered by Language Models (LLMs)
Train a 26M-parameter GPT from scratch in just 2h
Flower: A Friendly Federated Learning Framework
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster
Build resilient language agents as graphs
Making Enterprise Data Intelligent and Responsive for AI
Replace OpenAI GPT with another LLM in your app
Industrial-strength Natural Language Processing (NLP)
Inference code for CodeLlama models
Conversational voice AI agents
Open-source observability for your LLM application
lightweight package to simplify LLM API calls
A refreshing functional take on deep learning
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Training data (data labeling, annotation, workflow) for all data types
Multilingual sentence & image embeddings with BERT
Helping you get the most out of AWS, wherever you use MCP
Operating LLMs in production