A high-throughput and memory-efficient inference and serving engine
Robust Speech Recognition via Large-Scale Weak Supervision
Qwen3-Coder is the code version of Qwen3
Stanford NLP Python library for many human languages
Large Language Model Text Generation Inference
Database system for building simpler and faster AI-powered applications
Qwen3 is the large language model series developed by Qwen team
Ongoing research training transformer models at scale
Powerful AI language model (MoE) optimized for efficiency/performance
Operating LLMs in production
Trained models & code to predict toxic comments
The no-nonsense RAG chunking library
Harness LLMs with Multi-Agent Programming
Underthesea - Vietnamese NLP Toolkit
An open-source, low-code machine learning library in Python
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Sparsity-aware deep learning inference runtime for CPUs
A modular graph-based Retrieval-Augmented Generation (RAG) system
Phi-3.5 for Mac: Locally-run Vision and Language Models
LLM-based data scientist, AI-native data application
TextWorld is a sandbox learning environment for the training
Superlinked is a Python framework for AI Engineers
An easy-to-use LLM quantization package with user-friendly APIs
Adding guardrails to large language models
The official repo of Qwen chat & pretrained large language model