Stars
AirLLM 70B inference with single 4GB GPU
A batched offline inference oriented version of segment-anything
Global CoT Analysis: Initial attempts to uncover patterns across many chains of thought
Introduction to Machine Learning Systems
Refine high-quality datasets and visual AI models
Model interpretability and understanding for PyTorch
Learn and train a masked diffusion language model (MDLM)
Interactive web visualisation for handwritting detection using a simple neural network
Toolkit for linearizing PDFs for LLM datasets/training
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
JailbreakBench: An Open Robustness Benchmark for Jailbreaking Language Models [NeurIPS 2024 Datasets and Benchmarks Track]
Pretrained ConvNets for pytorch: NASNet, ResNeXt, ResNet, InceptionV4, InceptionResnetV2, Xception, DPN, etc.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
DigitalPlat FreeDomain: Free Domain For Everyone
This repository collects all relevant resources about interpretability in LLMs
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing internal workings of Transformer-based language models. *Check out demo at* https://huggingface.co/spaces/facebook/l…
[NeurIPS 2024] How do Large Language Models Handle Multilingualism?



