- Beijing, China (UTC +08:00)
- in/yeahdongcn
Starred repositories
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
WebAssembly binding for llama.cpp - Enabling in-browser LLM inference
Generate train.jsonl and valid.jsonl files to use for fine-tuning Mistral and other LLMs.
An actively maintained, feature-rich, and performance-oriented neofetch-like system information tool.
The missing glue to put together large Kubernetes deployments, composed of multiple smaller parts (Helm/Kustomize/...) in a manageable and unified way.
Fast and memory-efficient exact attention
FlashInfer: Kernel Library for LLM Serving
SGLang is yet another fast serving framework for large language models and vision language models.
A fast inference library for running LLMs locally on modern consumer-class GPUs
The eBPF tool and systems inspection framework for Kubernetes, containers and Linux hosts.
A next-generation crawling and spidering framework.
📌✨ A collection of awesome dynamic pinned gists for GitHub
makllama / llama.cpp
Forked from ggerganov/llama.cpp
LLM inference in C/C++
Automatic SRE Superpowers within your Kubernetes cluster
Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
Visualize streams of multimodal data. Fast, easy to use, and simple to integrate. Built in Rust using egui.
egui: an easy-to-use immediate mode GUI in Rust that runs on both web and native
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!
Local model support for Microsoft's graphrag using ollama (llama3, mistral, gemma2, phi3) - LLM & Embedding extraction
GraphRAG using Local LLMs - Features robust API and multiple apps for Indexing/Prompt Tuning/Query/Chat/Visualizing/Etc. This is meant to be the ultimate GraphRAG/KG local LLM app.
Intelligence for Kubernetes. World's most promising Kubernetes Visualization Tool for Developer and Platform Engineering teams.
One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure
A modular graph-based Retrieval-Augmented Generation (RAG) system
The API traffic analyzer for Kubernetes providing real-time K8s protocol-level visibility, capturing and monitoring all traffic and payloads going in, out and across containers, pods, nodes and clu…