Organizations

@AsteroidUI @openloft @makllama

Starred repositories

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Python 5,435 261 Updated Aug 6, 2024

WebAssembly binding for llama.cpp - Enabling in-browser LLM inference

C++ 262 6 Updated Aug 3, 2024

Generate train.jsonl and valid.jsonl files to use for fine-tuning Mistral and other LLMs.

PHP 64 10 Updated Feb 5, 2024

Mac compatible Ollama Voice

Python 372 36 Updated Mar 26, 2024

An actively maintained, feature-rich, and performance-oriented neofetch-like system information tool.

C 8,729 362 Updated Aug 6, 2024

Creates Helm chart from Kubernetes yaml

Go 1,283 124 Updated Jul 18, 2024

LLama.cpp golang bindings

C++ 639 78 Updated Jul 30, 2024

The missing glue to put together large Kubernetes deployments, composed of multiple smaller parts (Helm/Kustomize/...) in a manageable and unified way.

Go 554 35 Updated Aug 5, 2024

Fast and memory-efficient exact attention

Python 12,791 1,149 Updated Aug 6, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 939 85 Updated Aug 6, 2024

SGLang is yet another fast serving framework for large language models and vision language models.

Python 3,921 239 Updated Aug 6, 2024

A fast inference library for running LLMs locally on modern consumer-class GPUs

Python 3,331 248 Updated Jul 30, 2024

The eBPF tool and systems inspection framework for Kubernetes, containers and Linux hosts.

C 2,104 213 Updated Aug 6, 2024

A next-generation crawling and spidering framework.

Go 10,631 559 Updated Aug 5, 2024

Self-hosted AI coding assistant

Rust 20,288 923 Updated Aug 6, 2024

📌✨ A collection of awesome dynamic pinned gists for GitHub

1,860 85 Updated Jun 15, 2024

LLM inference in C/C++

C++ 2 Updated Jul 29, 2024

Mirror of CMake upstream repository

C 6,681 2,505 Updated Aug 6, 2024

Automatic SRE Superpowers within your Kubernetes cluster

Go 283 80 Updated Aug 5, 2024

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 13,842 913 Updated Aug 5, 2024

Visualize streams of multimodal data. Fast, easy to use, and simple to integrate. Built in Rust using egui.

Rust 5,906 266 Updated Aug 6, 2024

LLM training in simple, raw C/CUDA

Cuda 22,495 2,501 Updated Aug 4, 2024

egui: an easy-to-use immediate mode GUI in Rust that runs on both web and native

Rust 21,075 1,524 Updated Aug 6, 2024

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!

Python 2,595 179 Updated Aug 6, 2024

Local model support for Microsoft's graphrag using ollama (llama3, mistral, gemma2, phi3) - LLM & Embedding extraction

Python 409 47 Updated Aug 5, 2024

GraphRAG using Local LLMs - Features robust API and multiple apps for Indexing/Prompt Tuning/Query/Chat/Visualizing/Etc. This is meant to be the ultimate GraphRAG/KG local LLM app.

Python 1,129 121 Updated Aug 6, 2024

Intelligence for Kubernetes. World's most promising Kubernetes Visualization Tool for Developer and Platform Engineering teams.

Go 375 40 Updated Aug 3, 2024

One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure

Python 1,374 202 Updated Aug 5, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 14,087 1,230 Updated Aug 6, 2024

The API traffic analyzer for Kubernetes providing real-time K8s protocol-level visibility, capturing and monitoring all traffic and payloads going in, out and across containers, pods, nodes and clu…

Go 10,808 457 Updated Aug 2, 2024