yeahdongcn

R0CKSTAR yeahdongcn

AI Infra | Cloud Native | Virtualization | macOS/iOS

282 followers · 724 following

@MooreThreads
Beijing, China
00:28 (UTC +08:00)
in/yeahdongcn

Achievements

x3 x2

Achievements

x3 x2

Highlights

Developer Program Member

Organizations

Block or Report

Block or report yeahdongcn

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Lists (1)

Sort

Read Later

2 repositories

Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

exo-explore / exo

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Python 5,435 261 Updated Aug 6, 2024

ngxson / wllama

WebAssembly binding for llama.cpp - Enabling in-browser LLM inference

C++ 262 6 Updated Aug 3, 2024

apeatling / simple-guide-to-mlx-finetuning

Generate train.jsonl and valid.jsonl files to use for fine-tuning Mistral and other LLMs.

PHP 64 10 Updated Feb 5, 2024

apeatling / ollama-voice-mac

Mac compatible Ollama Voice

Python 372 36 Updated Mar 26, 2024

fastfetch-cli / fastfetch

An actively maintained, feature-rich and performance oriented, neofetch like system information tool.

C 8,729 362 Updated Aug 6, 2024

arttor / helmify

Creates Helm chart from Kubernetes yaml

Go 1,283 124 Updated Jul 18, 2024

go-skynet / go-llama.cpp

LLama.cpp golang bindings

C++ 639 78 Updated Jul 30, 2024

kluctl / kluctl

The missing glue to put together large Kubernetes deployments, composed of multiple smaller parts (Helm/Kustomize/...) in a manageable and unified way.

Go 554 35 Updated Aug 5, 2024

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 12,791 1,149 Updated Aug 6, 2024

flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda 939 85 Updated Aug 6, 2024

sgl-project / sglang

SGLang is yet another fast serving framework for large language models and vision language models.

Python 3,921 239 Updated Aug 6, 2024

turboderp / exllamav2

A fast inference library for running LLMs locally on modern consumer-class GPUs

Python 3,331 248 Updated Jul 30, 2024

inspektor-gadget / inspektor-gadget

The eBPF tool and systems inspection framework for Kubernetes, containers and Linux hosts.

C 2,104 213 Updated Aug 6, 2024

projectdiscovery / katana

A next-generation crawling and spidering framework.

Go 10,631 559 Updated Aug 5, 2024

TabbyML / tabby

Self-hosted AI coding assistant

Rust 20,288 923 Updated Aug 6, 2024

matchai / awesome-pinned-gists

📌✨ A collection of awesome dynamic pinned gists for GitHub

1,860 85 Updated Jun 15, 2024

makllama / llama.cpp

Forked from ggerganov/llama.cpp

LLM inference in C/C++

C++ 2 Updated Jul 29, 2024

Kitware / CMake

Mirror of CMake upstream repository

C 6,681 2,505 Updated Aug 6, 2024

k8sgpt-ai / k8sgpt-operator

Automatic SRE Superpowers within your Kubernetes cluster

Go 283 80 Updated Aug 5, 2024

unslothai / unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 13,842 913 Updated Aug 5, 2024

rerun-io / rerun

Visualize streams of multimodal data. Fast, easy to use, and simple to integrate. Built in Rust using egui.

Rust 5,906 266 Updated Aug 6, 2024

karpathy / llm.c

LLM training in simple, raw C/CUDA

Cuda 22,495 2,501 Updated Aug 4, 2024

emilk / egui

egui: an easy-to-use immediate mode GUI in Rust that runs on both web and native

Rust 21,075 1,524 Updated Aug 6, 2024

lm-sys / RouteLLM

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!

Python 2,595 179 Updated Aug 6, 2024

TheAiSingularity / graphrag-local-ollama

Local models support for Microsoft's graphrag using ollama (llama3, mistral, gemma2 phi3)- LLM & Embedding extraction

Python 409 47 Updated Aug 5, 2024

severian42 / GraphRAG-Local-UI

GraphRAG using Local LLMs - Features robust API and multiple apps for Indexing/Prompt Tuning/Query/Chat/Visualizing/Etc. This is meant to be the ultimate GraphRAG/KG local LLM app.

Python 1,129 121 Updated Aug 6, 2024

KusionStack / karpor

Intelligence for Kubernetes. World's most promising Kubernetes Visualization Tool for Developer and Platform Engineering teams.

Go 375 40 Updated Aug 3, 2024

Azure-Samples / graphrag-accelerator

One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure

Python 1,374 202 Updated Aug 5, 2024

microsoft / graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 14,087 1,230 Updated Aug 6, 2024

kubeshark / kubeshark

The API traffic analyzer for Kubernetes providing real-time K8s protocol-level visibility, capturing and monitoring all traffic and payloads going in, out and across containers, pods, nodes and clu…

Go 10,808 457 Updated Aug 2, 2024

R0CKSTAR yeahdongcn

Highlights

Organizations

Block or report yeahdongcn

Lists (1)

Read Later

Starred repositories

llm

slurm

Unity

Operating system

macOS