Skip to content
View cryer's full-sized avatar
✡️
say...
✡️
say...

Block or report cryer

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Pure C inference of Mistral Voxtral Realtime 4B speech to text model

C 1,387 83 Updated Feb 15, 2026

Examples for using ONNX Runtime for machine learning inferencing.

C++ 1,609 402 Updated Feb 20, 2026

A modern, C++-native, test framework for unit-tests, TDD and BDD - using C++14, C++17 and later (C++11 support is in v2.x branch, and C++03 on the Catch1.x branch)

C++ 20,200 3,200 Updated Feb 18, 2026

Open-Source Frontier Voice AI

Python 23,388 2,574 Updated Feb 7, 2026

The Minimalistic x86/x64 API Hooking Library for Windows

C 5,556 1,035 Updated Nov 3, 2025

MoCha: End-to-End Video Character Replacement without Structural Guidance

Python 641 53 Updated Jan 14, 2026

💫 Toolkit to help you get started with Spec-Driven Development

Python 71,222 6,148 Updated Feb 21, 2026

A contact solver for physics-based simulations involving 👚 shells, 🪵 solids and 🪢 rods.

Python 1,579 92 Updated Feb 1, 2026

NCNN implementation of Real-ESRGAN. Real-ESRGAN aims at developing Practical Algorithms for General Image Restoration.

C 2,005 252 Updated May 10, 2024

Vim plugin for LLM-assisted code/text completion

Vim Script 1,881 92 Updated Jan 31, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 70,913 13,609 Updated Feb 22, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 23,653 4,517 Updated Feb 22, 2026

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 18,878 2,330 Updated Dec 2, 2025

利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.

Python 49,573 7,034 Updated Dec 14, 2025

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 10,424 1,175 Updated Feb 20, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 67,434 8,209 Updated Feb 20, 2026

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.

Python 52,606 4,367 Updated Feb 19, 2026

Text-audio foundation model from Boson AI

Python 7,920 604 Updated Jan 18, 2026

Official inference repo for FLUX.1 models

Python 25,220 1,854 Updated Jul 31, 2025

YOLOv8 TensorRT C++ Implementation

C++ 711 83 Updated Feb 9, 2025

The minimal opencv for Android, iOS, ARM Linux, Windows, Linux, MacOS, HarmonyOS, WebAssembly, watchOS, tvOS, visionOS

C++ 3,213 443 Updated Jan 2, 2026

LLM inference in C/C++

C++ 95,618 15,026 Updated Feb 22, 2026

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 55,187 6,026 Updated Feb 9, 2026

PyTorch android examples of usage in applications

Java 1,556 623 Updated Aug 27, 2024

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 103,863 11,867 Updated Feb 22, 2026

An intuitive GUI for GLIGEN that uses ComfyUI in the backend

JavaScript 2,051 187 Updated Feb 28, 2024

Chat凉宫春日, An open sourced Role-Playing chatbot Cheng Li, Ziang Leng, and others.

Jupyter Notebook 2,066 183 Updated Aug 13, 2024

Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person

5,968 451 Updated Jul 26, 2024

Draw a mockup and generate html for it

TypeScript 13,606 1,638 Updated Jul 26, 2025

Machine learning, in numpy

Python 16,276 3,781 Updated Oct 29, 2023
Next