-
Moises.ai
- Salt Lake City, UT
- geraldoramos.com
- @geraldoramos
Highlights
- Pro
Block or Report
Block or report geraldoramos
Contact GitHub support about this userβs behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
A massively parallel, high-level programming language
An open-source alternative to Ngrok, designed to serve production traffic and be simple to host (particularly on Kubernetes)
some static binaries for linux, maybe useful for bootstrapping, no big deal
OBS plugin for local speech recognition and captioning using AI
A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!
Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"
This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training and extraction of audio embeddings.
Code for paper "Noise-aware Speech Enhancement using Diffusion Probabilistic Model"
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
π SubPlayer is an online subtitle editor
EmotiVoice π: a Multi-Voice and Prompt-Controlled TTS Engine
A series of large language models trained from scratch by developers @01-ai
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.
OpenTofu lets you declaratively manage your cloud infrastructure.
[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.
A multi-voice TTS system trained with an emphasis on quality
A programming language for the cloud βοΈ A unified programming model, combining infrastructure and runtime code into one language β‘
The OpenTF Manifesto expresses concern over HashiCorp's switch of the Terraform license from open-source to the Business Source License (BSL) and calls for the tool's return to a truly open-source β¦
A modern and transparent way to use Windows VST2, VST3 and CLAP plugins on Linux
π The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming