🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds...
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in rese...
🤗 The largest hub of ready-to-use datasets for ML models with fast, eas...
SoftVC VITS Singing Voice Conversion
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Dif...
kaldi-asr/kaldi is the official location of the Kaldi project.
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOT...
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Di...
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion f...
Officially maintained, supported by PaddlePaddle, including CV, NLP, Spe...
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
💬 Speech recognition for your site
ModelScope: bring the notion of Model-as-a-Service to life.