🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Updated Aug 16, 2024 - Python
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
End-to-End Speech Processing Toolkit
Easy-to-use Speech Toolkit including Self-Supervised Learning models, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation, and Keyword Spotting. Won the NAACL 2022 Best Demo Award.
🧠 Leon is your open-source personal assistant.
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
so-vits-svc fork with realtime support, improved interface and more features.
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
DeepMind's Tacotron-2 TensorFlow implementation
eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents.
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)
MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure Java
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
WaveRNN Vocoder + TTS
Foundational model for human-like, expressive TTS
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.