Statistics for topic speech-synthesis
RepositoryStats tracks 518,991 Github repositories, of these 261 are tagged with the speech-synthesis topic. The most common primary language for repositories using this topic is Python (170). Other languages include: Jupyter Notebook (27), C++ (12), JavaScript (12)
Stargazers over time for topic speech-synthesis
Most starred repositories for topic speech-synthesis (view more)
Trending repositories for topic speech-synthesis (view more)
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
A talking LLM that runs on your own computer without needing the internet.
Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
A talking LLM that runs on your own computer without needing the internet.
Simple Python script to interact with the TikTok TTS Voices.
Easy-to-use speech toolset. Written in TypeScript. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.
FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
A talking LLM that runs on your own computer without needing the internet.
Easy-to-use speech toolset. Written in TypeScript. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.
FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3
🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code inc...
Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.