Statistics for topic text-to-speech
RepositoryStats tracks 639,262 Github repositories, of these 447 are tagged with the text-to-speech topic. The most common primary language for repositories using this topic is Python (246). Other languages include: JavaScript (29), Jupyter Notebook (29), TypeScript (29), C++ (17), C# (16), Java (11)
Stargazers over time for topic text-to-speech
Most starred repositories for topic text-to-speech (view more)
Trending repositories for topic text-to-speech (view more)
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech and video generation APIs.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech and video generation APIs.
Unified AI desktop suite — offline and open-source. LLMs, image generation, voice, and chat in one app. Stable Diffusion, Mistral, Whisper, SpeechT5, OpenVoice
A text-to-speech (TTS) and Speech-to-Speech (STS) library built on Apple's MLX framework, providing efficient speech synthesis on Apple Silicon.
Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech and video generation APIs.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech and video generation APIs.
Unified AI desktop suite — offline and open-source. LLMs, image generation, voice, and chat in one app. Stable Diffusion, Mistral, Whisper, SpeechT5, OpenVoice
Discover the world of artificial intelligence and interact with your favorite characters without needing to learn tons of information. Bring your Waifu to life with Soul of Waifu!
A text-to-speech (TTS) and Speech-to-Speech (STS) library built on Apple's MLX framework, providing efficient speech synthesis on Apple Silicon.
Edge TTS is a Node or Bun package that allows access to the online text-to-speech service used by Microsoft Edge without the need for Microsoft Edge, Windows, or an API key.
Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech and video generation APIs.
A cutting-edge Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech and video generation APIs.
Discover the world of artificial intelligence and interact with your favorite characters without needing to learn tons of information. Bring your Waifu to life with Soul of Waifu!
A text-to-speech (TTS) and Speech-to-Speech (STS) library built on Apple's MLX framework, providing efficient speech synthesis on Apple Silicon.
A cutting-edge Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isola...
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Instant voice cloning by MIT and MyShell. Audio foundation model.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.