Statistics for topic audio-processing
RepositoryStats tracks 641,724 GitHub repositories, of which 314 are tagged with the audio-processing topic. The most common primary language for repositories using this topic is Python (118). Other languages include C++ (70), C (23), Jupyter Notebook (18), and JavaScript (11).
Stargazers over time for topic audio-processing
Most starred repositories for topic audio-processing
Trending repositories for topic audio-processing
Cross-platform, customizable ML solutions for live and streaming media.
A Web and Native UI for ffmpeg-wasm: convert video, audio and images using the power of ffmpeg, directly from your web browser or from your computer.
Fast audio player, recorder, converter for Windows, Linux & Android
This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming architecture for fluid conversations with immediate responses and na...
A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.
A text-to-speech (TTS) and Speech-to-Speech (STS) library built on Apple's MLX framework, providing efficient speech synthesis on Apple Silicon.
InspireMusic: A Unified Framework for Music, Song, Audio Generation.
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
AI Productivity Tool - Free and open source, improve user productivity, protect privacy and data security. Provide efficient and convenient AI solutions, built-in local exclusive ChatGPT, Phi, DeepSee...
A C++ based, lightweight music and noise remover for YouTube and other internet media, using DeepFilterNet for audio enhancement.
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. An AI Foley master that adds vivid, synchronized sound effects to your silent videos 😝
[EMNLP2024 Demo], [ICASSP 2025] A user-friendly library for reproducible video moment retrieval and highlight detection. It also supports audio moment retrieval.
Easily train a good VC model with voice data <= 10 mins!