Statistics for topic voice-activity-detection
RepositoryStats tracks 529,266 Github repositories, of these 33 are tagged with the voice-activity-detection topic. The most common primary language for repositories using this topic is Python (19).
Stargazers over time for topic voice-activity-detection
Most starred repositories for topic voice-activity-detection (view more)
Trending repositories for topic voice-activity-detection (view more)
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Speech-to-Text based on silero-vad + whisper.cpp (GGUF TTS) for ROS 2
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Runtime Audio Importer plugin for Unreal Engine. Importing audio of various formats at runtime.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Speech-to-Text based on silero-vad + whisper.cpp (GGUF TTS) for ROS 2
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Runtime Audio Importer plugin for Unreal Engine. Importing audio of various formats at runtime.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Speech-to-Text based on silero-vad + whisper.cpp (GGUF TTS) for ROS 2
ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine
ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Introduction to Speech Processing
Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.