27 results found Sort:
- Filter by Primary Language:
- Python (14)
- C++ (5)
- C (4)
- MATLAB (1)
- Java (1)
- C# (1)
- Jupyter Notebook (1)
- +
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Created
2022-11-24
4,732 commits to main branch, last one 17 hours ago
Automagically synchronize subtitles with video.
Created
2019-02-24
377 commits to master branch, last one 24 days ago
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Created
2020-11-23
447 commits to master branch, last one 26 days ago
faster_whisper GUI with PySide6
Created
2023-07-18
112 commits to main branch, last one 15 days ago
Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, Lich...
Created
2022-09-04
197 commits to master branch, last one 3 months ago
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Created
2017-04-18
115 commits to master branch, last one 3 years ago
An audio/acoustic activity detection and audio segmentation tool
Created
2015-09-17
438 commits to main branch, last one 10 days ago
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processi...
asr
vad
icassp
denoising
icassp2023
icassp2024
face-recognition
image-generation
keyword-spotting
music-generation
domain-adaptation
generative-models
language-modeling
signal-processing
signal-restoration
speech-recognition
multimodal-learning
semantic-segmentation
self-supervised-learning
spoken-language-understanding
Created
2023-08-01
975 commits to main branch, last one 24 hours ago
Voice Activity Detection based on Deep Learning & TensorFlow
Created
2019-12-11
37 commits to master branch, last one 3 years ago
Runtime Audio Importer plugin for Unreal Engine. Importing audio of various formats at runtime.
Created
2020-12-10
420 commits to main branch, last one 14 days ago
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine
Created
2023-12-16
81 commits to main branch, last one 3 months ago
Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts
Created
2023-02-14
68 commits to main branch, last one about a month ago
Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
Created
2019-11-28
220 commits to main branch, last one 21 days ago
On-device voice activity detection (VAD) powered by deep learning
Created
2021-09-14
280 commits to main branch, last one about a month ago
Python bindings of WebRTC Audio Processing
Created
2017-02-24
24 commits to master branch, last one 3 months ago
Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021
Created
2021-02-22
10 commits to main branch, last one 3 years ago
Enumerate user mode shared memory mappings on Windows.
Created
2020-01-24
91 commits to master branch, last one 3 years ago
An Tensorflow Re-Implement of CVPR 2019 "Object-centric Auto-Encoders and Dummy Anomalies for Abnormal Event Detection in Video"
Created
2019-08-05
70 commits to master branch, last one 2 years ago
webrtc中apm相关代码的提取,包括AEC/NS/AGC/VAD ,另外还包括mp3/aac编码器、SoundTouch
Created
2018-11-26
25 commits to master branch, last one about a year ago
A python library for voice activity detection (VAD) for speech/non-speech segmentation.
Created
2021-01-12
68 commits to master branch, last one 3 years ago
Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)
Created
2023-03-05
29 commits to main branch, last one about a year ago
A local and uncensored AI entity.
Created
2024-02-02
88 commits to main branch, last one 2 months ago
Speech-to-Text based on SileroVAD + whisper.cpp (GGML Whisper) for ROS 2
Created
2023-05-01
185 commits to main branch, last one 5 days ago
A voice activity detection (VAD) library for Unity.
Created
2023-06-28
57 commits to main branch, last one about a year ago
HadreamAssistant, 你的智能家居/自定义语音助手, 支持树莓派/Linux
Created
2021-11-14
45 commits to main branch, last one 5 months ago
PyTorch implementation of automatic speech recognition models.
Created
2020-11-28
18 commits to main branch, last one 3 years ago
This is FreeSwitch module that can do VAD and ASR with IFLYTEK websocket api.
Created
2022-07-01
7 commits to main branch, last one 2 years ago