22 results found
- Filter by Primary Language:
- Python (13)
- C (4)
- C++ (1)
- Jupyter Notebook (1)
- MATLAB (1)
- Java (1)
- C# (1)
Automagically synchronize subtitles with video.
Created
2019-02-24
369 commits to master branch, last one 2 months ago
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Created
2022-11-24
4,267 commits to main branch, last one a day ago
faster_whisper GUI with PySide6
Created
2023-07-18
107 commits to main branch, last one 2 days ago
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Created
2017-04-18
115 commits to master branch, last one 2 years ago
An audio/acoustic activity detection and audio segmentation tool
Created
2015-09-17
388 commits to master branch, last one about a year ago
Voice Activity Detection based on Deep Learning & TensorFlow
Created
2019-12-11
37 commits to master branch, last one 2 years ago
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processi...
Tags: asr, vad, icassp, denoising, icassp2023, icassp2024, face-recognition, image-generation, keyword-spotting, music-generation, domain-adaptation, generative-models, language-modeling, signal-processing, signal-restoration, speech-recognition, multimodal-learning, semantic-segmentation, self-supervised-learning, spoken-language-understanding
Created
2023-08-01
560 commits to main branch, last one a day ago
Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts
Created
2023-02-14
65 commits to main branch, last one 3 months ago
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines
Created
2023-12-16
78 commits to main branch, last one 3 months ago
Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
Created
2019-11-28
212 commits to main branch, last one 3 months ago
On-device voice activity detection (VAD) powered by deep learning
Created
2021-09-14
257 commits to main branch, last one about a month ago
PyTorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021
Created
2021-02-22
10 commits to main branch, last one 2 years ago
Enumerate user mode shared memory mappings on Windows.
Created
2020-01-24
91 commits to master branch, last one 3 years ago
A TensorFlow re-implementation of CVPR 2019 "Object-centric Auto-Encoders and Dummy Anomalies for Abnormal Event Detection in Video"
Created
2019-08-05
70 commits to master branch, last one 2 years ago
Extraction of the APM-related code from WebRTC, including AEC/NS/AGC/VAD; also includes mp3/aac encoders and SoundTouch
Created
2018-11-26
25 commits to master branch, last one 11 months ago
A Python library for voice activity detection (VAD) for speech/non-speech segmentation.
Created
2021-01-12
68 commits to master branch, last one 2 years ago
Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)
Created
2023-03-05
29 commits to main branch, last one 6 months ago
A voice activity detection (VAD) library for Unity.
Created
2023-06-28
57 commits to main branch, last one 5 months ago
PyTorch implementation of automatic speech recognition models.
Created
2020-11-28
18 commits to main branch, last one 3 years ago
HadreamAssistant, your smart-home / custom voice assistant, supporting Raspberry Pi/Linux
Created
2021-11-14
43 commits to main branch, last one 9 months ago
A FreeSWITCH module that performs VAD and ASR using the iFLYTEK WebSocket API.
Created
2022-07-01
7 commits to main branch, last one about a year ago
A local and uncensored AI entity.
Created
2024-02-02
86 commits to main branch, last one 2 months ago