22 results found

265
6.6k
mit
75
Automagically synchronize subtitles with video.
Created 2019-02-24
369 commits to master branch, last one 2 months ago
459
4.0k
other
49
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Created 2022-11-24
4,267 commits to main branch, last one a day ago
faster_whisper GUI with PySide6
Created 2023-07-18
107 commits to main branch, last one 2 days ago
229
825
unknown
45
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Created 2017-04-18
115 commits to master branch, last one 2 years ago
92
718
mit
28
An audio/acoustic activity detection and audio segmentation tool
Created 2015-09-17
388 commits to master branch, last one about a year ago
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processi...
Created 2023-08-01
560 commits to main branch, last one a day ago
Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts
Created 2023-02-14
65 commits to main branch, last one 3 months ago
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines
Created 2023-12-16
78 commits to main branch, last one 3 months ago
Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
Created 2019-11-28
212 commits to main branch, last one 3 months ago
10
143
apache-2.0
11
On-device voice activity detection (VAD) powered by deep learning
Created 2021-09-14
257 commits to main branch, last one about a month ago
PyTorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021
Created 2021-02-22
10 commits to main branch, last one 2 years ago
22
112
mit
12
Enumerate user mode shared memory mappings on Windows.
Created 2020-01-24
91 commits to master branch, last one 3 years ago
A TensorFlow re-implementation of CVPR 2019 "Object-centric Auto-Encoders and Dummy Anomalies for Abnormal Event Detection in Video"
Created 2019-08-05
70 commits to master branch, last one 2 years ago
44
92
unknown
4
Extraction of the APM-related code from WebRTC, including AEC/NS/AGC/VAD, plus mp3/aac encoders and SoundTouch
Created 2018-11-26
25 commits to master branch, last one 11 months ago
A python library for voice activity detection (VAD) for speech/non-speech segmentation.
Created 2021-01-12
68 commits to master branch, last one 2 years ago
Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)
Created 2023-03-05
29 commits to main branch, last one 6 months ago
A voice activity detection (VAD) library for Unity.
Created 2023-06-28
57 commits to main branch, last one 5 months ago
PyTorch implementation of automatic speech recognition models.
Created 2020-11-28
18 commits to main branch, last one 3 years ago
HadreamAssistant, your smart-home/custom voice assistant, supporting Raspberry Pi/Linux
Created 2021-11-14
43 commits to main branch, last one 9 months ago
This is a FreeSwitch module that can do VAD and ASR with the IFLYTEK websocket API.
Created 2022-07-01
7 commits to main branch, last one about a year ago
6
33
agpl-3.0
5
A local and uncensored AI entity.
Created 2024-02-02
86 commits to main branch, last one 2 months ago