27 results found Sort:

790
7.4k
other
69
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Created 2022-11-24
4,732 commits to main branch, last one 17 hours ago
283
6.9k
mit
77
Automagically synchronize subtitles with video.
Created 2019-02-24
377 commits to master branch, last one 24 days ago
443
4.6k
mit
48
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Created 2020-11-23
447 commits to master branch, last one 26 days ago
faster_whisper GUI with PySide6
Created 2023-07-18
112 commits to main branch, last one 15 days ago
162
1.1k
apache-2.0
32
Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, Lich...
Created 2022-09-04
197 commits to master branch, last one 3 months ago
235
845
unknown
44
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Created 2017-04-18
115 commits to master branch, last one 3 years ago
96
751
mit
26
An audio/acoustic activity detection and audio segmentation tool
Created 2015-09-17
438 commits to main branch, last one 10 days ago
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processi...
Created 2023-08-01
975 commits to main branch, last one 24 hours ago
Runtime Audio Importer plugin for Unreal Engine. Importing audio of various formats at runtime.
Created 2020-12-10
420 commits to main branch, last one 14 days ago
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine
Created 2023-12-16
81 commits to main branch, last one 3 months ago
Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts
Created 2023-02-14
68 commits to main branch, last one about a month ago
Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
Created 2019-11-28
220 commits to main branch, last one 21 days ago
11
183
apache-2.0
12
On-device voice activity detection (VAD) powered by deep learning
Created 2021-09-14
280 commits to main branch, last one about a month ago
Python bindings of WebRTC Audio Processing
Created 2017-02-24
24 commits to master branch, last one 3 months ago
Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021
Created 2021-02-22
10 commits to main branch, last one 3 years ago
22
117
mit
12
Enumerate user mode shared memory mappings on Windows.
Created 2020-01-24
91 commits to master branch, last one 3 years ago
An Tensorflow Re-Implement of CVPR 2019 "Object-centric Auto-Encoders and Dummy Anomalies for Abnormal Event Detection in Video"
Created 2019-08-05
70 commits to master branch, last one 2 years ago
45
97
unknown
4
webrtc中apm相关代码的提取,包括AEC/NS/AGC/VAD ,另外还包括mp3/aac编码器、SoundTouch
Created 2018-11-26
25 commits to master branch, last one about a year ago
A python library for voice activity detection (VAD) for speech/non-speech segmentation.
Created 2021-01-12
68 commits to master branch, last one 3 years ago
Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)
Created 2023-03-05
29 commits to main branch, last one about a year ago
14
54
agpl-3.0
5
A local and uncensored AI entity.
Created 2024-02-02
88 commits to main branch, last one 2 months ago
Speech-to-Text based on SileroVAD + whisper.cpp (GGML Whisper) for ROS 2
Created 2023-05-01
185 commits to main branch, last one 5 days ago
A voice activity detection (VAD) library for Unity.
Created 2023-06-28
57 commits to main branch, last one about a year ago
HadreamAssistant, 你的智能家居/自定义语音助手, 支持树莓派/Linux
Created 2021-11-14
45 commits to main branch, last one 5 months ago
PyTorch implementation of automatic speech recognition models.
Created 2020-11-28
18 commits to main branch, last one 3 years ago
This is FreeSwitch module that can do VAD and ASR with IFLYTEK websocket api.
Created 2022-07-01
7 commits to main branch, last one 2 years ago