27 results found Sort:

748
7.0k
other
65
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Created 2022-11-24
4,702 commits to main branch, last one a day ago
280
6.8k
mit
77
Automagically synchronize subtitles with video.
Created 2019-02-24
375 commits to master branch, last one about a month ago
429
4.4k
mit
49
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Created 2020-11-23
436 commits to master branch, last one 8 days ago
faster_whisper GUI with PySide6
Created 2023-07-18
110 commits to main branch, last one 2 months ago
159
1.1k
apache-2.0
32
Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, Lich...
Created 2022-09-04
197 commits to master branch, last one 2 months ago
235
842
unknown
44
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Created 2017-04-18
115 commits to master branch, last one 3 years ago
96
745
mit
27
An audio/acoustic activity detection and audio segmentation tool
Created 2015-09-17
435 commits to master branch, last one 20 days ago
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processi...
Created 2023-08-01
924 commits to main branch, last one a day ago
Runtime Audio Importer plugin for Unreal Engine. Importing audio of various formats at runtime.
Created 2020-12-10
418 commits to main branch, last one 4 days ago
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine
Created 2023-12-16
81 commits to main branch, last one 2 months ago
Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts
Created 2023-02-14
68 commits to main branch, last one 9 days ago
Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
Created 2019-11-28
212 commits to main branch, last one 9 months ago
11
179
apache-2.0
11
On-device voice activity detection (VAD) powered by deep learning
Created 2021-09-14
280 commits to main branch, last one 6 days ago
Python bindings of WebRTC Audio Processing
Created 2017-02-24
24 commits to master branch, last one 2 months ago
Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021
Created 2021-02-22
10 commits to main branch, last one 3 years ago
22
115
mit
12
Enumerate user mode shared memory mappings on Windows.
Created 2020-01-24
91 commits to master branch, last one 3 years ago
An Tensorflow Re-Implement of CVPR 2019 "Object-centric Auto-Encoders and Dummy Anomalies for Abnormal Event Detection in Video"
Created 2019-08-05
70 commits to master branch, last one 2 years ago
45
97
unknown
4
webrtc中apm相关代码的提取,包括AEC/NS/AGC/VAD ,另外还包括mp3/aac编码器、SoundTouch
Created 2018-11-26
25 commits to master branch, last one about a year ago
A python library for voice activity detection (VAD) for speech/non-speech segmentation.
Created 2021-01-12
68 commits to master branch, last one 3 years ago
Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)
Created 2023-03-05
29 commits to main branch, last one 12 months ago
A voice activity detection (VAD) library for Unity.
Created 2023-06-28
57 commits to main branch, last one 11 months ago
Speech-to-Text based on SileroVAD + whisper.cpp (GGML Whisper) for ROS 2
Created 2023-05-01
174 commits to main branch, last one 8 days ago
12
50
agpl-3.0
5
A local and uncensored AI entity.
Created 2024-02-02
88 commits to main branch, last one about a month ago
HadreamAssistant, 你的智能家居/自定义语音助手, 支持树莓派/Linux
Created 2021-11-14
45 commits to main branch, last one 4 months ago
PyTorch implementation of automatic speech recognition models.
Created 2020-11-28
18 commits to main branch, last one 3 years ago
This is FreeSwitch module that can do VAD and ASR with IFLYTEK websocket api.
Created 2022-07-01
7 commits to main branch, last one 2 years ago