111 results found Sort:

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Created 2016-03-07
2,403 commits to develop branch, last one 3 days ago
Reading list for research topics in multimodal machine learning
Created 2019-05-27
435 commits to master branch, last one 3 months ago
401
4.1k
mit
51
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Created 2020-11-23
417 commits to master branch, last one 4 days ago
Foundation Architecture for (M)LLMs
Created 2022-11-17
123 commits to main branch, last one 5 months ago
499
2.3k
other
96
WaveNet vocoder
Created 2017-12-27
261 commits to master branch, last one 3 years ago
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Created 2017-10-31
221 commits to master branch, last one about a year ago
224
1.6k
apache-2.0
78
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Created 2019-01-19
122 commits to master branch, last one 7 days ago
158
1.4k
apache-2.0
21
Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.
Created 2021-08-05
3,126 commits to MassiveScaleToucan branch, last one 11 hours ago
AI powered speech denoising and enhancement
Created 2023-11-15
10 commits to main branch, last one 3 months ago
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Created 2019-01-31
139 commits to master branch, last one 2 years ago
Open source audio annotation tool for humans
Created 2019-10-03
256 commits to main branch, last one 19 days ago
132
1.0k
mit
16
General Speech Restoration
Created 2021-09-06
92 commits to main branch, last one 6 months ago
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Created 2024-06-04
25 commits to main branch, last one about a month ago
245
744
apache-2.0
24
Speech Algorithms
Created 2020-05-11
137 commits to master branch, last one 3 months ago
A neural network for end-to-end speech denoising
Created 2017-06-19
3 commits to master branch, last one 7 years ago
160
567
mit
8
Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.
Created 2020-05-11
101 commits to master branch, last one 2 years ago
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Created 2021-07-19
22 commits to main branch, last one 2 years ago
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
Created 2020-12-18
104 commits to main branch, last one about a year ago
43
510
mit
18
Speech, Language, Audio, Music Processing with Large Language Model
Created 2023-10-23
638 commits to main branch, last one 9 hours ago
语音方向实验室/公司/资源/实习等,欢迎推荐或自荐
Created 2021-11-04
105 commits to main branch, last one 4 days ago
[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning
Created 2021-03-05
1,258 commits to main branch, last one 8 months ago
100
473
mit
67
Speech recognition toolkit for the arduino
Created 2012-08-12
133 commits to 4.x-workingBranch branch, last one 3 years ago
79
447
bsd-3-clause
10
:sound: spafe: Simplified Python Audio Features Extraction
Created 2019-09-16
373 commits to master branch, last one 3 months ago
87
439
mit
22
Problem Agnostic Speech Encoder
Created 2018-11-14
899 commits to master branch, last one 4 years ago
79
438
other
23
A python wrapper for Speech Signal Processing Toolkit (SPTK).
Created 2015-08-30
359 commits to master branch, last one 2 months ago
This repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly invited to pull requests.
Created 2020-06-16
80 commits to master branch, last one 3 years ago
47
431
gpl-3.0
15
Novoic's audio feature extraction library
Created 2020-05-15
12 commits to master branch, last one 4 years ago