119 results found Sort:
- Filter by Primary Language:
- Python (74)
- Jupyter Notebook (6)
- HTML (4)
- JavaScript (4)
- C (3)
- C++ (3)
- Java (1)
- Forth (1)
- MATLAB (1)
- C# (1)
- Shell (1)
- Svelte (1)
- Swift (1)
- TypeScript (1)
- +
A PyTorch-based Speech Toolkit
asr
audio
pytorch
huggingface
transformers
deep-learning
language-model
speech-to-text
speech-toolkit
audio-processing
speech-processing
speech-separation
speechrecognition
voice-recognition
speech-enhancement
speech-recognition
speaker-diarization
speaker-recognition
speaker-verification
spoken-language-understanding
Created
2020-04-28
10,486 commits to develop branch, last one 2 days ago
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Created
2016-03-07
2,400 commits to main branch, last one 6 months ago
Reading list for research topics in multimodal machine learning
Created
2019-05-27
435 commits to master branch, last one 9 months ago
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Created
2020-11-23
458 commits to master branch, last one 7 days ago
Foundation Architecture for (M)LLMs
Created
2022-11-17
123 commits to main branch, last one 11 months ago
WaveNet vocoder
Created
2017-12-27
261 commits to master branch, last one 4 years ago
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Created
2023-01-13
253 commits to master branch, last one about a month ago
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Created
2017-10-31
221 commits to master branch, last one about a year ago
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Created
2019-01-19
123 commits to master branch, last one 5 months ago
AI powered speech denoising and enhancement
Created
2023-11-15
13 commits to main branch, last one 3 months ago
Controllable and fast Text-to-Speech for over 7000 languages!
Created
2021-08-05
3,161 commits to MassiveScaleToucan branch, last one 4 months ago
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Created
2019-01-31
139 commits to master branch, last one 2 years ago
SincNet is a neural architecture for efficiently processing raw audio samples.
Created
2018-07-10
69 commits to master branch, last one 4 years ago
General Speech Restoration
Created
2021-09-06
99 commits to main branch, last one about a month ago
Open source audio annotation tool for humans
Created
2019-10-03
259 commits to main branch, last one about a month ago
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Created
2024-06-04
25 commits to main branch, last one 7 months ago
You can find the speech algorithms you want here
Created
2020-05-11
139 commits to master branch, last one 2 months ago
Speech, Language, Audio, Music Processing with Large Language Model
Created
2023-10-23
886 commits to main branch, last one 25 days ago
A neural network for end-to-end speech denoising
Created
2017-06-19
3 commits to master branch, last one 7 years ago
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
Created
2024-05-24
9 commits to main branch, last one 3 months ago
Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.
Created
2020-05-11
101 commits to master branch, last one 2 years ago
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Created
2021-07-19
22 commits to main branch, last one 2 years ago
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
Created
2020-12-18
104 commits to main branch, last one about a year ago
语音方向实验室/公司/资源/实习等,欢迎推荐或自荐
Created
2021-11-04
109 commits to main branch, last one 4 months ago
[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning
Created
2021-03-05
1,258 commits to main branch, last one about a year ago
Speech recognition toolkit for the arduino
This repository has been archived
(exclude archived)
Created
2012-08-12
133 commits to 4.x-workingBranch branch, last one 3 years ago
:sound: spafe: Simplified Python Audio Features Extraction
Created
2019-09-16
377 commits to master branch, last one 11 days ago
This repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly invited to pull requests.
Created
2020-06-16
80 commits to master branch, last one 4 years ago
UniSpeech - Large Scale Self-Supervised Learning for Speech
Created
2021-07-14
73 commits to main branch, last one 12 months ago
A python wrapper for Speech Signal Processing Toolkit (SPTK).
Created
2015-08-30
359 commits to master branch, last one 8 months ago