Search Results - RepositoryStats

speechbrain speechbrain

1.5k

9.6k

apache-2.0

134

A PyTorch-based Speech Toolkit

Created 2020-04-28

10,486 commits to develop branch, last one 2 days ago

pyannote-audio pyannote

854

7.2k

mit

78

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

pytorch pretrained-models speaker-embedding speech-processing speaker-diarization speaker-recognition speaker-verification speaker-change-detection voice-activity-detection speech-activity-detection overlapped-speech-detection

Created 2016-03-07

2,400 commits to main branch, last one 6 months ago

awesome-multimodal-ml pliang279

873

6.4k

mit

179

Reading list for research topics in multimodal machine learning

robotics healthcare reading-list deep-learning computer-vision machine-learning speech-processing multimodal-learning reinforcement-learning representation-learning natural-language-processing

Created 2019-05-27

435 commits to master branch, last one 9 months ago

silero-vad snakers4

526

5.4k

mit

54

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

vad onnx speech pytorch onnxruntime onnx-runtime voice-control voice-commands voice-detection speech-processing voice-recognition voice-activity-detection

Created 2020-11-23

458 commits to master branch, last one 7 days ago

torchscale microsoft

215

3.1k

mit

44

Foundation Architecture for (M)LLMs

multimodal transformer translation computer-vision machine-learning speech-processing pretrained-language-model natural-language-processing

Created 2022-11-17

123 commits to main branch, last one 11 months ago

wavenet_vocoder r9y9

499

2.4k

other

95

WaveNet vocoder

python speech pytorch wavenet neural-vocoder wavenet-vocoder speech-synthesis speech-processing

Created 2017-12-27

261 commits to master branch, last one 4 years ago

whisper-timestamped linto-ai

179

2.3k

agpl-3.0

34

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Created 2023-01-13

253 commits to master branch, last one about a month ago

deepvoice3_pytorch r9y9

487

2.0k

other

92

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

tts python pytorch end-to-end multi-speaker machine-learning speech-synthesis speech-processing

Created 2017-10-31

221 commits to master branch, last one about a year ago

awesome-diarization wq2012

232

1.7k

apache-2.0

75

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

awesome awesome-list deep-learning machine-learning speech-processing speech-recognition speaker-diarization

Created 2019-01-19

123 commits to master branch, last one 5 months ago

resemble-enhance resemble-ai

192

1.7k

mit

21

AI powered speech denoising and enhancement

denoise speech-denoising speech-processing speech-enhancement

Created 2023-11-15

13 commits to main branch, last one 3 months ago

IMS-Toucan DigitalPhonetics

179

1.6k

apache-2.0

22

Controllable and fast Text-to-Speech for over 7000 languages!

tts speech pytorch toolkit deep-learning text-to-speech speech-synthesis speech-processing

Created 2021-08-05

3,161 commits to MassiveScaleToucan branch, last one 4 months ago

open-speech-corpora coqui-ai

142

1.3k

mit

56

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

stt tts voice-cloning speech-to-text text-to-speech speech-synthesis speech-processing speech-separation voice-recognition speech-recognition voice-activity-detection speech-emotion-recognition

Created 2019-01-31

139 commits to master branch, last one 2 years ago

SincNet mravanelli

264

1.2k

mit

33

SincNet is a neural architecture for efficiently processing raw audio samples.

Created 2018-07-10

69 commits to master branch, last one 4 years ago

voicefixer haoheliu

134

1.1k

mit

17

General Speech Restoration

mel tts speech denoise vocoder declipping dereverberation speech-analysis speech-synthesis super-resolution speech-processing speech-enhancement

Created 2021-09-06

99 commits to main branch, last one about a month ago

audino midas-research

132

1.1k

mit

24

Open source audio annotation tool for humans

python datasets annotation-tool audio-annotation audio-processing machine-learning speech-processing

Created 2019-10-03

259 commits to main branch, last one about a month ago

StreamSpeech ictnlp

80

1.0k

mit

13

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

Created 2024-06-04

25 commits to main branch, last one 7 months ago

SpeechAlgorithms Ryuk17

248

793

apache-2.0

22

You can find the speech algorithms you want here

speech-processing

Created 2020-05-11

139 commits to master branch, last one 2 months ago

SLAM-LLM X-LANCE

76

766

mit

23

Speech, Language, Audio, Music Processing with Large Language Model

peft audio-processing music-processing speech-processing large-language-model multimodal-large-language-models

Created 2023-10-23

886 commits to main branch, last one 25 days ago

speech-denoising-wavenet drethage

164

690

mit

19

A neural network for end-to-end speech denoising

speech wavenet end-to-end deep-learning neural-networks machine-learning speech-denoising speech-processing

Created 2017-06-19

3 commits to master branch, last one 7 years ago

CrisperWhisper nyrahealth

30

646

other

15

Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

asr audio filler speech whisper verbatim detection timestamps recognition transcription speech-processing speech-recognition

Created 2024-05-24

9 commits to main branch, last one 3 months ago

DTLN breizhn

161

605

mit

9

Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.

onnx audio keras tf-lite dtln-model tensorflow raspberry-pi deep-learning dns-challenge noise-reduction real-time-audio audio-processing speech-denoising noise-suppression speech-processing speech-enhancement

Created 2020-05-11

101 commits to master branch, last one 2 years ago

Speech-Backbones huawei-noah

125

578

unknown

22

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

speech-synthesis speech-processing speech-recognition

Created 2021-07-19

22 commits to main branch, last one 2 years ago

FullSubNet Audio-WestlakeU

157

562

mit

8

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

band audio paper speech pytorch sub-band denoising full-band narrow-band single-channel noise-reduction pretrained-model speech-processing speech-separation speech-enhancement reproducible-research

Created 2020-12-18

104 commits to main branch, last one about a year ago

Speech-Resources ddlBoJack

68

550

unknown

20

语音方向实验室/公司/资源/实习等，欢迎推荐或自荐

speech speech-processing

Created 2021-11-04

109 commits to main branch, last one 4 months ago

MultiBench pliang279

80

529

mit

15

[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning

robotics healthcare deep-learning computer-vision machine-learning speech-processing multimodal-learning representation-learning natural-language-processing

Created 2021-03-05

1,258 commits to main branch, last one about a year ago

uSpeech arjo129

102

474

mit

66

Speech recognition toolkit for the arduino

signal arduino speech-processing speech-recognition

This repository has been archived (exclude archived)

Created 2012-08-12

133 commits to 4.x-workingBranch branch, last one 3 years ago

spafe SuperKogito

79

468

bsd-3-clause

11

:sound: spafe: Simplified Python Audio Features Extraction

Created 2019-09-16

377 commits to master branch, last one 11 days ago

Tutorial_Separation gemengtju

95

459

unknown

21

This repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly invited to pull requests.

deep-learning speech-analysis signal-processing speech-processing speech-separation deep-neural-networks

Created 2020-06-16

80 commits to master branch, last one 4 years ago

UniSpeech microsoft

74

454

other

18

UniSpeech - Large Scale Self-Supervised Learning for Speech

speech pytorch diarization speech-processing speech-separation speech-diarization speech-recognition speaker-verification

Created 2021-07-14

73 commits to main branch, last one 12 months ago

pysptk r9y9

78

441

other

22

A python wrapper for Speech Signal Processing Toolkit (SPTK).

dsp sptk python speech python-wrapper speech-synthesis speech-processing digital-signal-processing

Created 2015-08-30

359 commits to master branch, last one 8 months ago