Search Results - RepositoryStats

kaldi kaldi-asr

5.3k

14.7k

other

696

kaldi-asr/kaldi is the official location of the Kaldi project.

cuda kaldi shell speech speaker-id c-plus-plus speech-to-text speech-recognition speaker-verification

Created 2015-04-20

9,394 commits to master branch, last one 2 months ago

speechbrain speechbrain

1.5k

9.6k

apache-2.0

134

A PyTorch-based Speech Toolkit

Created 2020-04-28

10,486 commits to develop branch, last one a day ago

vosk-api alphacep

1.2k

9.1k

apache-2.0

120

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Created 2019-09-03

519 commits to master branch, last one 25 days ago

pyannote-audio pyannote

854

7.1k

mit

78

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

pytorch pretrained-models speaker-embedding speech-processing speaker-diarization speaker-recognition speaker-verification speaker-change-detection voice-activity-detection speech-activity-detection overlapped-speech-detection

Created 2016-03-07

2,400 commits to main branch, last one 6 months ago

awesome-speech-recognition-speech-synthesis-papers zzw922cn

514

3.0k

mit

186

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

Created 2017-04-28

181 commits to master branch, last one about a year ago

3D-Speaker modelscope

151

1.8k

apache-2.0

22

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

sdpn cnceleb campplus eres2net voxceleb 3d-speaker modelscope speaker-diarization speaker-verification language-identification

Created 2023-03-06

355 commits to main branch, last one 12 days ago

delta Delta-ML

288

1.6k

apache-2.0

64

DELTA is a deep learning based natural language and speech processing platform. LF AI & DATA Projects: https://lfaidata.foundation/projects/delta/

Created 2019-05-29

932 commits to master branch, last one 4 years ago

SincNet mravanelli

264

1.2k

mit

33

SincNet is a neural architecture for efficiently processing raw audio samples.

Created 2018-07-10

69 commits to master branch, last one 4 years ago

voxceleb_trainer clovaai

278

1.1k

mit

29

In defence of metric learning for speaker recognition

voxceleb metric-learning speaker-recognition speaker-verification

Created 2020-03-26

55 commits to master branch, last one 2 years ago

wespeaker wenet-e2e

132

860

apache-2.0

18

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Created 2021-09-28

315 commits to master branch, last one about a month ago

ECAPA-TDNN TaoRuijie

120

663

mit

5

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

voxceleb1 voxceleb2 ecapa-tdnn speaker-recognition speaker-verification

Created 2021-10-25

27 commits to main branch, last one about a year ago

dla markovka17

111

630

mit

25

Deep learning for audio processing

tts deep-learning keyword-spotting voice-conversion signal-processing speech-recognition speaker-verification

Created 2020-08-23

169 commits to 2024 branch, last one 3 months ago

PyTorch_Speaker_Verification HarryVolek

165

582

bsd-3-clause

19

PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.

pytorch speaker-verification speaker-identification

Created 2018-09-20

25 commits to master branch, last one 5 years ago

UniSpeech microsoft

74

453

other

18

UniSpeech - Large Scale Self-Supervised Learning for Speech

speech pytorch diarization speech-processing speech-separation speech-diarization speech-recognition speaker-verification

Created 2021-07-14

73 commits to main branch, last one 11 months ago

speaker-id google

40

407

apache-2.0

17

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

source-separation speaker-diarization speaker-recognition speaker-verification speaker-identification

Created 2018-10-05

273 commits to master branch, last one about a month ago

RawNet Jungjee

55

373

mit

14

Official repository for RawNet, RawNet2, and RawNet3

rawnet pytorch spk-embd voxceleb2 speaker-embeddings speaker-verification extracted-speaker-embeddings

Created 2019-03-18

149 commits to master branch, last one about a year ago

speechbrain.github.io speechbrain

29

365

unknown

40

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN a...

Created 2019-08-31

482 commits to master branch, last one 3 months ago

Speaker_Verification Janghyun1230

103

365

mit

22

Tensorflow implementation of "Generalized End-to-End Loss for Speaker Verification"

speaker-verification

Created 2018-05-30

35 commits to master branch, last one 3 years ago

dvector yistLin

46

278

unknown

11

Speaker embedding (d-vector) trained with GE2E loss

ge2e dvector pytorch torchscript speaker-encoder speaker-embedding speaker-verification

Created 2020-03-27

60 commits to master branch, last one about a year ago

speechlib NavodPeiris

18

199

mit

5

speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names

ai whisper-ai transcription faster-whisper speaker-diarization speaker-recognition speaker-verification automatic-speech-recognition

Created 2024-01-07

34 commits to main branch, last one about a month ago

speaker_extraction xuchenglin28

31

175

gpl-3.0

7

target speaker extraction and verification for multi-talker speech

source-separation speaker-extraction multi-talker-speech speaker-verification

Created 2018-10-28

19 commits to master branch, last one 4 years ago

FAKEBOB FAKEBOB-adversarial-attack

29

104

bsd-2-clause

6

Source code for paper "Who is real Bob? Adversarial Attacks on Speaker Recognition Systems" (IEEE S&P 2021)

gmm-ubm ivector ivector-plda adversarial-attacks speaker-verification speaker-identification speaker-recognition-systems open-set-speaker-identification close-set-speaker-identification

Created 2019-11-06

77 commits to master branch, last one 2 years ago

D-TDNN yuyq96

23

87

unknown

3

PyTorch implementation of Densely Connected Time Delay Neural Network

d-tdnn speech speaker-embedding speaker-adaptation speaker-diarization speaker-recognition speaker-verification time-delay-neural-network temporal-convolutional-network

Created 2020-08-08

28 commits to master branch, last one about a year ago

keras-sincnet grausof

26

72

unknown

4

Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)

asr cnn audio keras timit waveform filtering tensorflow deep-learning neural-network audio-processing machine-learning speech-processing speech-recognition speaker-recognition speaker-verification artificial-intelligence digital-signal-processing convolutional-neural-networks

Created 2018-11-23

20 commits to master branch, last one 3 years ago

SpeakerProfiling shangeth

22

65

mit

3

Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf

cnn lstm speech wav2vec classification audio-processing speech-processing speaker-recognition speaker-verification

Created 2021-02-03

32 commits to main branch, last one 3 years ago

OpenSpeaker zycv

13

64

apache-2.0

4

OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognition including multi-platform deployment and model optimization.

speaker-recognition speaker-verification voiceprint-recognition

Created 2021-09-30

14 commits to master branch, last one 3 years ago