Trending repositories for topic speaker-recognition

Last 3 days (new repositories)

no newly created repositories trending in the last 3 days

Last 3 days (absolute gain)

NVIDIA/NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

12,491 (+16)

apache-2.0

pyannote/pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

6,546 (+15)

mit

speechbrain/speechbrain

A PyTorch-based Speech Toolkit

9,088 (+12)

apache-2.0

wenet-e2e/wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

770 (+6)

apache-2.0

TaoRuijie/ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

621 (+1)

mit

Last 3 days (relative gain)

wenet-e2e/wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

770 (+0.8%)

apache-2.0

pyannote/pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

6,546 (+0.2%)

mit

TaoRuijie/ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

621 (+0.2%)

mit

speechbrain/speechbrain

A PyTorch-based Speech Toolkit

9,088 (+0.1%)

apache-2.0

NVIDIA/NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

12,491 (+0.1%)

apache-2.0

Last week (new repositories)

no newly created repositories trending in the last week

Last week (absolute gain)

NVIDIA/NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

12,491 (+43)

apache-2.0

pyannote/pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

6,546 (+41)

mit

speechbrain/speechbrain

A PyTorch-based Speech Toolkit

9,088 (+26)

apache-2.0

wenet-e2e/wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

770 (+12)

apache-2.0

yeyupiaoling/VoiceprintRecognition-Pytorch

This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not excluded that more models will be supported in the future. At the sam...

849 (+6)

apache-2.0

SamirPaulb/real-time-voice-translator

A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.

212 (+5)

gpl-2.0

yeyupiaoling/VoiceprintRecognition-Tensorflow

使用Tensorflow实现声纹识别

301 (+3)

apache-2.0

TaoRuijie/ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

621 (+3)

mit

NavodPeiris/speechlib

speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names

168 (+2)

mit

Speech-Interaction-Technology-Aalto-U/itsp

Introduction to Speech Processing

77 (+1)

cc-by-sa-4.0

yeyupiaoling/VoiceprintRecognition-PaddlePaddle

本项目使用了EcapaTdnn、ResNetSE、ERes2Net、CAM++等多种先进的声纹识别模型，同时本项目也支持了MelSpectrogram、Spectrogram、MFCC、Fbank等多种数据预处理方法

241 (+1)

apache-2.0

mravanelli/SincNet

SincNet is a neural architecture for efficiently processing raw audio samples.

1,145 (+1)

mit

google/uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

1,565 (+1)

apache-2.0

Last week (relative gain)

SamirPaulb/real-time-voice-translator

A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.

212 (+2%)

gpl-2.0

wenet-e2e/wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

770 (+2%)

apache-2.0

Speech-Interaction-Technology-Aalto-U/itsp

Introduction to Speech Processing

77 (+1%)

cc-by-sa-4.0

NavodPeiris/speechlib

speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names

168 (+1%)

mit

yeyupiaoling/VoiceprintRecognition-Tensorflow

使用Tensorflow实现声纹识别

301 (+1%)

apache-2.0

yeyupiaoling/VoiceprintRecognition-Pytorch

849 (+0.7%)

apache-2.0

pyannote/pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

6,546 (+0.6%)

mit

TaoRuijie/ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

621 (+0.5%)

mit

yeyupiaoling/VoiceprintRecognition-PaddlePaddle

本项目使用了EcapaTdnn、ResNetSE、ERes2Net、CAM++等多种先进的声纹识别模型，同时本项目也支持了MelSpectrogram、Spectrogram、MFCC、Fbank等多种数据预处理方法

241 (+0.4%)

apache-2.0

NVIDIA/NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

12,491 (+0.3%)

apache-2.0

speechbrain/speechbrain

A PyTorch-based Speech Toolkit

9,088 (+0.3%)

apache-2.0

mravanelli/SincNet

SincNet is a neural architecture for efficiently processing raw audio samples.

1,145 (+0.1%)

mit

google/uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

1,565 (+0.1%)

apache-2.0

Last month (new repositories)

no newly created repositories trending in the last month

Last month (absolute gain)

NVIDIA/NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

12,491 (+314)

apache-2.0

pyannote/pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

6,546 (+177)

mit

speechbrain/speechbrain

A PyTorch-based Speech Toolkit

9,088 (+134)

apache-2.0

wenet-e2e/wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

770 (+39)

apache-2.0

yeyupiaoling/VoiceprintRecognition-Pytorch

849 (+36)

apache-2.0

SamirPaulb/real-time-voice-translator

A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.

212 (+17)

gpl-2.0

NavodPeiris/speechlib

speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names

168 (+11)

mit

google/speaker-id

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

384 (+11)

apache-2.0

clovaai/voxceleb_trainer

In defence of metric learning for speaker recognition

1,071 (+11)

mit

yeyupiaoling/VoiceprintRecognition-PaddlePaddle

本项目使用了EcapaTdnn、ResNetSE、ERes2Net、CAM++等多种先进的声纹识别模型，同时本项目也支持了MelSpectrogram、Spectrogram、MFCC、Fbank等多种数据预处理方法

241 (+8)

apache-2.0

TaoRuijie/ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

621 (+8)

mit

yeyupiaoling/VoiceprintRecognition-Tensorflow

使用Tensorflow实现声纹识别

301 (+7)

apache-2.0

Picovoice/falcon

On-device speaker diarization powered by deep learning

32 (+6)

apache-2.0

google/uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

1,565 (+5)

apache-2.0

Speech-Interaction-Technology-Aalto-U/itsp

Introduction to Speech Processing

77 (+4)

cc-by-sa-4.0

taylorlu/Speaker-Diarization

speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition

473 (+3)

apache-2.0

mravanelli/SincNet

SincNet is a neural architecture for efficiently processing raw audio samples.

1,145 (+3)

mit

Picovoice/eagle

On-device speaker recognition engine powered by deep learning

30 (+2)

apache-2.0

astorfi/3D-convolutional-speaker-recognition

:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification

784 (+2)

apache-2.0

athena-team/athena

an open-source implementation of sequence-to-sequence based speech processing engine

956 (+2)

apache-2.0

Last month (relative gain)

Picovoice/falcon

On-device speaker diarization powered by deep learning

32 (+23%)

apache-2.0

SamirPaulb/real-time-voice-translator

A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.

212 (+9%)

gpl-2.0

Picovoice/eagle

On-device speaker recognition engine powered by deep learning

30 (+7%)

apache-2.0

NavodPeiris/speechlib

speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names

168 (+7%)

mit

Speech-Interaction-Technology-Aalto-U/itsp

Introduction to Speech Processing

77 (+5%)

cc-by-sa-4.0

wenet-e2e/wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

770 (+5%)

apache-2.0

yeyupiaoling/VoiceprintRecognition-Pytorch

849 (+4%)

apache-2.0

yeyupiaoling/VoiceprintRecognition-PaddlePaddle

本项目使用了EcapaTdnn、ResNetSE、ERes2Net、CAM++等多种先进的声纹识别模型，同时本项目也支持了MelSpectrogram、Spectrogram、MFCC、Fbank等多种数据预处理方法

241 (+3%)

apache-2.0

google/speaker-id

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

384 (+3%)

apache-2.0

pyannote/pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

6,546 (+3%)

mit

SpeakerGuard/SpeakerGuard

a Pytorch library for security research on speaker recognition, released in "Towards Understanding and Mitigating Audio Adversarial Examples for Speaker Recognition" accepted by TDSC

37 (+3%)

NVIDIA/NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

12,491 (+3%)

apache-2.0

yeyupiaoling/VoiceprintRecognition-Tensorflow

使用Tensorflow实现声纹识别

301 (+2%)

apache-2.0

ZhaZhaFon/resource_speech

语音算法相关资源汇总 Resource for Speech Processing || NEWS: official link of VoxCeleb fails recently and an external link is added for download

47 (+2%)

gpl-3.0

speechbrain/speechbrain

A PyTorch-based Speech Toolkit

9,088 (+1%)

apache-2.0

TaoRuijie/ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

621 (+1%)

mit

clovaai/voxceleb_trainer

In defence of metric learning for speaker recognition

1,071 (+1%)

mit

yeyupiaoling/VoiceprintRecognition-Keras

基于Kersa实现的声纹识别模型

132 (+0.8%)

apache-2.0

taylorlu/Speaker-Diarization

speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition

473 (+0.6%)

apache-2.0

Speaker-Identification/You-Only-Speak-Once

Deep Learning - one shot learning for speaker recognition using Filter Banks

162 (+0.6%)

Last 12-months (new repositories)

NavodPeiris/speechlib

speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names

168

mit

Last 12-months (absolute gain)

NVIDIA/NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

12,491 (+3,735)

apache-2.0

pyannote/pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

6,546 (+2,284)

mit

speechbrain/speechbrain

A PyTorch-based Speech Toolkit

9,088 (+2,099)

apache-2.0

wenet-e2e/wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

770 (+342)

apache-2.0

yeyupiaoling/VoiceprintRecognition-Pytorch

849 (+340)

apache-2.0

SamirPaulb/real-time-voice-translator

A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.

212 (+187)

gpl-2.0

clovaai/voxceleb_trainer

In defence of metric learning for speaker recognition

1,071 (+171)

mit

NavodPeiris/speechlib

speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names

168 (+164)

mit

TaoRuijie/ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

621 (+154)

mit

nuaazs/VAF_2

Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.

403 (+134)

google/speaker-id

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

384 (+122)

apache-2.0

mravanelli/SincNet

SincNet is a neural architecture for efficiently processing raw audio samples.

1,145 (+77)

mit

yeyupiaoling/VoiceprintRecognition-PaddlePaddle

本项目使用了EcapaTdnn、ResNetSE、ERes2Net、CAM++等多种先进的声纹识别模型，同时本项目也支持了MelSpectrogram、Spectrogram、MFCC、Fbank等多种数据预处理方法

241 (+76)

apache-2.0

georgygospodinov/speech_course

Deep Learning for Speech

81 (+45)

google/uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

1,565 (+44)

apache-2.0

Speech-Interaction-Technology-Aalto-U/itsp

Introduction to Speech Processing

77 (+42)

cc-by-sa-4.0

taylorlu/Speaker-Diarization

speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition

473 (+41)

apache-2.0

Speaker-Identification/You-Only-Speak-Once

Deep Learning - one shot learning for speaker recognition using Filter Banks

162 (+40)

athena-team/athena

an open-source implementation of sequence-to-sequence based speech processing engine

956 (+35)

apache-2.0

yeyupiaoling/VoiceprintRecognition-Tensorflow

使用Tensorflow实现声纹识别

301 (+31)

apache-2.0

Last 12-months (relative gain)

NavodPeiris/speechlib

speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names

168 (+4,100%)

mit

SamirPaulb/real-time-voice-translator

A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.

212 (+748%)

gpl-2.0

Picovoice/falcon

On-device speaker diarization powered by deep learning

32 (+540%)

apache-2.0

Picovoice/eagle

On-device speaker recognition engine powered by deep learning

30 (+173%)

apache-2.0

georgygospodinov/speech_course

Deep Learning for Speech

81 (+125%)

Speech-Interaction-Technology-Aalto-U/itsp

Introduction to Speech Processing

77 (+120%)

cc-by-sa-4.0

wenet-e2e/wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

770 (+80%)

apache-2.0

SpeakerGuard/SpeakerGuard

a Pytorch library for security research on speaker recognition, released in "Towards Understanding and Mitigating Audio Adversarial Examples for Speaker Recognition" accepted by TDSC

37 (+68%)

yeyupiaoling/VoiceprintRecognition-Pytorch

849 (+67%)

apache-2.0

pyannote/pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

6,546 (+54%)

mit

nuaazs/VAF_2

Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.

403 (+50%)

thuiar/MIntRec

MIntRec: A New Dataset for Multimodal Intent Recognition (ACM MM 2022)

78 (+47%)

mit

google/speaker-id

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

384 (+47%)

apache-2.0

yeyupiaoling/VoiceprintRecognition-PaddlePaddle

本项目使用了EcapaTdnn、ResNetSE、ERes2Net、CAM++等多种先进的声纹识别模型，同时本项目也支持了MelSpectrogram、Spectrogram、MFCC、Fbank等多种数据预处理方法

241 (+46%)

apache-2.0

NVIDIA/NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

12,491 (+43%)

apache-2.0

Wadaboa/titanet

Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO

59 (+37%)

mit

TaoRuijie/ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

621 (+33%)

mit

Speaker-Identification/You-Only-Speak-Once

Deep Learning - one shot learning for speaker recognition using Filter Banks

162 (+33%)

speechbrain/speechbrain

A PyTorch-based Speech Toolkit

9,088 (+30%)

apache-2.0

zycv/OpenSpeaker

OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognition including multi-platform deployment and model optimization.

62 (+29%)

apache-2.0