Trending repositories for topic speaker-diarization

Last 3 days (new repositories)

no newly created repositories trending in the last 3 days

Last 3 days (absolute gain)

modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

7,403 (+38)

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

3,901 (+18)

bsd-2-clause

pyannote/pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

6,546 (+15)

mit

modelscope/3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

1,335 (+13)

apache-2.0

speechbrain/speechbrain

A PyTorch-based Speech Toolkit

9,088 (+12)

apache-2.0

Purfview/whisper-standalone-win

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

1,418 (+10)

juanmc2005/diart

A python package to build AI-powered real-time audio applications

1,124 (+7)

mit

transcriptionstream/transcriptionstream

turnkey self-hosted offline transcription and diarization service with llm summary

765 (+6)

gpl-3.0

wenet-e2e/wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

770 (+6)

apache-2.0

linto-ai/whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

2,130 (+6)

agpl-3.0

espnet/espnet

End-to-End Speech Processing Toolkit

8,608 (+6)

apache-2.0

revdotcom/reverb

Open source inference code for Rev's model

347 (+1)

apache-2.0

wq2012/SpectralCluster

Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.

517 (+1)

apache-2.0

Last 3 days (relative gain)

modelscope/3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

1,335 (+1.0%)

apache-2.0

transcriptionstream/transcriptionstream

turnkey self-hosted offline transcription and diarization service with llm summary

765 (+0.8%)

gpl-3.0

wenet-e2e/wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

770 (+0.8%)

apache-2.0

Purfview/whisper-standalone-win

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

1,418 (+0.7%)

juanmc2005/diart

A python package to build AI-powered real-time audio applications

1,124 (+0.6%)

mit

modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

7,403 (+0.5%)

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

3,901 (+0.5%)

bsd-2-clause

revdotcom/reverb

Open source inference code for Rev's model

347 (+0.3%)

apache-2.0

linto-ai/whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

2,130 (+0.3%)

agpl-3.0

pyannote/pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

6,546 (+0.2%)

mit

wq2012/SpectralCluster

Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.

517 (+0.2%)

apache-2.0

speechbrain/speechbrain

A PyTorch-based Speech Toolkit

9,088 (+0.1%)

apache-2.0

espnet/espnet

End-to-End Speech Processing Toolkit

8,608 (+0.1%)

apache-2.0

Last week (new repositories)

no newly created repositories trending in the last week

Last week (absolute gain)

modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

7,403 (+89)

pyannote/pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

6,546 (+41)

mit

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

3,901 (+35)

bsd-2-clause

modelscope/3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

1,335 (+26)

apache-2.0

speechbrain/speechbrain

A PyTorch-based Speech Toolkit

9,088 (+26)

apache-2.0

Purfview/whisper-standalone-win

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

1,418 (+23)

espnet/espnet

End-to-End Speech Processing Toolkit

8,608 (+14)

apache-2.0

wenet-e2e/wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

770 (+12)

apache-2.0

linto-ai/whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

2,130 (+12)

agpl-3.0

juanmc2005/diart

A python package to build AI-powered real-time audio applications

1,124 (+11)

mit

transcriptionstream/transcriptionstream

turnkey self-hosted offline transcription and diarization service with llm summary

765 (+8)

gpl-3.0

revdotcom/reverb

Open source inference code for Rev's model

347 (+5)

apache-2.0

Audio-WestlakeU/FS-EEND

The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024] and "LS-EEND: long-form streaming en...

94 (+2)

mit

NavodPeiris/speechlib

speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names

168 (+2)

mit

hitachi-speech/EEND

End-to-End Neural Diarization

381 (+2)

mit

wq2012/awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1,645 (+2)

apache-2.0

DongKeon/Awesome-Speaker-Diarization

Some comprehensive papers about speaker diarization

236 (+1)

wq2012/SpectralCluster

Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.

517 (+1)

apache-2.0

yinruiqing/pyannote-whisper

No description

536 (+1)

google/uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

1,565 (+1)

apache-2.0

Last week (relative gain)

Audio-WestlakeU/FS-EEND

94 (+2%)

mit

modelscope/3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

1,335 (+2%)

apache-2.0

Purfview/whisper-standalone-win

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

1,418 (+2%)

wenet-e2e/wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

770 (+2%)

apache-2.0

revdotcom/reverb

Open source inference code for Rev's model

347 (+1%)

apache-2.0

modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

7,403 (+1%)

NavodPeiris/speechlib

speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names

168 (+1%)

mit

transcriptionstream/transcriptionstream

turnkey self-hosted offline transcription and diarization service with llm summary

765 (+1%)

gpl-3.0

juanmc2005/diart

A python package to build AI-powered real-time audio applications

1,124 (+1.0%)

mit

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

3,901 (+0.9%)

bsd-2-clause

pyannote/pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

6,546 (+0.6%)

mit

linto-ai/whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

2,130 (+0.6%)

agpl-3.0

hitachi-speech/EEND

End-to-End Neural Diarization

381 (+0.5%)

mit

DongKeon/Awesome-Speaker-Diarization

Some comprehensive papers about speaker diarization

236 (+0.4%)

speechbrain/speechbrain

A PyTorch-based Speech Toolkit

9,088 (+0.3%)

apache-2.0

wq2012/SpectralCluster

Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.

517 (+0.2%)

apache-2.0

yinruiqing/pyannote-whisper

No description

536 (+0.2%)

espnet/espnet

End-to-End Speech Processing Toolkit

8,608 (+0.2%)

apache-2.0

wq2012/awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1,645 (+0.1%)

apache-2.0

google/uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

1,565 (+0.1%)

apache-2.0

Last month (new repositories)

no newly created repositories trending in the last month

Last month (absolute gain)

modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

7,403 (+367)

pyannote/pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

6,546 (+177)

mit

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

3,901 (+159)

bsd-2-clause

speechbrain/speechbrain

A PyTorch-based Speech Toolkit

9,088 (+134)

apache-2.0

espnet/espnet

End-to-End Speech Processing Toolkit

8,608 (+93)

apache-2.0

Purfview/whisper-standalone-win

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

1,418 (+84)

modelscope/3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

1,335 (+82)

apache-2.0

linto-ai/whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

2,130 (+73)

agpl-3.0

wenet-e2e/wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

770 (+39)

apache-2.0

juanmc2005/diart

A python package to build AI-powered real-time audio applications

1,124 (+31)

mit

transcriptionstream/transcriptionstream

turnkey self-hosted offline transcription and diarization service with llm summary

765 (+23)

gpl-3.0

wq2012/awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1,645 (+18)

apache-2.0

DongKeon/Awesome-Speaker-Diarization

Some comprehensive papers about speaker diarization

236 (+15)

yinruiqing/pyannote-whisper

No description

536 (+14)

revdotcom/reverb

Open source inference code for Rev's model

347 (+13)

apache-2.0

NavodPeiris/speechlib

speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names

168 (+11)

mit

google/speaker-id

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

384 (+11)

apache-2.0

Audio-WestlakeU/FS-EEND

94 (+9)

mit

Picovoice/falcon

On-device speaker diarization powered by deep learning

32 (+6)

apache-2.0

wq2012/SpectralCluster

Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.

517 (+5)

apache-2.0

Last month (relative gain)

Picovoice/falcon

On-device speaker diarization powered by deep learning

32 (+23%)

apache-2.0

Audio-WestlakeU/FS-EEND

94 (+11%)

mit

clement-pages/gryannote

Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.

50 (+9%)

mit

nttcslab-sp/mamba-diarization

Official repository for Mamba-based Segmentation Model for Speaker Diarization

27 (+8%)

NavodPeiris/speechlib

speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names

168 (+7%)

mit

DongKeon/Awesome-Speaker-Diarization

Some comprehensive papers about speaker diarization

236 (+7%)

modelscope/3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

1,335 (+7%)

apache-2.0

Purfview/whisper-standalone-win

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

1,418 (+6%)

wenet-e2e/wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

770 (+5%)

apache-2.0

modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

7,403 (+5%)

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

3,901 (+4%)

bsd-2-clause

revdotcom/reverb

Open source inference code for Rev's model

347 (+4%)

apache-2.0

linto-ai/whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

2,130 (+4%)

agpl-3.0

transcriptionstream/transcriptionstream

turnkey self-hosted offline transcription and diarization service with llm summary

765 (+3%)

gpl-3.0

google/speaker-id

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

384 (+3%)

apache-2.0

juanmc2005/diart

A python package to build AI-powered real-time audio applications

1,124 (+3%)

mit

pyannote/pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

6,546 (+3%)

mit

yinruiqing/pyannote-whisper

No description

536 (+3%)

ZhaZhaFon/resource_speech

语音算法相关资源汇总 Resource for Speech Processing || NEWS: official link of VoxCeleb fails recently and an external link is added for download

47 (+2%)

gpl-3.0

speechbrain/speechbrain

A PyTorch-based Speech Toolkit

9,088 (+1%)

apache-2.0

Last 12-months (new repositories)

revdotcom/reverb

Open source inference code for Rev's model

347

apache-2.0

NavodPeiris/speechlib

speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names

168

mit

clement-pages/gryannote

Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.

mit

nttcslab-sp/mamba-diarization

Official repository for Mamba-based Segmentation Model for Speaker Diarization

Last 12-months (absolute gain)

modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

7,403 (+5,800)

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

3,901 (+2,488)

bsd-2-clause

pyannote/pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

6,546 (+2,284)

mit

speechbrain/speechbrain

A PyTorch-based Speech Toolkit

9,088 (+2,099)

apache-2.0

espnet/espnet

End-to-End Speech Processing Toolkit

8,608 (+1,139)

apache-2.0

linto-ai/whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

2,130 (+985)

agpl-3.0

modelscope/3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

1,335 (+980)

apache-2.0

Purfview/whisper-standalone-win

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

1,418 (+969)

transcriptionstream/transcriptionstream

turnkey self-hosted offline transcription and diarization service with llm summary

765 (+751)

gpl-3.0

juanmc2005/diart

A python package to build AI-powered real-time audio applications

1,124 (+518)

mit

revdotcom/reverb

Open source inference code for Rev's model

347 (+346)

apache-2.0

wenet-e2e/wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

770 (+342)

apache-2.0

wq2012/awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1,645 (+290)

apache-2.0

yinruiqing/pyannote-whisper

No description

536 (+215)

NavodPeiris/speechlib

speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names

168 (+164)

mit

DongKeon/Awesome-Speaker-Diarization

Some comprehensive papers about speaker diarization

236 (+160)

nuaazs/VAF_2

Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.

403 (+134)

google/speaker-id

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

384 (+122)

apache-2.0

Audio-WestlakeU/FS-EEND

94 (+49)

mit

clement-pages/gryannote

Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.

50 (+47)

mit

Last 12-months (relative gain)

transcriptionstream/transcriptionstream

turnkey self-hosted offline transcription and diarization service with llm summary

765 (+5,364%)

gpl-3.0

NavodPeiris/speechlib

speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names

168 (+4,100%)

mit

Picovoice/falcon

On-device speaker diarization powered by deep learning

32 (+540%)

apache-2.0

nttcslab-sp/mamba-diarization

Official repository for Mamba-based Segmentation Model for Speaker Diarization

27 (+440%)

modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

7,403 (+362%)

modelscope/3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

1,335 (+276%)

apache-2.0

Purfview/whisper-standalone-win

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

1,418 (+216%)

DongKeon/Awesome-Speaker-Diarization

Some comprehensive papers about speaker diarization

236 (+211%)

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

3,901 (+176%)

bsd-2-clause

Audio-WestlakeU/FS-EEND

94 (+109%)

mit

FrenchKrab/IS2023-powerset-diarization

Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.

72 (+106%)

linto-ai/whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

2,130 (+86%)

agpl-3.0

juanmc2005/diart

A python package to build AI-powered real-time audio applications

1,124 (+85%)

mit

wenet-e2e/wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

770 (+80%)

apache-2.0

yinruiqing/pyannote-whisper

No description

536 (+67%)

pyannote/pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

6,546 (+54%)

mit

nuaazs/VAF_2

Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.

403 (+50%)

google/speaker-id

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

384 (+47%)

apache-2.0

nezhar/speech-condenser

A tool for summarizing dialogues from videos or audio

80 (+43%)

JaesungHuh/SimpleDiarization

Simple Diarization model

43 (+39%)

mit