Trending repositories for topic speaker-diarization
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
turnkey self-hosted offline transcription and diarization service with llm summary
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
turnkey self-hosted offline transcription and diarization service with llm summary
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
A python package to build AI-powered real-time audio applications
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
turnkey self-hosted offline transcription and diarization service with llm summary
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024] and "LS-EEND: long-form streaming en...
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024] and "LS-EEND: long-form streaming en...
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
turnkey self-hosted offline transcription and diarization service with llm summary
A python package to build AI-powered real-time audio applications
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Some comprehensive papers about speaker diarization
Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
turnkey self-hosted offline transcription and diarization service with llm summary
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024] and "LS-EEND: long-form streaming en...
Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024] and "LS-EEND: long-form streaming en...
Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.
Official repository for Mamba-based Segmentation Model for Speaker Diarization
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
turnkey self-hosted offline transcription and diarization service with llm summary
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
语音算法相关资源汇总 Resource for Speech Processing || NEWS: official link of VoxCeleb fails recently and an external link is added for download
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.
Official repository for Mamba-based Segmentation Model for Speaker Diarization
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
turnkey self-hosted offline transcription and diarization service with llm summary
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024] and "LS-EEND: long-form streaming en...
Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.
turnkey self-hosted offline transcription and diarization service with llm summary
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
Official repository for Mamba-based Segmentation Model for Speaker Diarization
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
Some comprehensive papers about speaker diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024] and "LS-EEND: long-form streaming en...
Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.