Trending repositories for topic speaker-diarization
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
turnkey self-hosted offline transcription and diarization service with llm summary
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Some comprehensive papers about speaker diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
turnkey self-hosted offline transcription and diarization service with llm summary
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
turnkey self-hosted offline transcription and diarization service with llm summary
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
A python package to build AI-powered real-time audio applications
Some comprehensive papers about speaker diarization
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
turnkey self-hosted offline transcription and diarization service with llm summary
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
turnkey self-hosted offline transcription and diarization service with llm summary
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.
Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
turnkey self-hosted offline transcription and diarization service with llm summary
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
turnkey self-hosted offline transcription and diarization service with llm summary
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]
turnkey self-hosted offline transcription and diarization service with llm summary
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
Some comprehensive papers about speaker diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]
A python package to build AI-powered real-time audio applications
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.
Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
Application for viewing Rich Transcription Time Marked (RTTM) files in an interactive way