Statistics for topic speaker-recognition
RepositoryStats tracks 518,986 GitHub repositories; of these, 42 are tagged with the speaker-recognition topic. The most common primary language for repositories using this topic is Python (30).
Stargazers over time for topic speaker-recognition
Most starred repositories for topic speaker-recognition
Trending repositories for topic speaker-recognition
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2)
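This project uses a variety of advanced voiceprint recognition models, including EcapaTdnn, ResNetSE, ERes2Net, and CAM++, and also supports multiple data preprocessing methods such as MelSpectrogram, Spectrogram, MFCC, and Fbank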
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, and CAM++. More models may be supported in the future. At the sam...
Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.
A PyTorch library for security research on speaker recognition, released with the paper "Towards Understanding and Mitigating Audio Adversarial Examples for Speaker Recognition", accepted by TDSC
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
Introduction to Speech Processing