33 results found Sort:
- Filter by Primary Language:
- Python (21)
- Jupyter Notebook (6)
- C++ (1)
- HTML (1)
- Shell (1)
- +
kaldi-asr/kaldi is the official location of the Kaldi project.
Created
2015-04-20
9,388 commits to master branch, last one about a month ago
A PyTorch-based Speech Toolkit
asr
audio
pytorch
huggingface
transformers
deep-learning
language-model
speech-to-text
speech-toolkit
audio-processing
speech-processing
speech-separation
speechrecognition
voice-recognition
speech-enhancement
speech-recognition
speaker-diarization
speaker-recognition
speaker-verification
spoken-language-understanding
Created
2020-04-28
10,276 commits to develop branch, last one 6 days ago
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Created
2019-09-03
518 commits to master branch, last one 7 days ago
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Created
2016-03-07
2,415 commits to develop branch, last one 9 days ago
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
Created
2017-04-28
181 commits to master branch, last one about a year ago
DELTA is a deep learning based natural language and speech processing platform.
Created
2019-05-29
932 commits to master branch, last one 3 years ago
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Created
2023-03-06
326 commits to main branch, last one 23 days ago
SincNet is a neural architecture for efficiently processing raw audio samples.
Created
2018-07-10
69 commits to master branch, last one 3 years ago
In defence of metric learning for speaker recognition
Created
2020-03-26
55 commits to master branch, last one 2 years ago
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
Created
2021-09-28
306 commits to master branch, last one 6 days ago
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
Created
2021-10-25
27 commits to main branch, last one about a year ago
Deep learning for audio processing
Created
2020-08-23
154 commits to 2024 branch, last one 3 days ago
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
Created
2018-09-20
25 commits to master branch, last one 4 years ago
UniSpeech - Large Scale Self-Supervised Learning for Speech
Created
2021-07-14
73 commits to main branch, last one 7 months ago
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.
Created
2018-10-05
272 commits to master branch, last one 27 days ago
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN a...
timit
speech
speech-api
beamforming
librispeech
deeplearning
deep-learning
neural-network
speech-to-text
neural-networks
speech-analysis
speech-processing
speech-recognizer
speech-separation
speechrecognition
speech-recognition
speaker-recognition
speaker-verification
speaker-identification
speech-emotion-recognition
Created
2019-08-31
466 commits to master branch, last one about a month ago
Official repository for RawNet, RawNet2, and RawNet3
Created
2019-03-18
149 commits to master branch, last one 8 months ago
Tensorflow implementation of "Generalized End-to-End Loss for Speaker Verification"
Created
2018-05-30
35 commits to master branch, last one 3 years ago
Speaker embedding (d-vector) trained with GE2E loss
Created
2020-03-27
60 commits to master branch, last one about a year ago
target speaker extraction and verification for multi-talker speech
Created
2018-10-28
19 commits to master branch, last one 3 years ago
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
Created
2024-01-07
33 commits to main branch, last one about a month ago
Source code for paper "Who is real Bob? Adversarial Attacks on Speaker Recognition Systems" (IEEE S&P 2021)
Created
2019-11-06
77 commits to master branch, last one 2 years ago
PyTorch implementation of Densely Connected Time Delay Neural Network
Created
2020-08-08
28 commits to master branch, last one about a year ago
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Created
2018-11-23
20 commits to master branch, last one 3 years ago
Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf
Created
2021-02-03
32 commits to main branch, last one 3 years ago
OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognition including multi-platform deployment and model optimization.
Created
2021-09-30
14 commits to master branch, last one 2 years ago
Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO
Created
2021-10-19
54 commits to main branch, last one 2 years ago
A curated list of speaker-embedding speaker-verification, speaker-identification resources.
Created
2021-05-27
18 commits to main branch, last one 3 years ago
A toolbox of audio models and algorithms based on MindSpore
Created
2022-09-07
289 commits to main branch, last one 3 months ago
This repo is to list the references papers of 《Speaker Recognition Based on Deep Learning: An Overview》
Created
2021-06-25
2 commits to master branch, last one 3 years ago