16 results found

1.2k
9.2k
apache-2.0
123
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Created 2019-09-03
519 commits to master branch, last one about a month ago
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
Created 2018-09-20
25 commits to master branch, last one 5 years ago
40
410
apache-2.0
17
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.
Created 2018-10-05
279 commits to master branch, last one 12 days ago
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN a...
Created 2019-08-31
482 commits to master branch, last one 4 months ago
Deep Learning - one shot learning for speaker recognition using Filter Banks
Created 2019-11-20
43 commits to master branch, last one 5 years ago
16
139
bsd-3-clause
6
Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"
Created 2024-06-05
33 commits to main branch, last one 4 months ago
9
118
bsd-3-clause
7
[SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model
Created 2024-05-15
51 commits to main branch, last one 5 months ago
Source code for paper "Who is real Bob? Adversarial Attacks on Speaker Recognition Systems" (IEEE S&P 2021)
Created 2019-11-06
77 commits to master branch, last one 2 years ago
19
102
apache-2.0
7
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
This repository has been archived
Created 2021-08-23
69 commits to main branch, last one 3 years ago
A tool for summarizing dialogues from videos or audio
Created 2023-01-05
8 commits to main branch, last one about a year ago
12
69
unknown
1
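Building the simplest collection of TTS front ends and the simplest audiobook production workflow. Novels are split into sentences with regex rules, and the speakers of the dialogue in the novel are identified with RoBERTa, enabling one-click generation of multi-speaker audiobooks. Multi-speaker speech synthesis, high-quality audiobook production.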
Created 2025-03-05
18 commits to main branch, last one 14 days ago
Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO
Created 2021-10-19
54 commits to main branch, last one 2 years ago
Speakerbox: Fine-tune Audio Transformers for speaker identification.
Created 2022-01-25
71 commits to main branch, last one about a year ago
This master's thesis project is based on OpenAI Whisper, with the goal of transcribing interviews
Created 2022-05-12
148 commits to master branch, last one 8 months ago
5
33
apache-2.0
9
On-device speaker recognition engine powered by deep learning
Created 2023-05-03
87 commits to main branch, last one 26 days ago