15 results found Sort:
- Filter by Primary Language:
- Python (10)
- Jupyter Notebook (3)
- Forth (1)
- HTML (1)
- +
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Created
2019-09-03
518 commits to master branch, last one 19 days ago
SincNet is a neural architecture for efficiently processing raw audio samples.
Created
2018-07-10
69 commits to master branch, last one 3 years ago
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
Created
2018-09-20
25 commits to master branch, last one 4 years ago
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.
Created
2018-10-05
272 commits to master branch, last one about a month ago
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN a...
timit
speech
speech-api
beamforming
librispeech
deeplearning
deep-learning
neural-network
speech-to-text
neural-networks
speech-analysis
speech-processing
speech-recognizer
speech-separation
speechrecognition
speech-recognition
speaker-recognition
speaker-verification
speaker-identification
speech-emotion-recognition
Created
2019-08-31
466 commits to master branch, last one 2 months ago
Deep Learning - one shot learning for speaker recognition using Filter Banks
Created
2019-11-20
43 commits to master branch, last one 4 years ago
Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"
Created
2024-06-05
33 commits to main branch, last one 9 days ago
[SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model
Created
2024-05-15
51 commits to main branch, last one about a month ago
Source code for paper "Who is real Bob? Adversarial Attacks on Speaker Recognition Systems" (IEEE S&P 2021)
Created
2019-11-06
77 commits to master branch, last one 2 years ago
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
This repository has been archived
(exclude archived)
Created
2021-08-23
69 commits to main branch, last one 3 years ago
A tool for summarizing dialogues from videos or audio
Created
2023-01-05
8 commits to main branch, last one about a year ago
Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO
Created
2021-10-19
54 commits to main branch, last one 2 years ago
Speakerbox: Fine-tune Audio Transformers for speaker identification.
Created
2022-01-25
71 commits to main branch, last one 9 months ago
this master thesis project is based on OpenAI Whisper with the goal to transcibe interviews
Created
2022-05-12
148 commits to master branch, last one 3 months ago
On-device speaker recognition engine powered by deep learning
Created
2023-05-03
76 commits to main branch, last one 7 days ago