Search Results - RepositoryStats

16 results found

Primary languages:
Python (11)
Jupyter Notebook (3)
Forth (1)
HTML (1)

vosk-api alphacep

1.2k

9.2k

apache-2.0

123

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Created 2019-09-03

519 commits to master branch, last one about a month ago

SincNet mravanelli

265

1.2k

mit

33

SincNet is a neural architecture for efficiently processing raw audio samples.

Created 2018-07-10

69 commits to master branch, last one 4 years ago

PyTorch_Speaker_Verification HarryVolek

165

583

bsd-3-clause

19

PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.

pytorch speaker-verification speaker-identification

Created 2018-09-20

25 commits to master branch, last one 5 years ago

speaker-id google

40

410

apache-2.0

17

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

source-separation speaker-diarization speaker-recognition speaker-verification speaker-identification

Created 2018-10-05

279 commits to master branch, last one 12 days ago

speechbrain.github.io speechbrain

29

365

unknown

40

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN a...

Created 2019-08-31

482 commits to master branch, last one 4 months ago

You-Only-Speak-Once Speaker-Identification

41

165

unknown

5

Deep Learning - one shot learning for speaker recognition using Filter Banks

audio speech deep-speaker triplet-loss deep-learning neural-network siamese-networks one-shot-learning speaker-recognition voice-authentication speaker-identification

Created 2019-11-20

43 commits to master branch, last one 5 years ago

Audio-Mamba-AuM kaistmm

16

139

bsd-3-clause

6

Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"

audio mamba pytorch audio-mamba deep-learning state-space-model audio-classification speech-classification speaker-identification representation-learning

Created 2024-06-05

33 commits to main branch, last one 4 months ago

ssamba SiavashShams

9

118

bsd-3-clause

7

[SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model

audio mamba deep-learning keyword-spotting state-space-model emotion-recognition audio-classification speaker-identification representation-learning self-supervised-learning

Created 2024-05-15

51 commits to main branch, last one 5 months ago

FAKEBOB FAKEBOB-adversarial-attack

29

104

bsd-2-clause

6

Source code for paper "Who is real Bob? Adversarial Attacks on Speaker Recognition Systems" (IEEE S&P 2021)

gmm-ubm ivector ivector-plda adversarial-attacks speaker-verification speaker-identification speaker-recognition-systems open-set-speaker-identification close-set-speaker-identification

Created 2019-11-06

77 commits to master branch, last one 2 years ago

UHV-OTS-Speech Appen

19

102

apache-2.0

7

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

topic-detection accent-detection speech-annotation speech-processing speech-seperation audio-segmentation speech-recognition speaker-diarization speech-transcription gender-classification speaker-identification synthetic-speech-detection

This repository has been archived

Created 2021-08-23

69 commits to main branch, last one 3 years ago

speech-condenser nezhar

10

82

unknown

4

A tool for summarizing dialogues from videos or audio

asr summarization speach-recognition speaker-diarization speaker-identification

Created 2023-01-05

8 commits to main branch, last one about a year ago

easytts Warma10032

12

69

unknown

1

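Building the simplest collection of TTS front ends and the simplest audiobook production workflow. Splits novels into sentences using regex rules and identifies the speaker of each line of dialogue with RoBERTa, enabling one-click generation of multi-voice audiobooks. Multi-speaker speech synthesis for high-quality audiobook production.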

ai nlp tts pyqt audio-generation speaker-identification

Created 2025-03-05

18 commits to main branch, last one 14 days ago

titanet Wadaboa

13

62

mit

1

Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO

ml4cv unibo nvidia titanet d-vectors speaker-embeddings speaker-recognition speaker-verification speaker-identification

Created 2021-10-19

54 commits to main branch, last one 2 years ago

speakerbox CouncilDataProject

6

56

mit

6

Speakerbox: Fine-tune Audio Transformers for speaker identification.

speaker-id transformers audio-classification speaker-identification

Created 2022-01-25

71 commits to main branch, last one about a year ago

whisper-streamlit jojojaeger

17

47

other

1

This master's thesis project is based on OpenAI Whisper, with the goal of transcribing interviews

asr whisper clinical-research speaker-identification

Created 2022-05-12

148 commits to master branch, last one 8 months ago

eagle Picovoice

5

33

apache-2.0

9

On-device speaker recognition engine powered by deep learning

speaker-embedding speaker-recognition speaker-identification

Created 2023-05-03

87 commits to main branch, last one 26 days ago