6 results found Sort:

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Created 2016-03-07
2,415 commits to develop branch, last one 9 days ago
235
842
unknown
44
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Created 2017-04-18
115 commits to master branch, last one 3 years ago
CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
Created 2018-03-12
246 commits to master branch, last one 12 days ago
29
142
gpl-3.0
5
Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper
Created 2020-03-31
35 commits to master branch, last one 3 years ago
The codebase for Data-driven general-purpose voice activity detection.
Created 2020-05-20
42 commits to master branch, last one 2 years ago
Speaker change detection using SincNet and an LSTM/Transformer
Created 2022-11-17
3 commits to master branch, last one 4 months ago