18 results found Sort:

371
3.2k
gpl-3.0
83
a library for audio and music analysis
Created 2009-12-04
4,161 commits to master branch, last one 5 months ago
Building and training Speech Emotion Recognizer that predicts human emotions using Python, Sci-kit learn and Keras
Created 2019-04-27
80 commits to master branch, last one about a year ago
.NET DSP library with a lot of audio processing functions
Created 2017-10-05
271 commits to master branch, last one about a year ago
75
435
bsd-3-clause
9
:sound: spafe: Simplified Python Audio Features Extraction
Created 2019-09-16
371 commits to master branch, last one a day ago
74
363
gpl-3.0
26
A C++ Library for Audio Analysis
Created 2014-06-22
100 commits to master branch, last one 2 years ago
Front-end speech processing aims at extracting proper features from short- term segments of a speech utterance, known as frames. It is a pre-requisite step toward any pattern recognition problem emplo...
Created 2018-03-16
197 commits to master branch, last one about a year ago
24
214
apache-2.0
17
A suite of speech signal processing tools
Created 2017-09-13
832 commits to master branch, last one 4 days ago
:sound: :boy: :girl:Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)
Created 2018-09-23
37 commits to master branch, last one about a year ago
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
Created 2021-02-25
156 commits to master branch, last one about a month ago
13
152
apache-2.0
9
A differentiable version of SPTK
Created 2022-03-08
561 commits to master branch, last one 11 days ago
16
138
apache-2.0
9
Synchronize your subtitles using machine learning
Created 2018-03-21
70 commits to master branch, last one 4 years ago
Personal wake word detector
Created 2020-06-06
71 commits to master branch, last one about a year ago
Zafar's Audio Functions in Python for audio signal analysis: STFT, inverse STFT, mel filterbank, mel spectrogram, MFCC, CQT kernel, CQT spectrogram, CQT chromagram, DCT, DST, MDCT, inverse MDCT.
Created 2020-08-17
250 commits to master branch, last one 3 months ago
The human speaks a language with an accent. A particular accent necessarily reflects a person's linguistic background. The model defines accent based audio record. The result of the model could be use...
Created 2020-08-18
10 commits to master branch, last one 2 years ago