19 results found

382
3.3k
gpl-3.0
84
a library for audio and music analysis
Created 2009-12-04
4,161 commits to master branch, last one 11 months ago
Building and training a Speech Emotion Recognizer that predicts human emotions using Python, scikit-learn, and Keras
Created 2019-04-27
80 commits to master branch, last one about a year ago
.NET DSP library with a lot of audio processing functions
Created 2017-10-05
271 commits to master branch, last one 2 years ago
79
461
bsd-3-clause
11
:sound: spafe: Simplified Python Audio Features Extraction
Created 2019-09-16
373 commits to master branch, last one 6 months ago
75
374
gpl-3.0
26
A C++ Library for Audio Analysis
Created 2014-06-22
100 commits to master branch, last one 3 years ago
Front-end speech processing aims at extracting proper features from short-term segments of a speech utterance, known as frames. It is a prerequisite step toward any pattern recognition problem emplo...
Created 2018-03-16
197 commits to master branch, last one about a year ago
26
227
apache-2.0
18
A suite of speech signal processing tools
Created 2017-09-13
892 commits to master branch, last one 16 days ago
:sound: :boy: :girl: Voice-based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)
Created 2018-09-23
37 commits to master branch, last one about a year ago
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
Created 2021-02-25
164 commits to master branch, last one about a month ago
14
168
apache-2.0
10
A differentiable version of SPTK
Created 2022-03-08
731 commits to master branch, last one 5 days ago
16
149
apache-2.0
10
Synchronize your subtitles using machine learning
Created 2018-03-21
70 commits to master branch, last one 5 years ago
Personal wake word detector
Created 2020-06-06
71 commits to master branch, last one about a year ago
Humans speak a language with an accent. A particular accent necessarily reflects a person's linguistic background. The model determines the accent based on an audio recording. The result of the model could be use...
Created 2020-08-18
10 commits to master branch, last one 3 years ago
Zafar's Audio Functions in Python for audio signal analysis: STFT, inverse STFT, mel filterbank, mel spectrogram, MFCC, CQT kernel, CQT spectrogram, CQT chromagram, DCT, DST, MDCT, inverse MDCT.
Created 2020-08-17
250 commits to master branch, last one 10 months ago
Spectra extraction tutorials based on torch and torchaudio.
Created 2020-02-26
21 commits to master branch, last one about a year ago