22 results found
- Filter by Primary Language:
- Python (9)
- HTML (3)
- Jupyter Notebook (3)
- MATLAB (3)
- C++ (2)
- C (1)
- +
A sound cloning tool with a web interface that uses your voice or any other sound to record audio
Created
2023-11-19
170 commits to main branch, last one 14 days ago
Praat: Doing Phonetics By Computer
Created
2014-04-11
8,991 commits to master branch, last one a day ago
A high-quality speech analysis, manipulation and synthesis system
Created
2015-11-15
326 commits to master branch, last one about a month ago
General Speech Restoration
Created
2021-09-06
92 commits to main branch, last one 8 months ago
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. ...
asr
prosody
acoustic
adaptation
interspeech
transmission
audio-signals
interspeech2023
interspeech2024
speech-analysis
lexical-analysis
speech-synthesis
language-modeling
signal-processing
speech-production
speech-technology
speech-recognition
linguistic-analysis
machine-translation
self-supervised-learning
Created
2023-06-26
855 commits to main branch, last one 4 months ago
This repo summarizes tutorials, datasets, papers, code, and tools for the speech separation and speaker extraction tasks. Pull requests are welcome.
Created
2020-06-16
80 commits to master branch, last one 3 years ago
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN a...
timit
speech
speech-api
beamforming
librispeech
deeplearning
deep-learning
neural-network
speech-to-text
neural-networks
speech-analysis
speech-processing
speech-recognizer
speech-separation
speechrecognition
speech-recognition
speaker-recognition
speaker-verification
speaker-identification
speech-emotion-recognition
Created
2019-08-31
482 commits to master branch, last one 12 days ago
feature extraction from speech signals
Created
2017-08-05
153 commits to master branch, last one about a year ago
My-Voice Analysis is a Python library for the analysis of voice (simultaneous speech, high entropy) without the need for a transcription. It breaks utterances and detects syllable boundaries, fundament...
Created
2018-11-29
56 commits to master branch, last one 3 years ago
General Speech Restoration
Created
2021-09-26
62 commits to main branch, last one 11 months ago
A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to those of native speech.
Created
2018-12-27
55 commits to master branch, last one 3 years ago
Application of Connectionist Temporal Classification (CTC) to speech recognition (TensorFlow 1.0 but compatible with 2.0).
Created
2017-05-02
52 commits to master branch, last one 3 years ago
Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech-to-text conversion.
Created
2019-12-03
27 commits to master branch, last one 3 years ago
Pitch detection and pitch tracking, voiced/unvoiced detection (VAD)
Created
2019-04-01
103 commits to master branch, last one 2 years ago
Localized Narratives
Created
2020-01-23
27 commits to master branch, last one 3 years ago
Introduction to Speech Processing
Created
2022-05-09
254 commits to main branch, last one 3 months ago
An open-source harmonizer implementation leveraging the DISTRHO Plugin Framework.
Created
2020-10-03
97 commits to master branch, last one 8 months ago
SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)
Created
2024-10-07
9 commits to main branch, last one 17 days ago
A Python function package that computes the character error rate (CER) and word error rate (WER) of the output scripts of Korean STT sentence recognizers
Created
2022-04-05
29 commits to main branch, last one about a year ago
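As a rough illustration of the two metrics named in the entry above, here is a minimal sketch that assumes both are defined as the Levenshtein edit distance divided by the reference length. The function names `edit_distance`, `wer`, and `cer` are hypothetical and are not taken from the listed package, whose actual interface and normalization rules are not shown on this page.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (substitutions, insertions, deletions)."""
    prev = list(range(len(hyp) + 1))  # distance from an empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from an empty hypothesis prefix
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate: word-level edit distance / number of reference words."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis, ignore_spaces=True):
    """Character error rate: character-level edit distance / reference length.

    Dropping spaces is an assumption of this sketch, not a documented rule.
    """
    if ignore_spaces:
        reference = reference.replace(" ", "")
        hypothesis = hypothesis.replace(" ", "")
    if not reference:
        raise ValueError("reference must contain at least one character")
    return edit_distance(reference, hypothesis) / len(reference)


if __name__ == "__main__":
    print(wer("the cat sat on the mat", "the cat sit on mat"))
    print(cer("안녕하세요", "안녕하세오"))
```

For Korean text the character-level rate is often the more informative of the two, because a single wrong syllable block otherwise counts as a whole wrong word.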
SpeCT - Speech Corpus Toolkit for Praat. Documentation: https://lennes.github.io/spect/
Created
2017-03-08
49 commits to master branch, last one about a year ago
MATLAB real-time/interactive speech tools. This series is obsolete. SP3ARK is (or will be) the up-to-date series.
Created
2017-10-12
22 commits to master branch, last one 3 years ago
An implementation of frame-level speech signal-to-noise ratio estimation using deep learning
Created
2022-03-18
40 commits to main branch, last one 2 years ago