Trending repositories for topic speech-enhancement
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NIPS 2024]
A must-read paper for speech separation based on neural networks
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NIPS 2024]
A must-read paper for speech separation based on neural networks
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
The PyTorch-based audio source separation toolkit for researchers
A must-read paper for speech separation based on neural networks
Official repository of Spiking-FullSubNet, the Intel N-DNS Challenge Algorithmic Track Winner.
VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NIPS 2024]
The official implementation of GTCRN, an ultra-lite speech enhancement model.
Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.
Official repository of Spiking-FullSubNet, the Intel N-DNS Challenge Algorithmic Track Winner.
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NIPS 2024]
The official implementation of GTCRN, an ultra-lite speech enhancement model.
Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.
A must-read paper for speech separation based on neural networks
The PyTorch-based audio source separation toolkit for researchers
The PyTorch-based audio source separation toolkit for researchers
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
The official implementation of GTCRN, an ultra-lite speech enhancement model.
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NIPS 2024]
VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration
A must-read paper for speech separation based on neural networks
Official repository of Spiking-FullSubNet, the Intel N-DNS Challenge Algorithmic Track Winner.
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
Clarity Challenge toolkit - software for building Clarity Challenge systems
Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.
Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation, clipping, equalization (EQ) distortion, packet loss, codec lo...
PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NIPS 2024]
Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation, clipping, equalization (EQ) distortion, packet loss, codec lo...
Official repository of Spiking-FullSubNet, the Intel N-DNS Challenge Algorithmic Track Winner.
VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration
The official implementation of GTCRN, an ultra-lite speech enhancement model.
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
Clarity Challenge toolkit - software for building Clarity Challenge systems
SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech denoising using an ONNX model. This repository contains everything...
Speech to Phoneme, Bandwidth Extension and Speaker Verification using the Vibravox dataset.
Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Real-time speech enhancement mobile app using Nested U-Net
TCNN Temporal convolutional neural network for real-time speech enhancement in the time domain
The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NIPS 2024]
VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration
Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.
SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech denoising using an ONNX model. This repository contains everything...
Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation, clipping, equalization (EQ) distortion, packet loss, codec lo...
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
The PyTorch-based audio source separation toolkit for researchers
The official implementation of GTCRN, an ultra-lite speech enhancement model.
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NIPS 2024]
A must-read paper for speech separation based on neural networks
VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration
Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
Official repository of Spiking-FullSubNet, the Intel N-DNS Challenge Algorithmic Track Winner.
Python implementation of performance metrics in Loizou's Speech Enhancement book
Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.
PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.
Official repository of Spiking-FullSubNet, the Intel N-DNS Challenge Algorithmic Track Winner.
SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech denoising using an ONNX model. This repository contains everything...
Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function" [ICASSP2024]
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
Introduction to Speech Processing
PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio
Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)
Clarity Challenge toolkit - software for building Clarity Challenge systems
Real-time speech enhancement mobile app using Nested U-Net
The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023
Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy...