Trending repositories for topic speech-enhancement

Last 3 days (new repositories)

no newly created repositories trending in the last 3 days

Last 3 days (absolute gain)

speechbrain/speechbrain

A PyTorch-based Speech Toolkit

8,999 (+15)

apache-2.0

resemble-ai/resemble-enhance

AI powered speech denoising and enhancement

1,475 (+9)

mit

Rikorose/DeepFilterNet

Noise supression using deep filtering

2,576 (+5)

espnet/espnet

End-to-End Speech Processing Toolkit

8,549 (+5)

apache-2.0

hangtingchen/Beam-Guided-TasNet

Beam-guided TasNet

48 (+1)

bsd-3-clause

Audio-WestlakeU/RealMAN

A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NIPS 2024]

98 (+1)

huyanxin/phasen

A unofficial Pytorch implementation of Microsoft's PHASEN

224 (+1)

JusperLee/Speech-Separation-Paper-Tutorial

A must-read paper for speech separation based on neural networks

763 (+1)

Last 3 days (relative gain)

hangtingchen/Beam-Guided-TasNet

Beam-guided TasNet

48 (+2%)

bsd-3-clause

Audio-WestlakeU/RealMAN

A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NIPS 2024]

98 (+1%)

resemble-ai/resemble-enhance

AI powered speech denoising and enhancement

1,475 (+0.6%)

mit

huyanxin/phasen

A unofficial Pytorch implementation of Microsoft's PHASEN

224 (+0.4%)

Rikorose/DeepFilterNet

Noise supression using deep filtering

2,576 (+0.2%)

speechbrain/speechbrain

A PyTorch-based Speech Toolkit

8,999 (+0.2%)

apache-2.0

JusperLee/Speech-Separation-Paper-Tutorial

A must-read paper for speech separation based on neural networks

763 (+0.1%)

espnet/espnet

End-to-End Speech Processing Toolkit

8,549 (+0.1%)

apache-2.0

Last week (new repositories)

no newly created repositories trending in the last week

Last week (absolute gain)

speechbrain/speechbrain

A PyTorch-based Speech Toolkit

8,999 (+27)

apache-2.0

resemble-ai/resemble-enhance

AI powered speech denoising and enhancement

1,475 (+20)

mit

Rikorose/DeepFilterNet

Noise supression using deep filtering

2,576 (+14)

espnet/espnet

End-to-End Speech Processing Toolkit

8,549 (+11)

apache-2.0

haoheliu/voicefixer

General Speech Restoration

1,050 (+5)

mit

yxlu-0102/MP-SENet

Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement

326 (+4)

mit

asteroid-team/asteroid

The PyTorch-based audio source separation toolkit for researchers

2,290 (+4)

mit

JusperLee/Speech-Separation-Paper-Tutorial

A must-read paper for speech separation based on neural networks

763 (+2)

hangtingchen/Beam-Guided-TasNet

Beam-guided TasNet

48 (+1)

bsd-3-clause

Speech-Interaction-Technology-Aalto-U/itsp

Introduction to Speech Processing

75 (+1)

cc-by-sa-4.0

haoxiangsnr/spiking-fullsubnet

Official repository of Spiking-FullSubNet, the Intel N-DNS Challenge Algorithmic Track Winner.

78 (+1)

mit

skirdey/voicerestore

VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration

90 (+1)

mit

Audio-WestlakeU/RealMAN

A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NIPS 2024]

98 (+1)

Xiaobin-Rong/gtcrn

The official implementation of GTCRN, an ultra-lite speech enhancement model.

220 (+1)

mit

huyanxin/phasen

A unofficial Pytorch implementation of Microsoft's PHASEN

224 (+1)

aishoot/LSTM_PIT_Speech_Separation

Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.

308 (+1)

Last week (relative gain)

hangtingchen/Beam-Guided-TasNet

Beam-guided TasNet

48 (+2%)

bsd-3-clause

resemble-ai/resemble-enhance

AI powered speech denoising and enhancement

1,475 (+1%)

mit

Speech-Interaction-Technology-Aalto-U/itsp

Introduction to Speech Processing

75 (+1%)

cc-by-sa-4.0

haoxiangsnr/spiking-fullsubnet

Official repository of Spiking-FullSubNet, the Intel N-DNS Challenge Algorithmic Track Winner.

78 (+1%)

mit

yxlu-0102/MP-SENet

Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement

326 (+1%)

mit

skirdey/voicerestore

VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration

90 (+1%)

mit

Audio-WestlakeU/RealMAN

A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NIPS 2024]

98 (+1%)

Rikorose/DeepFilterNet

Noise supression using deep filtering

2,576 (+0.5%)

haoheliu/voicefixer

General Speech Restoration

1,050 (+0.5%)

mit

Xiaobin-Rong/gtcrn

The official implementation of GTCRN, an ultra-lite speech enhancement model.

220 (+0.5%)

mit

huyanxin/phasen

A unofficial Pytorch implementation of Microsoft's PHASEN

224 (+0.4%)

aishoot/LSTM_PIT_Speech_Separation

Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.

308 (+0.3%)

speechbrain/speechbrain

A PyTorch-based Speech Toolkit

8,999 (+0.3%)

apache-2.0

JusperLee/Speech-Separation-Paper-Tutorial

A must-read paper for speech separation based on neural networks

763 (+0.3%)

asteroid-team/asteroid

The PyTorch-based audio source separation toolkit for researchers

2,290 (+0.2%)

mit

espnet/espnet

End-to-End Speech Processing Toolkit

8,549 (+0.1%)

apache-2.0

Last month (new repositories)

no newly created repositories trending in the last month

Last month (absolute gain)

speechbrain/speechbrain

A PyTorch-based Speech Toolkit

8,999 (+134)

apache-2.0

espnet/espnet

End-to-End Speech Processing Toolkit

8,549 (+93)

apache-2.0

resemble-ai/resemble-enhance

AI powered speech denoising and enhancement

1,475 (+79)

mit

Rikorose/DeepFilterNet

Noise supression using deep filtering

2,576 (+60)

asteroid-team/asteroid

The PyTorch-based audio source separation toolkit for researchers

2,290 (+25)

mit

ictnlp/StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

963 (+24)

mit

haoheliu/voicefixer

General Speech Restoration

1,050 (+22)

mit

yxlu-0102/MP-SENet

Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement

326 (+17)

mit

Xiaobin-Rong/gtcrn

The official implementation of GTCRN, an ultra-lite speech enhancement model.

220 (+16)

mit

Audio-WestlakeU/RealMAN

A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NIPS 2024]

98 (+12)

skirdey/voicerestore

VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration

90 (+9)

mit

JusperLee/Speech-Separation-Paper-Tutorial

A must-read paper for speech separation based on neural networks

763 (+9)

Speech-Interaction-Technology-Aalto-U/itsp

Introduction to Speech Processing

75 (+8)

cc-by-sa-4.0

haoxiangsnr/spiking-fullsubnet

Official repository of Spiking-FullSubNet, the Intel N-DNS Challenge Algorithmic Track Winner.

78 (+8)

mit

Audio-WestlakeU/FullSubNet

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

554 (+8)

mit

claritychallenge/clarity

Clarity Challenge toolkit - software for building Clarity Challenge systems

135 (+7)

mit

Xiaobin-Rong/deepvqe

An unofficial implementation of DeepVQE proposed by Microsoft Corp.

72 (+6)

breizhn/DTLN

Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.

587 (+6)

mit

nanless/universal-speech-enhancement

Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation, clipping, equalization (EQ) distortion, packet loss, codec lo...

41 (+5)

mit

audiolabs/torch-pesq

PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio

152 (+3)

mit

Last month (relative gain)

Audio-WestlakeU/RealMAN

A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NIPS 2024]

98 (+14%)

nanless/universal-speech-enhancement

41 (+14%)

mit

Speech-Interaction-Technology-Aalto-U/itsp

Introduction to Speech Processing

75 (+12%)

cc-by-sa-4.0

haoxiangsnr/spiking-fullsubnet

Official repository of Spiking-FullSubNet, the Intel N-DNS Challenge Algorithmic Track Winner.

78 (+11%)

mit

skirdey/voicerestore

VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration

90 (+11%)

mit

Xiaobin-Rong/deepvqe

An unofficial implementation of DeepVQE proposed by Microsoft Corp.

72 (+9%)

Xiaobin-Rong/gtcrn

The official implementation of GTCRN, an ultra-lite speech enhancement model.

220 (+8%)

mit

resemble-ai/resemble-enhance

AI powered speech denoising and enhancement

1,475 (+6%)

mit

yxlu-0102/MP-SENet

Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement

326 (+6%)

mit

claritychallenge/clarity

Clarity Challenge toolkit - software for building Clarity Challenge systems

135 (+5%)

mit

yuyun2000/SpeechDenoiser

SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech denoising using an ONNX model. This repository contains everything...

45 (+5%)

jhauret/vibravox

Speech to Phoneme, Bandwidth Extension and Speaker Verification using the Vibravox dataset.

27 (+4%)

mit

line/open-universe

Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.

74 (+3%)

apache-2.0

Xiaobin-Rong/TRT-SE

An example of a speech enhancement model deployed with TensorRT.

39 (+3%)

ictnlp/StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

963 (+3%)

mit

Rikorose/DeepFilterNet

Noise supression using deep filtering

2,576 (+2%)

Takaaki-Saeki/ssl_speech_restoration

SelfRemaster: SSL Speech Restoration

87 (+2%)

mit

JaeBinCHA7/Nested-U-Net-based-Real-time-Speech-Enhancement-Mobile-App

Real-time speech enhancement mobile app using Nested U-Net

45 (+2%)

mit

LXP-Never/TCNN

TCNN Temporal convolutional neural network for real-time speech enhancement in the time domain

47 (+2%)

RookieJunChen/Inter-SubNet

The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.

95 (+2%)

apache-2.0

Last 12-months (new repositories)

ictnlp/StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

963

mit

Audio-WestlakeU/RealMAN

A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NIPS 2024]

skirdey/voicerestore

VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration

mit

line/open-universe

Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.

apache-2.0

yuyun2000/SpeechDenoiser

nanless/universal-speech-enhancement

mit

Last 12-months (absolute gain)

speechbrain/speechbrain

A PyTorch-based Speech Toolkit

8,999 (+2,089)

apache-2.0

resemble-ai/resemble-enhance

AI powered speech denoising and enhancement

1,475 (+1,452)

mit

espnet/espnet

End-to-End Speech Processing Toolkit

8,549 (+1,131)

apache-2.0

Rikorose/DeepFilterNet

Noise supression using deep filtering

2,576 (+1,022)

ictnlp/StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

963 (+957)

mit

asteroid-team/asteroid

The PyTorch-based audio source separation toolkit for researchers

2,290 (+311)

mit

haoheliu/voicefixer

General Speech Restoration

1,050 (+295)

mit

Xiaobin-Rong/gtcrn

The official implementation of GTCRN, an ultra-lite speech enhancement model.

220 (+218)

mit

yxlu-0102/MP-SENet

Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement

326 (+214)

mit

Audio-WestlakeU/RealMAN

A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NIPS 2024]

98 (+97)

JusperLee/Speech-Separation-Paper-Tutorial

A must-read paper for speech separation based on neural networks

763 (+93)

skirdey/voicerestore

VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration

90 (+89)

mit

breizhn/DTLN

Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.

587 (+81)

mit

double22a/speech_dataset

The dataset of Speech Recognition

388 (+77)

apache-2.0

Audio-WestlakeU/FullSubNet

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

554 (+77)

mit

haoxiangsnr/spiking-fullsubnet

Official repository of Spiking-FullSubNet, the Intel N-DNS Challenge Algorithmic Track Winner.

78 (+69)

mit

schmiph2/pysepm

Python implementation of performance metrics in Loizou's Speech Enhancement book

391 (+68)

gpl-3.0

line/open-universe

Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.

74 (+67)

apache-2.0

audiolabs/torch-pesq

PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio

152 (+66)

mit

Xiaobin-Rong/deepvqe

An unofficial implementation of DeepVQE proposed by Microsoft Corp.

72 (+53)

Last 12-months (relative gain)

ictnlp/StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

963 (+15,950%)

mit

resemble-ai/resemble-enhance

AI powered speech denoising and enhancement

1,475 (+6,313%)

mit

line/open-universe

Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.

74 (+957%)

apache-2.0

haoxiangsnr/spiking-fullsubnet

Official repository of Spiking-FullSubNet, the Intel N-DNS Challenge Algorithmic Track Winner.

78 (+767%)

mit

yuyun2000/SpeechDenoiser

45 (+543%)

Audio-WestlakeU/RVAE-EM

Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function" [ICASSP2024]

42 (+320%)

mit

Xiaobin-Rong/deepvqe

An unofficial implementation of DeepVQE proposed by Microsoft Corp.

72 (+279%)

yxlu-0102/MP-SENet

Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement

326 (+191%)

mit

Speech-Interaction-Technology-Aalto-U/itsp

Introduction to Speech Processing

75 (+127%)

cc-by-sa-4.0

will-rice/denoisers

Simple PyTorch Denoisers for Waveform Audio

32 (+100%)

apache-2.0

audiolabs/torch-pesq

PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio

152 (+77%)

mit

haoxiangsnr/llm-tse

Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)

32 (+68%)

Rikorose/DeepFilterNet

Noise supression using deep filtering

2,576 (+66%)

claritychallenge/clarity

Clarity Challenge toolkit - software for building Clarity Challenge systems

135 (+63%)

mit

Picovoice/koala

On-device noise suppression powered by deep learning

63 (+50%)

apache-2.0

seorim0/NUNet-TLS

Nested U-Net with two-level skip connections for speech enhancement

30 (+50%)

mit

JaeBinCHA7/Nested-U-Net-based-Real-time-Speech-Enhancement-Mobile-App

Real-time speech enhancement mobile app using Nested U-Net

45 (+45%)

mit

Audio-WestlakeU/McNet

The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023

108 (+42%)

madhavmk/Noise2Noise-audio_denoising_without_clean_training_data

Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy...

178 (+40%)

mit

haoheliu/voicefixer

General Speech Restoration

1,050 (+39%)

mit