Trending repositories for topic audio-processing
Cross-platform, customizable ML solutions for live and streaming media.
A C++ based, lightweight music and noise remover for YouTube and other internet media, using DeepFilterNet for audio enhancement.
InspireMusic: A Unified Framework for Music, Song, Audio Generation.
An implementation of the system-wide JamesDSP audio processing engine for non-rooted Android devices
LedFx is a network based LED effect engine designed to deliver advanced real-time audio effects to a wide variety of devices.
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
A Web and Native UI for ffmpeg-wasm: convert video, audio and images using the power of ffmpeg, directly from your web browser or from your computer.
Arcan - [Display Server, Multimedia Framework, Game Engine] -> "Desktop Engine"
List of articles related to deep learning applied to music
[EMNLP2024 Demo] A user-friendly library for reproducible video moment retrieval and highlight detection. It also supports audio moment retrieval.
Audio time stretch and pitch shift library. Enables music tempo adjustment, transposition, "smooth scrub" and "live pause".
Checkrr Scans your library files for corrupt media and optionally replaces the files via sonarr and radarr
A C++ based, lightweight music and noise remover for YouTube and other internet media, using DeepFilterNet for audio enhancement.
InspireMusic: A Unified Framework for Music, Song, Audio Generation.
A Web and Native UI for ffmpeg-wasm: convert video, audio and images using the power of ffmpeg, directly from your web browser or from your computer.
[EMNLP2024 Demo] A user-friendly library for reproducible video moment retrieval and highlight detection. It also supports audio moment retrieval.
Audio time stretch and pitch shift library. Enables music tempo adjustment, transposition, "smooth scrub" and "live pause".
An implementation of the system-wide JamesDSP audio processing engine for non-rooted Android devices
Checkrr Scans your library files for corrupt media and optionally replaces the files via sonarr and radarr
LedFx is a network based LED effect engine designed to deliver advanced real-time audio effects to a wide variety of devices.
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝
Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.
Speech, Language, Audio, Music Processing with Large Language Model
Arcan - [Display Server, Multimedia Framework, Game Engine] -> "Desktop Engine"
Cross-platform, customizable ML solutions for live and streaming media.
My curated list of audio DSP and plugin development resources
PipeWire Guide. Learn about how PipeWire gives your Linux system a Professional Audio/Video Processing workflow.
Cross-platform, customizable ML solutions for live and streaming media.
A C++ based, lightweight music and noise remover for YouTube and other internet media, using DeepFilterNet for audio enhancement.
InspireMusic: A Unified Framework for Music, Song, Audio Generation.
An implementation of the system-wide JamesDSP audio processing engine for non-rooted Android devices
LedFx is a network based LED effect engine designed to deliver advanced real-time audio effects to a wide variety of devices.
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝
My curated list of audio DSP and plugin development resources
The collection of pre-trained, state-of-the-art AI models for ailia SDK
Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.
Arcan - [Display Server, Multimedia Framework, Game Engine] -> "Desktop Engine"
A little package that brings sound to any Go application. Suitable for playback and audio-processing.
InspireMusic: A Unified Framework for Music, Song, Audio Generation.
A C++ based, lightweight music and noise remover for YouTube and other internet media, using DeepFilterNet for audio enhancement.
A Web and Native UI for ffmpeg-wasm: convert video, audio and images using the power of ffmpeg, directly from your web browser or from your computer.
Fast audio player, recorder, converter for Windows, Linux & Android
[EMNLP2024 Demo] A user-friendly library for reproducible video moment retrieval and highlight detection. It also supports audio moment retrieval.
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝
Easily train a good VC model with voice data <= 10 mins!
Python API & command-line tool to easily transcribe speech-based video files into clean text
Speech, Language, Audio, Music Processing with Large Language Model
Audio time stretch and pitch shift library. Enables music tempo adjustment, transposition, "smooth scrub" and "live pause".
An implementation of the system-wide JamesDSP audio processing engine for non-rooted Android devices
Checkrr Scans your library files for corrupt media and optionally replaces the files via sonarr and radarr
Cross-platform, customizable ML solutions for live and streaming media.
A C++ based, lightweight music and noise remover for YouTube and other internet media, using DeepFilterNet for audio enhancement.
InspireMusic: A Unified Framework for Music, Song, Audio Generation.
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
LedFx is a network based LED effect engine designed to deliver advanced real-time audio effects to a wide variety of devices.
An implementation of the system-wide JamesDSP audio processing engine for non-rooted Android devices
The collection of pre-trained, state-of-the-art AI models for ailia SDK
Data manipulation and transformation for audio signal processing, powered by PyTorch
PipeWire Guide. Learn about how PipeWire gives your Linux system a Professional Audio/Video Processing workflow.
A C++ based, lightweight music and noise remover for YouTube and other internet media, using DeepFilterNet for audio enhancement.
This project is a Python bot that automates the process of logging into Gmail, joining a Google Meet, recording the audio of the meeting, and then generating a summary, key points, action items, and s...
Repository for the paper "Combining audio control and style transfer using latent diffusion", accepted at ISMIR 2024
Digital Multi-Effect Pedal with Reverb, Delay, Tremolo, Looper, and Neural Networks for Amp Modeling
A Web and Native UI for ffmpeg-wasm: convert video, audio and images using the power of ffmpeg, directly from your web browser or from your computer.
Easily train a good VC model with voice data <= 10 mins!
[EMNLP2024 Demo] A user-friendly library for reproducible video moment retrieval and highlight detection. It also supports audio moment retrieval.
Example effects code and binaries for the Cleveland Music Co. Hothouse Digital Signal Processing Pedal Kit
Fast audio player, recorder, converter for Windows, Linux & Android
A complete, cross-platform solution to record, convert, filter and stream audio and video.
Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"
Checkrr Scans your library files for corrupt media and optionally replaces the files via sonarr and radarr
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝
A C++ based, lightweight music and noise remover for YouTube and other internet media, using DeepFilterNet for audio enhancement.
InspireMusic: A Unified Framework for Music, Song, Audio Generation.
🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.
A high-performance, "quantum-inspired" Fast Fourier Transform (FFT) library written in pure and safe Rust.
Easily train a good VC model with voice data <= 10 mins!
an architecture for neural network inference in real-time audio applications
Official code for SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound
WhisperClip simplifies your life by automatically transcribing audio recordings and saving the text directly to your clipboard. With just a click of a button, you can effortlessly convert spoken words...
Audio time stretch and pitch shift library. Enables music tempo adjustment, transposition, "smooth scrub" and "live pause".
[EMNLP2024 Demo] A user-friendly library for reproducible video moment retrieval and highlight detection. It also supports audio moment retrieval.
Cross-platform, customizable ML solutions for live and streaming media.
A library for audio and music analysis, feature extraction.
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
PipeWire Guide. Learn about how PipeWire gives your Linux system a Professional Audio/Video Processing workflow.
Isolate vocals, drums, bass, and other instrumental stems from any song
The collection of pre-trained, state-of-the-art AI models for ailia SDK
An implementation of the system-wide JamesDSP audio processing engine for non-rooted Android devices
A C++ based, lightweight music and noise remover for YouTube and other internet media, using DeepFilterNet for audio enhancement.
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Audio time stretch and pitch shift library. Enables music tempo adjustment, transposition, "smooth scrub" and "live pause".
[EMNLP2024 Demo] A user-friendly library for reproducible video moment retrieval and highlight detection. It also supports audio moment retrieval.
Digital Multi-Effect Pedal with Reverb, Delay, Tremolo, Looper, and Neural Networks for Amp Modeling
A music theory library in Rust for generating songs🎶
Official code for SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound
AI-powered YouTube Notes Generator: Create detailed notes from YouTube videos. Streamlit UI for easy use.
A collection amazing audio tools for working with audio and sound files in comfyUI
A Web and Native UI for ffmpeg-wasm: convert video, audio and images using the power of ffmpeg, directly from your web browser or from your computer.
Easily train a good VC model with voice data <= 10 mins!
Fast audio player, recorder, converter for Windows, Linux & Android