Trending repositories for topic audio
GUI for a Vocal Remover that uses Deep Neural Networks.
BlackHole is a modern macOS audio loopback driver that allows applications to pass audio to other applications with zero additional latency.
The free and privacy-friendly screen recorder with no limits 🎥
A collection of header only classes, permissively licensed, to provide basic useful tasks with the bare-minimum of dependencies.
MediaCMS is a modern, fully featured open source video and media CMS, written in Python/Django and React, featuring a REST API.
Background Music, a macOS audio utility: automatically pause your music, set individual apps' volumes and record system audio.
Custom elements (web components) for making audio and video player controls that look great in your website or app.
A React component for playing a variety of URLs, including file paths, YouTube, Facebook, Twitch, SoundCloud, Streamable, Vimeo, Wistia and DailyMotion
Set app volumes with real sliders! deej is an Arduino & Go project to let you build your own hardware mixer for Windows and Linux
SRS is a simple, high-efficiency, real-time video server supporting RTMP, WebRTC, HLS, HTTP-FLV, SRT, MPEG-DASH, and GB28181.
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
An official implementation for SSAMBA: Self-Supervised Audio Mamba
an architecture for neural network inference in real-time audio applications
A simple web audio recording library with encoding to MP3 and optional streaming/chunked output. Now with React hook and component!
A collection of header only classes, permissively licensed, to provide basic useful tasks with the bare-minimum of dependencies.
Open-source media server for real-time, low latency, programmable video and audio mixing.
Amp Rack is a Guitar / Voice Audio Effects Processor for Android. Amp Rack is an Open Source LADSPA Plugins Host for Android. More than 150 high quality audio plugins are available which can be added ...
A local GUI music source separation tool built on Tkinter and demucs serving as a free and open source Stem Player
A sleek desktop music player and tagger for offline music 🪕 With experimental features like map view, GPT analysis, artist toolkit. Built with Svelte and Tauri
A Free Roland TB-303 Plugin for Windows, MacOS and Linux: VST2, VST3, LV2, CLAP and AU. A Juce port of Open303 engine.
TIDAL Media Downloader Next Generation! Up to HiRes Lossless / TIDAL MAX 24-bit, 192 kHz.
Record Audio from the User's Microphone in Apps that are Deployed to the Web. (via Browser Media-API, REACT-based, Streamlit Custom Component)
ESP32 synth: all-in-one acid combo TB303 + TB303 + drum machine + fx chain, cd-quality, no lag
Script to enable audio support on many Chrome devices
Custom elements (web components) for making audio and video player controls that look great in your website or app.
GUI for a Vocal Remover that uses Deep Neural Networks.
The free and privacy-friendly screen recorder with no limits 🎥
A sleek desktop music player and tagger for offline music 🪕 With experimental features like map view, GPT analysis, artist toolkit. Built with Svelte and Tauri
BlackHole is a modern macOS audio loopback driver that allows applications to pass audio to other applications with zero additional latency.
SRS is a simple, high-efficiency, real-time video server supporting RTMP, WebRTC, HLS, HTTP-FLV, SRT, MPEG-DASH, and GB28181.
Background Music, a macOS audio utility: automatically pause your music, set individual apps' volumes and record system audio.
Set app volumes with real sliders! deej is an Arduino & Go project to let you build your own hardware mixer for Windows and Linux
A collection of header only classes, permissively licensed, to provide basic useful tasks with the bare-minimum of dependencies.
MediaCMS is a modern, fully featured open source video and media CMS, written in Python/Django and React, featuring a REST API.
A React component for playing a variety of URLs, including file paths, YouTube, Facebook, Twitch, SoundCloud, Streamable, Vimeo, Wistia and DailyMotion
An official implementation for SSAMBA: Self-Supervised Audio Mamba
A sleek desktop music player and tagger for offline music 🪕 With experimental features like map view, GPT analysis, artist toolkit. Built with Svelte and Tauri
Open-source media server for real-time, low latency, programmable video and audio mixing.
an architecture for neural network inference in real-time audio applications
light-weight TUI music player with Soundcloud & Youtube support, with Effects. Win & Linux
TIDAL Media Downloader Next Generation! Up to HiRes Lossless / TIDAL MAX 24-bit, 192 kHz.
A simple web audio recording library with encoding to MP3 and optional streaming/chunked output. Now with React hook and component!
A Web and Native UI for ffmpeg-wasm: convert video, audio and images using the power of ffmpeg, directly from your web browser or from your computer.
A collection of header only classes, permissively licensed, to provide basic useful tasks with the bare-minimum of dependencies.
This repository contains a Python script that allows users to download the audio from a YouTube video, transcribe it into text, detect the language and save the transcription in txt file automatically...
Official DeepSound repository migrated from jpinsoft.net. DeepSound is a freeware steganography tool and audio converter that hides secret data into audio files. The application also enables you to ex...
Self-contained 5W Class-D amplifier inside a Speakon connector
An official implementation for SSAMBA: Self-Supervised Audio Mamba
BlackHole is a modern macOS audio loopback driver that allows applications to pass audio to other applications with zero additional latency.
GUI for a Vocal Remover that uses Deep Neural Networks.
The free and privacy-friendly screen recorder with no limits 🎥
Radient turns many data types (not just text) into vectors for similarity search, clustering, regression analysis, and more.
SRS is a simple, high-efficiency, real-time video server supporting RTMP, WebRTC, HLS, HTTP-FLV, SRT, MPEG-DASH, and GB28181.
Background Music, a macOS audio utility: automatically pause your music, set individual apps' volumes and record system audio.
Set app volumes with real sliders! deej is an Arduino & Go project to let you build your own hardware mixer for Windows and Linux
A sleek desktop music player and tagger for offline music 🪕 With experimental features like map view, GPT analysis, artist toolkit. Built with Svelte and Tauri
Radient turns many data types (not just text) into vectors for similarity search, clustering, regression analysis, and more.
An official implementation for SSAMBA: Self-Supervised Audio Mamba
Self-contained 5W Class-D amplifier inside a Speakon connector
A sleek desktop music player and tagger for offline music 🪕 With experimental features like map view, GPT analysis, artist toolkit. Built with Svelte and Tauri
light-weight TUI music player with Soundcloud & Youtube support, with Effects. Win & Linux
Open-source media server for real-time, low latency, programmable video and audio mixing.
an architecture for neural network inference in real-time audio applications
Official DeepSound repository migrated from jpinsoft.net. DeepSound is a freeware steganography tool and audio converter that hides secret data into audio files. The application also enables you to ex...
TIDAL Media Downloader Next Generation! Up to HiRes Lossless / TIDAL MAX 24-bit, 192 kHz.
API for a Vocal Remover that uses Deep Neural Networks.
Split jack(headphones)/speakers outputs into individual sinks on Linux to allow simultaneous playback (listen to different audio streams on each port)
React library for audio recording and visualization using the Web Audio API
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
Vaporizer2 hybrid wavetable additive / subtractive VST / AU / AAX synthesizer / sampler workstation plugin
Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"
Radient turns many data types (not just text) into vectors for similarity search, clustering, regression analysis, and more.
Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate
An all-in-one sound and music management addon for the Godot game engine.
TIDAL Media Downloader Next Generation! Up to HiRes Lossless / TIDAL MAX 24-bit, 192 kHz.
A Free Roland TB-303 Plugin for Windows, MacOS and Linux: VST2, VST3, LV2, CLAP and AU. A Juce port of Open303 engine.
GUI for a Vocal Remover that uses Deep Neural Networks.
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
BlackHole is a modern macOS audio loopback driver that allows applications to pass audio to other applications with zero additional latency.
SRS is a simple, high-efficiency, real-time video server supporting RTMP, WebRTC, HLS, HTTP-FLV, SRT, MPEG-DASH, and GB28181.
The free and privacy-friendly screen recorder with no limits 🎥
Set app volumes with real sliders! deej is an Arduino & Go project to let you build your own hardware mixer for Windows and Linux
💿 Free software that works great, and also happens to be open-source Python.
Background Music, a macOS audio utility: automatically pause your music, set individual apps' volumes and record system audio.
A hands-on introduction to video technology: image, video, codec (av1, vp9, h265) and more (ffmpeg encoding). Translations: 🇺🇸 🇨🇳 🇯🇵 🇮🇹 🇰🇷 🇷🇺 🇧🇷 🇪🇸
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Process audio/video data in the browser using WebCodecs. 基于 WebCodecs 在浏览器中处理音视频数据。
Vaporizer2 hybrid wavetable additive / subtractive VST / AU / AAX synthesizer / sampler workstation plugin
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
This repository contains a Python script that allows users to download the audio from a YouTube video, transcribe it into text, detect the language and save the transcription in txt file automatically...
Script to enable audio support on many Chrome devices
Extension to passthrough pipewire audio to WebRTC Screenshare
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
The Fraunhofer MPEG-H decoder (mpeghdec) is a C/C++ implementation of the MPEG-H Audio standard as defined in ISO/IEC 23008-3:2022
The Hugging Face Course on Transformers for Audio
AI based tool to convert vocals lyrics and pitch from music to autogenerate Ultrastar Deluxe, Midi and notes. It automatic tapping, adding text, pitch vocals and creates karaoke files.
Programmatic minimalistic audio visualizations.
MilkDrop 3.0, supports any audio source, double-preset (.milk2), loading presets based on beat detection and much more...
A simple music and sound effect player for the Godot Engine
Asynchronous audio loading from remote or local destination. It has two layers of configurable cache system: RAM and Disk.