Trending repositories for topic audio
GUI for a Vocal Remover that uses Deep Neural Networks.
BlackHole is a modern macOS audio loopback driver that allows applications to pass audio to other applications with zero additional latency.
SRS is a simple, high-efficiency, real-time media server supporting RTMP, WebRTC, HLS, HTTP-FLV, HTTP-TS, SRT, MPEG-DASH, and GB28181.
Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate
NSMusicS,Multi platform Multi mode Music Software ,Electron(Vue3+Vite+TypeScript)
The high-speed OpenGL, OpenCL, OpenAL, OpenXR, GLFW, SDL, Vulkan, Assimp, WebGPU, and DirectX bindings library your mother warned you about.
The free and privacy-friendly screen recorder with no limits 🎥
Background Music, a macOS audio utility: automatically pause your music, set individual apps' volumes and record system audio.
💿 Free software that works great, and also happens to be open-source Python.
JUCE is an open-source cross-platform C++ application framework for desktop and mobile applications, including VST, VST3, AU, AUv3, LV2 and AAX audio plug-ins.
UI components and hooks for building video/audio players on the web. Robust, customizable, and accessible. Modern alternative to JW Player and Video.js.
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate
A simple MP3 and AAC Decoder (not only) for Arduino based on libhelix
GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch
Sample code which records system (speaker) sounds (what you hear) in Python.
TIDAL Media Downloader Next Generation! Up to HiRes Lossless / TIDAL MAX 24-bit, 192 kHz.
🎵 Privacy-focused, cross-platform, self-hostable Tidal instance.
NSMusicS,Multi platform Multi mode Music Software ,Electron(Vue3+Vite+TypeScript)
An audio recording helper for React. Provides a component and a hook to help with audio recording.
Official DeepSound repository migrated from jpinsoft.net. DeepSound is a freeware steganography tool and audio converter that hides secret data into audio files. The application also enables you to ex...
WebM multiplexer in pure TypeScript with support for WebCodecs API, video & audio.
light-weight TUI music player with Soundcloud & Youtube built-in. Effects, Themes, Midi Support for Win & Linux
YouTube video and audio extractor for iOS, watchOS, visionOS, tvOS and macOS
A sleek desktop music player and tagger for offline music 🪕. With gapless playback, smart playlists, and a map view! Built with Svelte and Tauri
A Library of Audio Steganography & Watermarking Algorithms
GUI for a Vocal Remover that uses Deep Neural Networks.
Background Music, a macOS audio utility: automatically pause your music, set individual apps' volumes and record system audio.
SRS is a simple, high-efficiency, real-time media server supporting RTMP, WebRTC, HLS, HTTP-FLV, HTTP-TS, SRT, MPEG-DASH, and GB28181.
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
The free and privacy-friendly screen recorder with no limits 🎥
NSMusicS,Multi platform Multi mode Music Software ,Electron(Vue3+Vite+TypeScript)
Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate
BlackHole is a modern macOS audio loopback driver that allows applications to pass audio to other applications with zero additional latency.
💿 Free software that works great, and also happens to be open-source Python.
UI components and hooks for building video/audio players on the web. Robust, customizable, and accessible. Modern alternative to JW Player and Video.js.
Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate
A One-Stop Multi-Level SoundSystem Abstraction (or say sound/audio engine). Suitable for being a solid foundation for Pro-Audio Applications(e.g. a DAW) or other sound related apps.
Versatile AI-driven audio upscaler to enhance the quality of any audio.
A Web and Native UI for ffmpeg-wasm: convert video, audio and images using the power of ffmpeg, directly from your web browser or from your computer.
NSMusicS,Multi platform Multi mode Music Software ,Electron(Vue3+Vite+TypeScript)
Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.
Models and datasets for training deep learning automatic mixing models
TIDAL Media Downloader Next Generation! Up to HiRes Lossless / TIDAL MAX 24-bit, 192 kHz.
🎵 Privacy-focused, cross-platform, self-hostable Tidal instance.
Official DeepSound repository migrated from jpinsoft.net. DeepSound is a freeware steganography tool and audio converter that hides secret data into audio files. The application also enables you to ex...
Sample code which records system (speaker) sounds (what you hear) in Python.
A simple MEMS I2S microphone and audio processing library for ESP32.
API for a Vocal Remover that uses Deep Neural Networks.
A simple MP3 and AAC Decoder (not only) for Arduino based on libhelix
light-weight TUI music player with Soundcloud & Youtube built-in. Effects, Themes, Midi Support for Win & Linux
YouTube video and audio extractor for iOS, watchOS, visionOS, tvOS and macOS
GUI for a Vocal Remover that uses Deep Neural Networks.
The free and privacy-friendly screen recorder with no limits 🎥
SRS is a simple, high-efficiency, real-time media server supporting RTMP, WebRTC, HLS, HTTP-FLV, HTTP-TS, SRT, MPEG-DASH, and GB28181.
Background Music, a macOS audio utility: automatically pause your music, set individual apps' volumes and record system audio.
BlackHole is a modern macOS audio loopback driver that allows applications to pass audio to other applications with zero additional latency.
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
💿 Free software that works great, and also happens to be open-source Python.
UI components and hooks for building video/audio players on the web. Robust, customizable, and accessible. Modern alternative to JW Player and Video.js.
Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.
Demux media files in the browser using WebAssembly, designed for WebCodecs 在浏览器中实现媒体文件的解封装,专为WebCodecs设计
Versatile AI-driven audio upscaler to enhance the quality of any audio.
logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source separation systems.
TIDAL Media Downloader Next Generation! Up to HiRes Lossless / TIDAL MAX 24-bit, 192 kHz.
Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate
A Web and Native UI for ffmpeg-wasm: convert video, audio and images using the power of ffmpeg, directly from your web browser or from your computer.
GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch
A simple yet effective Audio-to-Midi Automatic Piano Transcription system
Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities
Extract audio from those anime games with original filenames, paths and more
CTAG TBD >>to be determined<< an extendible open source Eurorack sound module
Additional examples to compliment TI's Bluetooth Low Energy Stack offerings.
Open source podcast instrument for Android in Kotlin with media3, supporting YouTube channels.
A simple MEMS I2S microphone and audio processing library for ESP32.
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Vaporizer2 hybrid wavetable additive / subtractive VST / AU / AAX synthesizer / sampler workstation plugin
Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate
TIDAL Media Downloader Next Generation! Up to HiRes Lossless / TIDAL MAX 24-bit, 192 kHz.
Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.
An all-in-one sound and music management addon for the Godot game engine.
Open source podcast instrument for Android in Kotlin with media3, supporting YouTube channels.
Official DeepSound repository migrated from jpinsoft.net. DeepSound is a freeware steganography tool and audio converter that hides secret data into audio files. The application also enables you to ex...
[IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer
Neural network inference template for real-time cricital audio environments - presented at ADC23
GUI for a Vocal Remover that uses Deep Neural Networks.
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
BlackHole is a modern macOS audio loopback driver that allows applications to pass audio to other applications with zero additional latency.
The free and privacy-friendly screen recorder with no limits 🎥
Set app volumes with real sliders! deej is an Arduino & Go project to let you build your own hardware mixer for Windows and Linux
SRS is a simple, high-efficiency, real-time media server supporting RTMP, WebRTC, HLS, HTTP-FLV, HTTP-TS, SRT, MPEG-DASH, and GB28181.
Background Music, a macOS audio utility: automatically pause your music, set individual apps' volumes and record system audio.
💿 Free software that works great, and also happens to be open-source Python.
A hands-on introduction to video technology: image, video, codec (av1, vp9, h265) and more (ffmpeg encoding). Translations: 🇺🇸 🇨🇳 🇯🇵 🇮🇹 🇰🇷 🇷🇺 🇧🇷 🇪🇸
A React component for playing a variety of URLs, including file paths, YouTube, Facebook, Twitch, SoundCloud, Streamable, Vimeo, Wistia and DailyMotion
a simple oscilloscope/vectorscope/spectroscope for your terminal
Vaporizer2 hybrid wavetable additive / subtractive VST / AU / AAX synthesizer / sampler workstation plugin
A Free Roland TB-303 Plugin for Windows, MacOS and Linux: VST2, VST3, LV2, CLAP and AU. A Juce port of Open303 engine.
The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
The BEST music separation model with help of A.I. ... to my ears ! 👂👂
Media server for real-time, low latency, programmable video and audio mixing.
Advanced Swift library for media conversion and manipulation
Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"
An audio visualizer for React. Provides separate components to visualize both live audio and audio blobs.
Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities"