Trending repositories for topic audio
The free and privacy-friendly screen recorder with no limits 🎥
GUI for a Vocal Remover that uses Deep Neural Networks.
SRS is a simple, high-efficiency, real-time media server supporting RTMP, WebRTC, HLS, HTTP-FLV, HTTP-TS, SRT, MPEG-DASH, and GB28181.
Background Music, a macOS audio utility: automatically pause your music, set individual apps' volumes and record system audio.
BlackHole is a modern macOS audio loopback driver that allows applications to pass audio to other applications with zero additional latency.
Custom elements (web components) for making audio and video player controls that look great in your website or app.
A hands-on introduction to video technology: image, video, codec (av1, vp9, h265) and more (ffmpeg encoding). Translations: 🇺🇸 🇨🇳 🇯🇵 🇮🇹 🇰🇷 🇷🇺 🇧🇷 🇪🇸
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
Open source podcast instrument for Android supporting contents from YouTube and YT Music as well as normal podcasts.
WebAV is an SDK built on WebCodecs, designed for creating and editing video files on the web platform. WebAV 是基于 WebCodecs 构建的 SDK,用于在 Web 平台上创建/编辑视频文件。
html5 js 录音 mp3 wav ogg webm amr g711a g711u 格式,支持pc和Android、iOS部分浏览器、Hybrid App(提供Android iOS App源码)、微信,提供ASR语音识别转文字 H5版语音通话聊天示例 DTMF编码解码
UI components and hooks for building video/audio players on the web. Robust, customizable, and accessible. Modern alternative to JW Player and Video.js.
Audio playback and capture library written in C, in a single source file.
It sets up an audio stream server on an M5Cardputer, capturing microphone input and streaming it over a Wi-Fi connection to a web page.
SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer
Open source podcast instrument for Android supporting contents from YouTube and YT Music as well as normal podcasts.
ESP32 synth: all-in-one acid combo of two TB303's, a drum machine with fx chain, cd-quality
Native Python PTSL (Pro Tools Scripting Library) RPC interface
A collection of music service (iTunes, Qobuz, Spotify, TIDAL) APIs for media information retrieval and semi-automated music tagging.
All of my audio research in one handy repository, including research in dat files for sounds.dat54 and game.dat151, as well as all of my nametable findings and more!
整理(索引) Web 音视频相关的 API、SDK、文章、对外产品,帮助前端开发者入门/进阶音视频领域,推动音视频技术在 Web 平台的应用实践。
Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".
The free and privacy-friendly screen recorder with no limits 🎥
GUI for a Vocal Remover that uses Deep Neural Networks.
SRS is a simple, high-efficiency, real-time media server supporting RTMP, WebRTC, HLS, HTTP-FLV, HTTP-TS, SRT, MPEG-DASH, and GB28181.
Background Music, a macOS audio utility: automatically pause your music, set individual apps' volumes and record system audio.
BlackHole is a modern macOS audio loopback driver that allows applications to pass audio to other applications with zero additional latency.
Stream and file based music metadata parser for node. Supporting a wide range of audio and tag formats.
MediaCMS is a modern, fully featured open source video and media CMS, written in Python/Django and React, featuring a REST API.
Custom elements (web components) for making audio and video player controls that look great in your website or app.
WebAV is an SDK built on WebCodecs, designed for creating and editing video files on the web platform. WebAV 是基于 WebCodecs 构建的 SDK,用于在 Web 平台上创建/编辑视频文件。
It sets up an audio stream server on an M5Cardputer, capturing microphone input and streaming it over a Wi-Fi connection to a web page.
SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer
Simple and easy-to-use screen recorder for Windows. With a built-in file merge tool.
Native Python PTSL (Pro Tools Scripting Library) RPC interface
A simple and modern audio flyout for Windows 10/11, built with Fluent 2 Design principles.
Open source podcast instrument for Android supporting contents from YouTube and YT Music as well as normal podcasts.
ESP32 synth: all-in-one acid combo of two TB303's, a drum machine with fx chain, cd-quality
TIDAL Media Downloader Next Generation! Up to HiRes Lossless / TIDAL MAX 24-bit, 192 kHz.
An AirPlay Audio-Receiver for your Personal Computer or ARM-SoC (e.g. Raspberry Pi)
整理(索引) Web 音视频相关的 API、SDK、文章、对外产品,帮助前端开发者入门/进阶音视频领域,推动音视频技术在 Web 平台的应用实践。
Play the healing frequencies of various sets of tuning forks: Solfeggio, Organs, Mineral nutrients, Ohm, Chakras, Cosmic octave, Otto, DNA nucleotides... or custom.
Stream and file based music metadata parser for node. Supporting a wide range of audio and tag formats.
The free and privacy-friendly screen recorder with no limits 🎥
GUI for a Vocal Remover that uses Deep Neural Networks.
Custom elements (web components) for making audio and video player controls that look great in your website or app.
WebAV is an SDK built on WebCodecs, designed for creating and editing video files on the web platform. WebAV 是基于 WebCodecs 构建的 SDK,用于在 Web 平台上创建/编辑视频文件。
SRS is a simple, high-efficiency, real-time media server supporting RTMP, WebRTC, HLS, HTTP-FLV, HTTP-TS, SRT, MPEG-DASH, and GB28181.
A simple and modern audio flyout for Windows 10/11, built with Fluent 2 Design principles.
Background Music, a macOS audio utility: automatically pause your music, set individual apps' volumes and record system audio.
BlackHole is a modern macOS audio loopback driver that allows applications to pass audio to other applications with zero additional latency.
SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer
An AirPlay Audio-Receiver for your Personal Computer or ARM-SoC (e.g. Raspberry Pi)
Modern MP3 Player to listen your local music files on Android Lollipop 5.1.1+ & compatible with Android Auto.
It sets up an audio stream server on an M5Cardputer, capturing microphone input and streaming it over a Wi-Fi connection to a web page.
整理(索引) Web 音视频相关的 API、SDK、文章、对外产品,帮助前端开发者入门/进阶音视频领域,推动音视频技术在 Web 平台的应用实践。
WebAV is an SDK built on WebCodecs, designed for creating and editing video files on the web platform. WebAV 是基于 WebCodecs 构建的 SDK,用于在 Web 平台上创建/编辑视频文件。
Ultimate Vocal Remover 5 with Gradio UI. Separate an audio file into various stems, using multiple models
Custom elements (web components) for making audio and video player controls that look great in your website or app.
Open source podcast instrument for Android supporting contents from YouTube and YT Music as well as normal podcasts.
an architecture for neural network inference in real-time audio applications
Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate
TIDAL Media Downloader Next Generation! Up to HiRes Lossless / TIDAL MAX 24-bit, 192 kHz.
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.
Ultimate Vocal Remover 5 with Gradio UI. Separate an audio file into various stems, using multiple models
A simple and modern audio flyout for Windows 10/11, built with Fluent 2 Design principles.
Open source podcast instrument for Android supporting contents from YouTube and YT Music as well as normal podcasts.
An all-in-one sound and music management addon for the Godot game engine.
an architecture for neural network inference in real-time audio applications
SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer
[IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer
整理(索引) Web 音视频相关的 API、SDK、文章、对外产品,帮助前端开发者入门/进阶音视频领域,推动音视频技术在 Web 平台的应用实践。
[SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model
Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
GUI for a Vocal Remover that uses Deep Neural Networks.
The free and privacy-friendly screen recorder with no limits 🎥
BlackHole is a modern macOS audio loopback driver that allows applications to pass audio to other applications with zero additional latency.
Set app volumes with real sliders! deej is an Arduino & Go project to let you build your own hardware mixer for Windows and Linux
SRS is a simple, high-efficiency, real-time media server supporting RTMP, WebRTC, HLS, HTTP-FLV, HTTP-TS, SRT, MPEG-DASH, and GB28181.
Background Music, a macOS audio utility: automatically pause your music, set individual apps' volumes and record system audio.
💿 Free software that works great, and also happens to be open-source Python.
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
A React component for playing a variety of URLs, including file paths, YouTube, Facebook, Twitch, SoundCloud, Streamable, Vimeo, Wistia and DailyMotion
[SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model
A Web and Native UI for ffmpeg-wasm: convert video, audio and images using the power of ffmpeg, directly from your web browser or from your computer.
Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"
a simple oscilloscope/vectorscope/spectroscope for your terminal
Media server for real-time, low latency, programmable video and audio mixing.
App that will record system audio and send it off to the Shazam API to be identified. For when your phone's microphone just can't quite capture the song well enough for Shazam to figure it out.
Extract audio from those anime games with original filenames, paths and more
Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities"