101 results found Sort:

284
3.6k
apache-2.0
51
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
Created 2022-05-03
240 commits to main branch, last one about a month ago
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downlo...
Created 2024-07-29
68 commits to main branch, last one 8 days ago
95
1.8k
agpl-3.0
33
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
Created 2023-08-26
119 commits to main branch, last one 9 months ago
42
1.7k
agpl-3.0
28
an editor for spoken-word audio with automatic transcription
Created 2021-09-03
465 commits to main branch, last one about a year ago
74
1.6k
apache-2.0
25
Instant, controllable, local pre-trained AI models in Rust
Created 2023-05-24
1,412 commits to main branch, last one a day ago
🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI
Created 2023-05-10
17 commits to main branch, last one 8 months ago
45
1.2k
unknown
14
「硬地骇客 - 两个月 $12000 ARR 实践之路」是由 硬地骇客 团队编著,本书是关于 Podwise 产品历程的忠实记录:内容包含 灵感 - 构建 - 发布 - 增长 - 复盘 五个章节。如果你觉得一个人读不够过瘾,欢迎加入「硬地骇客」官方知识星球与专家们一起讨论!Podwise 的故事才刚刚开始,我们也将在星球持续分享我们的认知,成功可能无法复制,但失败一定可以借鉴。现在就点击下方链接加...
Created 2024-03-05
51 commits to main branch, last one 4 months ago
125
1.2k
unknown
14
Simple GUI for ByteDance's Piano Transcription with Pedals
Created 2021-02-03
193 commits to master branch, last one 4 months ago
90
1.1k
mit
21
A python package to build AI-powered real-time audio applications
Created 2021-08-09
342 commits to main branch, last one 7 months ago
This repository has no description...
Created 2024-05-18
220 commits to master branch, last one 8 days ago
视频音频生成字幕,生成srt文件。无需申请第三方API,本地实现音频转文本。基于Transformer的视频字幕生成框架。A GUI tool for generating subtitle from videos and generating srt files.
Created 2022-04-29
40 commits to main branch, last one about a year ago
Generate subtitles, summaries, and chapters from videos in seconds
Created 2023-03-22
38 commits to main branch, last one about a year ago
turnkey self-hosted offline transcription and diarization service with llm summary
Created 2023-11-13
60 commits to main branch, last one 3 months ago
Generate transcripts for audio and video content with a user friendly UI, powered by Open AI's Whisper with automatic translations and download videos automatically with yt-dlp integration
Created 2022-11-06
428 commits to master branch, last one about a year ago
78
761
gpl-3.0
17
The open-source iOS app that's making quality voice transcription more accessible on mobile devices.
Created 2022-12-23
348 commits to main branch, last one 3 months ago
25
715
mit
10
A command-line application to convert images, PDFs, and audio files to text using Apple's APIs
Created 2022-12-03
24 commits to main branch, last one about a year ago
🎤 The easiest way to transcribe audio in Swift
Created 2023-03-29
63 commits to master branch, last one about a year ago
68
600
apache-2.0
34
On-device streaming speech-to-text engine powered by deep learning
Created 2018-10-28
313 commits to master branch, last one 16 days ago
OBS plugin for local speech recognition and captioning using AI
Created 2023-08-10
223 commits to master branch, last one about a month ago
A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress
Created 2018-11-01
531 commits to master branch, last one 3 years ago
63
534
mit
11
Effortlessly add AI-generated transcription subtitles to your videos
Created 2022-10-11
51 commits to main branch, last one about a month ago
110
527
gpl-3.0
13
Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)
Created 2023-05-12
369 commits to main branch, last one 2 days ago
Self-hosted AI audio transcription
Created 2024-10-04
113 commits to main branch, last one 2 months ago
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
Created 2024-05-24
9 commits to main branch, last one 11 days ago
Easily take an entire YouTube playlist and turn it into high quality transcripts using Whisper.
Created 2023-11-12
23 commits to main branch, last one 4 months ago
Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers. https://subaligner.readthedocs.io/
Created 2019-12-16
286 commits to master branch, last one 28 days ago
27
436
apache-2.0
20
On-device speech-to-text engine powered by deep learning
Created 2020-01-14
293 commits to master branch, last one 6 days ago
Fast text based video editing, node Electron Os X desktop app, with Backbone front end.
Created 2016-09-08
588 commits to master branch, last one 4 years ago
25
413
bsd-3-clause
9
Command line speech recognition and transcription for macOS
Created 2022-02-23
69 commits to master branch, last one 2 months ago
85
377
apache-2.0
25
🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
Created 2018-06-18
646 commits to master branch, last one 3 years ago