118 results found

675
4.4k
mit
53
AI wearables. Put it on, speak, transcribe, automatically
Created 2024-03-22
6,817 commits to main branch, last one 6 hours ago
314
3.8k
apache-2.0
52
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
Created 2022-05-03
242 commits to main branch, last one 2 months ago
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isola...
Created 2024-07-29
95 commits to main branch, last one 8 hours ago
118
2.1k
agpl-3.0
34
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
Created 2023-08-26
121 commits to main branch, last one about a month ago
91
1.8k
apache-2.0
24
Instant, controllable, local pre-trained AI models in Rust
Created 2023-05-24
1,457 commits to main branch, last one 21 hours ago
43
1.7k
agpl-3.0
29
An editor for spoken-word audio with automatic transcription
Created 2021-09-03
465 commits to main branch, last one about a year ago
This repository has no description...
Created 2024-05-18
452 commits to master branch, last one 16 days ago
🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI
Created 2023-05-10
17 commits to main branch, last one 11 months ago
46
1.3k
unknown
14
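"硬地骇客: The Road to $12,000 ARR in Two Months" is written by the 硬地骇客 team. The book is a faithful record of the Podwise product journey, in five chapters: Inspiration, Build, Launch, Growth, and Retrospective. If reading alone is not enough for you, join the official 硬地骇客 community on 知识星球 to discuss it with experts! Podwise's story has only just begun, and we will keep sharing what we learn there. Success may not be replicable, but failures can always be learned from. Click the link below now to...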
Created 2024-03-05
51 commits to main branch, last one 7 months ago
130
1.2k
unknown
14
Simple GUI for ByteDance's Piano Transcription with Pedals
Created 2021-02-03
196 commits to master branch, last one about a month ago
95
1.2k
mit
18
A python package to build AI-powered real-time audio applications
Created 2021-08-09
347 commits to main branch, last one about a month ago
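Generate subtitles from video and audio and export SRT files. No third-party API key is needed; audio is transcribed to text locally. A Transformer-based video subtitle generation framework. A GUI tool for generating subtitles from videos and exporting SRT files.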
Created 2022-04-29
40 commits to main branch, last one about a year ago
Generate subtitles, summaries, and chapters from videos in seconds
Created 2023-03-22
38 commits to main branch, last one about a year ago
Turnkey self-hosted offline transcription and diarization service with LLM summaries
Created 2023-11-13
60 commits to main branch, last one 5 months ago
79
807
gpl-3.0
17
The open-source iOS app that's making quality voice transcription more accessible on mobile devices.
Created 2022-12-23
348 commits to main branch, last one 6 months ago
Generate transcripts for audio and video content with a user-friendly UI, powered by OpenAI's Whisper, with automatic translation and automatic video downloads through yt-dlp integration
Created 2022-11-06
428 commits to master branch, last one 2 years ago
26
737
mit
10
A command-line application to convert images, PDFs, and audio files to text using Apple's APIs
Created 2022-12-03
24 commits to main branch, last one about a year ago
OBS plugin for local speech recognition and captioning using AI
Created 2023-08-10
226 commits to master branch, last one about a month ago
143
709
gpl-3.0
12
Cutting-edge AI technology for automated audio transcription. A nice GUI for OpenAI's Whisper and pyannote (speaker identification)
Created 2023-05-12
431 commits to main branch, last one 25 days ago
Self-hosted AI audio transcription
Created 2024-10-04
138 commits to main branch, last one 24 days ago
🎤 The easiest way to transcribe audio in Swift
Created 2023-03-29
63 commits to master branch, last one about a year ago
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
Created 2024-05-24
9 commits to main branch, last one 3 months ago
70
619
apache-2.0
32
On-device streaming speech-to-text engine powered by deep learning
Created 2018-10-28
331 commits to master branch, last one 3 days ago
A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress
Created 2018-11-01
531 commits to master branch, last one 3 years ago
63
540
mit
10
Effortlessly add AI-generated transcription subtitles to your videos
Created 2022-10-11
51 commits to main branch, last one 4 months ago
Easily take an entire YouTube playlist and turn it into high-quality transcripts using Whisper.
Created 2023-11-12
24 commits to main branch, last one 23 days ago
Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers. https://subaligner.readthedocs.io/
Created 2019-12-16
295 commits to master branch, last one 26 days ago
26
449
bsd-3-clause
9
Command line speech recognition and transcription for macOS
Created 2022-02-23
69 commits to master branch, last one 4 months ago
27
448
apache-2.0
17
On-device speech-to-text engine powered by deep learning
Created 2020-01-14
313 commits to master branch, last one 4 days ago
Fast text-based video editing: a Node Electron OS X desktop app with a Backbone front end.
Created 2016-09-08
588 commits to master branch, last one 4 years ago