77 results found

228
3.0k
apache-2.0
52
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
Created 2022-05-03
179 commits to main branch, last one about a month ago
🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI
Created 2023-05-10
17 commits to main branch, last one about a month ago
33
1.0k
unknown
11
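"硬地骇客: The Road to $12,000 ARR in Two Months" is written by the 硬地骇客 team. The book is a faithful record of the Podwise product journey, in five chapters: Inspiration, Build, Launch, Growth, and Retrospective. If reading alone is not enough for you, you are welcome to join the official 硬地骇客 community on 知识星球 (Knowledge Planet) and discuss with the experts! The Podwise story has only just begun, and we will keep sharing what we learn in the community. Success may not be replicable, but failure can always be learned from. Click the link below now to...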
Created 2024-03-05
49 commits to main branch, last one 17 hours ago
120
1.0k
unknown
10
Simple GUI for ByteDance's Piano Transcription with Pedals
Created 2021-02-03
173 commits to master branch, last one 4 months ago
64
962
agpl-3.0
20
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
Created 2023-08-26
119 commits to main branch, last one 2 months ago
72
842
mit
20
A python package to build AI-powered real-time audio applications
Created 2021-08-09
342 commits to main branch, last one 6 days ago
Generate subtitles, summaries, and chapters from videos in seconds
Created 2023-03-22
38 commits to main branch, last one about a year ago
Generate transcripts for audio and video content with a user-friendly UI, powered by OpenAI's Whisper, with automatic translation and automatic video downloads via yt-dlp integration
Created 2022-11-06
428 commits to master branch, last one about a year ago
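Generate subtitles from video and audio and output SRT files. No third-party API sign-up required; audio-to-text conversion runs locally. A Transformer-based video subtitle generation framework. A GUI tool for generating subtitles from videos and producing SRT files.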
Created 2022-04-29
40 commits to main branch, last one about a year ago
21
646
agpl-3.0
18
An editor for spoken-word audio with automatic transcription
Created 2021-09-03
465 commits to main branch, last one 7 months ago
67
620
gpl-3.0
16
The open-source iOS app that's making quality voice transcription more accessible on mobile devices.
Created 2022-12-23
255 commits to main branch, last one 24 days ago
20
585
mit
10
A command-line application to convert images, PDFs, and audio files to text using Apple's APIs
Created 2022-12-03
24 commits to main branch, last one about a year ago
66
564
apache-2.0
34
On-device streaming speech-to-text engine powered by deep learning
Created 2018-10-28
270 commits to master branch, last one about a month ago
Turnkey self-hosted offline transcription and diarization service with LLM summary
Created 2023-11-13
58 commits to main branch, last one 27 days ago
A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress
Created 2018-11-01
531 commits to master branch, last one 2 years ago
🎤 The easiest way to transcribe audio in Swift
Created 2023-03-29
63 commits to master branch, last one 9 months ago
60
508
mit
10
Effortlessly add AI-generated transcription subtitles to your videos
Created 2022-10-11
47 commits to main branch, last one 2 months ago
Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers. https://subaligner.readthedocs.io/
Created 2019-12-16
276 commits to master branch, last one 21 days ago
Fast text-based video editing: a Node/Electron OS X desktop app with a Backbone front end.
Created 2016-09-08
588 commits to master branch, last one 4 years ago
23
413
apache-2.0
18
On-device speech-to-text engine powered by deep learning
Created 2020-01-14
266 commits to master branch, last one about a month ago
82
370
apache-2.0
25
🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
Created 2018-06-18
646 commits to master branch, last one 3 years ago
Easily take an entire YouTube playlist and turn it into high quality transcripts using Whisper.
Created 2023-11-12
20 commits to main branch, last one 4 months ago
A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.
Created 2022-10-06
33 commits to main branch, last one about a year ago
44
297
gpl-3.0
13
Cutting-edge AI technology for automated audio transcription. A nice GUI for OpenAI's Whisper and pyannote (speaker identification)
Created 2023-05-12
330 commits to main branch, last one 5 days ago
17
290
bsd-3-clause
7
Command line speech recognition and transcription for macOS
Created 2022-02-23
67 commits to master branch, last one 2 months ago
23
282
gpl-2.0
9
OBS plugin for local speech recognition and captioning using AI
Created 2023-08-10
164 commits to master branch, last one a day ago
38
270
bsd-3-clause
16
Gecko - A Tool for Effective Annotation of Human Conversations
Created 2019-04-24
680 commits to master branch, last one about a year ago
Hangulize transcribes non-Korean words into Hangul
Created 2018-05-19
545 commits to main branch, last one 8 months ago
PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models
Created 2023-11-20
8 commits to main branch, last one 5 months ago
14
207
apache-2.0
19
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection.
Created 2022-06-03
15 commits to main branch, last one about a year ago