Trending repositories for topic whisper

Last 3 days (new repositories)

no newly created repositories trending in the last 3 days

Last 3 days (absolute gain)

ggerganov/whisper.cpp

Port of OpenAI's Whisper model in C/C++

36,452 (+77)

mit

m-bain/whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

12,965 (+49)

bsd-2-clause

SYSTRAN/faster-whisper

Faster Whisper transcription with CTranslate2

13,065 (+45)

mit

modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

7,403 (+38)

xorbitsai/inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any...

5,735 (+31)

apache-2.0

chidiwilliams/buzz

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

12,864 (+24)

mit

nodetool-ai/nodetool

NodeTool - Your Creative AI Playground

82 (+20)

agpl-3.0

abus-aikorea/voice-pro

Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downlo...

2,372 (+19)

mit

NexaAI/nexa-sdk

Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (ASR...

5,096 (+19)

apache-2.0

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

3,901 (+18)

bsd-2-clause

PaddlePaddle/PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...

11,291 (+17)

apache-2.0

thewh1teagle/vibe

Transcribe on your own!

1,409 (+15)

mit

fedirz/faster-whisper-server

No description

910 (+15)

mit

tmoroney/auto-subs

Generate Subtitles & Diarize Speakers in Davinci Resolve using AI.

698 (+13)

mit

niedev/RTranslator

Open source real-time translation app for Android that runs locally

6,971 (+13)

apache-2.0

collabora/WhisperLive

A nearly-live implementation of OpenAI's Whisper.

2,219 (+12)

mit

buxuku/video-subtitle-master

批量为视频或者音频生成字幕，并可批量将字幕翻译成其它语言。这是一个客户端工具, 跨平台支持 mac 和 windows 系统, 支持百度，火山，deeplx, openai, deepseek, ollama 等多个翻译服务

501 (+11)

mit

Purfview/whisper-standalone-win

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

1,418 (+10)

pluja/whishper

Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!

1,726 (+8)

agpl-3.0

lenML/Speech-AI-Forge

🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.

913 (+7)

agpl-3.0

Last 3 days (relative gain)

nodetool-ai/nodetool

NodeTool - Your Creative AI Playground

82 (+32%)

agpl-3.0

bolna-ai/bolna

Full stack tools for building voice agents

107 (+4%)

mit

microsoft/ai-dev-gallery

An open-source project for Windows developers to learn how to add AI with local models and APIs to Windows apps.

223 (+2%)

mit

buxuku/video-subtitle-master

501 (+2%)

mit

machinelearningZH/audio-transcription

Transcribe any audio or video file. Edit and view your transcripts in a standalone HTML editor.

53 (+2%)

mit

tmoroney/auto-subs

Generate Subtitles & Diarize Speakers in Davinci Resolve using AI.

698 (+2%)

mit

fedirz/faster-whisper-server

No description

910 (+2%)

mit

thewh1teagle/vibe

Transcribe on your own!

1,409 (+1%)

mit

nyrahealth/CrisperWhisper

Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

478 (+1%)

ivnvxd/hack-interview

AI-powered tool for real-time interview question transcription and response generation.

195 (+1%)

mit

remotion-dev/template-tiktok

Generate TikTok-style captions with Whisper.cpp

109 (+0.9%)

abus-aikorea/voice-pro

2,372 (+0.8%)

mit

transcriptionstream/transcriptionstream

turnkey self-hosted offline transcription and diarization service with llm summary

765 (+0.8%)

gpl-3.0

lenML/Speech-AI-Forge

🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.

913 (+0.8%)

agpl-3.0

vilassn/whisper_android

Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android

272 (+0.7%)

mit

URUWorks/TeroSubtitler

Tero Subtitler is an open source, cross-platform, and free subtitle editing software.

276 (+0.7%)

mpl-2.0

Purfview/whisper-standalone-win

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

1,418 (+0.7%)

ntegrals/aura-voice

Aura is like Siri, but in your browser. An AI voice assistant optimized for low latency responses.

1,164 (+0.6%)

mit

azkadev/speech_to_text_telegram_bot_dart

Speech To Text Telegram Bot Dart

354 (+0.6%)

JosefAlbers/whisper-turbo-mlx

Blazing fast whisper turbo for ASR (speech-to-text) tasks

180 (+0.6%)

mit

Last week (new repositories)

no newly created repositories trending in the last week

Last week (absolute gain)

ggerganov/whisper.cpp

Port of OpenAI's Whisper model in C/C++

36,452 (+183)

mit

NexaAI/nexa-sdk

5,096 (+120)

apache-2.0

SYSTRAN/faster-whisper

Faster Whisper transcription with CTranslate2

13,065 (+110)

mit

m-bain/whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

12,965 (+100)

bsd-2-clause

modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

7,403 (+89)

xorbitsai/inference

5,735 (+75)

apache-2.0

chidiwilliams/buzz

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

12,864 (+57)

mit

nodetool-ai/nodetool

NodeTool - Your Creative AI Playground

82 (+45)

agpl-3.0

abus-aikorea/voice-pro

2,372 (+45)

mit

collabora/WhisperLive

A nearly-live implementation of OpenAI's Whisper.

2,219 (+39)

mit

fedirz/faster-whisper-server

No description

910 (+38)

mit

PaddlePaddle/PaddleSpeech

11,291 (+36)

apache-2.0

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

3,901 (+35)

bsd-2-clause

thewh1teagle/vibe

Transcribe on your own!

1,409 (+34)

mit

niedev/RTranslator

Open source real-time translation app for Android that runs locally

6,971 (+33)

apache-2.0

microsoft/ai-dev-gallery

An open-source project for Windows developers to learn how to add AI with local models and APIs to Windows apps.

223 (+32)

mit

argmaxinc/WhisperKit

On-device Speech Recognition for Apple Silicon

4,042 (+24)

mit

Purfview/whisper-standalone-win

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

1,418 (+23)

pluja/whishper

Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!

1,726 (+23)

agpl-3.0

Chenyme/Chenyme-AAVT

这是一个全自动（音频）视频翻译项目。利用Whisper识别声音，AI大模型翻译字幕，最后合并字幕视频，生成翻译后的视频。

1,933 (+22)

mit

Last week (relative gain)

nodetool-ai/nodetool

NodeTool - Your Creative AI Playground

82 (+122%)

agpl-3.0

microsoft/ai-dev-gallery

An open-source project for Windows developers to learn how to add AI with local models and APIs to Windows apps.

223 (+17%)

mit

bolna-ai/bolna

Full stack tools for building voice agents

107 (+13%)

mit

pavelzbornik/whisperX-FastAPI

FastAPI service on top of WhisperX

56 (+6%)

fedirz/faster-whisper-server

No description

910 (+4%)

mit

buxuku/video-subtitle-master

501 (+4%)

mit

machinelearningZH/audio-transcription

Transcribe any audio or video file. Edit and view your transcripts in a standalone HTML editor.

53 (+4%)

mit

nyrahealth/CrisperWhisper

Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

478 (+3%)

JacobLinCool/smart-whisper

Smart Whisper is a native Node.js addon designed for efficient and streamlined interaction with the whisper.cpp, with automatic model offloading and model manager.

35 (+3%)

mit

JosefAlbers/whisper-turbo-mlx

Blazing fast whisper turbo for ASR (speech-to-text) tasks

180 (+3%)

mit

tmoroney/auto-subs

Generate Subtitles & Diarize Speakers in Davinci Resolve using AI.

698 (+3%)

mit

ivnvxd/hack-interview

AI-powered tool for real-time interview question transcription and response generation.

195 (+3%)

mit

fengredrum/finetune-whisper-lora

Fine-Tune Whisper with Transformers and PEFT

41 (+3%)

mit

thewh1teagle/vibe

Transcribe on your own!

1,409 (+2%)

mit

NexaAI/nexa-sdk

5,096 (+2%)

apache-2.0

savbell/whisper-writer

💬📝 A small dictation app using OpenAI's Whisper speech recognition model.

386 (+2%)

gpl-3.0

j3soon/whisper-to-input

An Android keyboard that performs speech-to-text (STT/ASR) with OpenAI Whisper and input the recognized text; Supports English, Chinese, Japanese, etc. and even mixed languages.

51 (+2%)

abus-aikorea/voice-pro

2,372 (+2%)

mit

litongjava/whisper-cpp-server

whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++

54 (+2%)

mit

remotion-dev/template-tiktok

Generate TikTok-style captions with Whisper.cpp

109 (+2%)

Last month (new repositories)

no newly created repositories trending in the last month

Last month (absolute gain)

abus-aikorea/voice-pro

2,372 (+1,455)

mit

NexaAI/nexa-sdk

5,096 (+1,106)

apache-2.0

ggerganov/whisper.cpp

Port of OpenAI's Whisper model in C/C++

36,452 (+667)

mit

SYSTRAN/faster-whisper

Faster Whisper transcription with CTranslate2

13,065 (+484)

mit

m-bain/whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

12,965 (+399)

bsd-2-clause

modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

7,403 (+367)

ntegrals/aura-voice

Aura is like Siri, but in your browser. An AI voice assistant optimized for low latency responses.

1,164 (+337)

mit

xorbitsai/inference

5,735 (+286)

apache-2.0

chidiwilliams/buzz

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

12,864 (+252)

mit

thewh1teagle/vibe

Transcribe on your own!

1,409 (+176)

mit

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

3,901 (+159)

bsd-2-clause

nyrahealth/CrisperWhisper

Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

478 (+156)

CheshireCC/faster-whisper-GUI

faster_whisper GUI with PySide6

1,819 (+153)

agpl-3.0

microsoft/ai-dev-gallery

An open-source project for Windows developers to learn how to add AI with local models and APIs to Windows apps.

223 (+146)

mit

collabora/WhisperLive

A nearly-live implementation of OpenAI's Whisper.

2,219 (+146)

mit

Chenyme/Chenyme-AAVT

这是一个全自动（音频）视频翻译项目。利用Whisper识别声音，AI大模型翻译字幕，最后合并字幕视频，生成翻译后的视频。

1,933 (+139)

mit

niedev/RTranslator

Open source real-time translation app for Android that runs locally

6,971 (+131)

apache-2.0

fedirz/faster-whisper-server

No description

910 (+126)

mit

PaddlePaddle/PaddleSpeech

11,291 (+114)

apache-2.0

argmaxinc/WhisperKit

On-device Speech Recognition for Apple Silicon

4,042 (+95)

mit

Last month (relative gain)

nodetool-ai/nodetool

NodeTool - Your Creative AI Playground

82 (+193%)

agpl-3.0

microsoft/ai-dev-gallery

An open-source project for Windows developers to learn how to add AI with local models and APIs to Windows apps.

223 (+190%)

mit

abus-aikorea/voice-pro

2,372 (+159%)

mit

nyrahealth/CrisperWhisper

Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

478 (+48%)

ntegrals/aura-voice

Aura is like Siri, but in your browser. An AI voice assistant optimized for low latency responses.

1,164 (+41%)

mit

aviaryan/voice-writing-electron

A real-time, instant dictation desktop application built on Electron that uses Whisper and GROQ under the hood

46 (+39%)

apache-2.0

bolna-ai/bolna

Full stack tools for building voice agents

107 (+35%)

mit

pavelzbornik/whisperX-FastAPI

FastAPI service on top of WhisperX

56 (+33%)

zjrwtx/videotopdf_ui

视频转图文并茂的pdf—videotopdf：打工人（会议记录）和学生党（网课笔记）等必备！使用地址：https://zjrwtxtechstudio-video-to-pdf.hf.space

28 (+33%)

NexaAI/nexa-sdk

5,096 (+28%)

apache-2.0

thewh1teagle/pyannote-rs

pyannote audio diarization in rust

38 (+27%)

mit

buxuku/video-subtitle-master

501 (+17%)

mit

fedirz/faster-whisper-server

No description

910 (+16%)

mit

j3soon/whisper-to-input

An Android keyboard that performs speech-to-text (STT/ASR) with OpenAI Whisper and input the recognized text; Supports English, Chinese, Japanese, etc. and even mixed languages.

51 (+16%)

machinelearningZH/audio-transcription

Transcribe any audio or video file. Edit and view your transcripts in a standalone HTML editor.

53 (+15%)

mit

thewh1teagle/vibe

Transcribe on your own!

1,409 (+14%)

mit

PreternaturalAI/AI

The definitive, open-source Swift framework for interfacing with generative AI.

93 (+13%)

mit

tmoroney/auto-subs

Generate Subtitles & Diarize Speakers in Davinci Resolve using AI.

698 (+13%)

mit

Pikurrot/whisper-gui

A simple GUI to use Whisper.

116 (+13%)

mit

vilassn/whisper_android

Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android

272 (+12%)

mit

Last 12-months (new repositories)

NexaAI/nexa-sdk

5,096

apache-2.0

argmaxinc/WhisperKit

On-device Speech Recognition for Apple Silicon

4,042

mit

abus-aikorea/voice-pro

2,372

mit

thewh1teagle/vibe

Transcribe on your own!

1,409

mit

harry0703/AudioNotes

快速提取音视频内容，整理成一份结构化的markdown笔记

1,135

mit

lenML/Speech-AI-Forge

🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.

913

agpl-3.0

fedirz/faster-whisper-server

No description

910

mit

mezbaul-h/june

Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit

729

mit

OwlAIProject/Owl

A personal wearable AI that runs locally

535

mit

buxuku/video-subtitle-master

501

mit

ai-ng/swift

Fast voice assistant powered by Groq, Cartesia, and Vercel.

500

mit

nyrahealth/CrisperWhisper

Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

478

Bklieger/ScribeWizard

ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3

465

mit

apeatling/ollama-voice-mac

Mac compatible Ollama Voice

443

agpl-3.0

alexfazio/viral-clips-crew

Your CrewAI Powered Video Editing Assistant

436

mit

revdotcom/reverb

Open source inference code for Rev's model

347

apache-2.0

RayFernando1337/MLX-Auto-Subtitled-Video-Generator

Generate accurate transcripts using Apple's MLX framework

343

mit

azkadev/general_ai

GENERAL Ai Library For DART & Flutter

317

developersdigest/ai-devices

AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more

283

mit

microsoft/ai-dev-gallery

An open-source project for Windows developers to learn how to add AI with local models and APIs to Windows apps.

223

mit

Last 12-months (absolute gain)

ggerganov/whisper.cpp

Port of OpenAI's Whisper model in C/C++

36,452 (+9,967)

mit

niedev/RTranslator

Open source real-time translation app for Android that runs locally

6,971 (+6,843)

apache-2.0

SYSTRAN/faster-whisper

Faster Whisper transcription with CTranslate2

13,065 (+6,774)

mit

m-bain/whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

12,965 (+5,854)

bsd-2-clause

modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

7,403 (+5,800)

NexaAI/nexa-sdk

5,096 (+5,095)

apache-2.0

chidiwilliams/buzz

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

12,864 (+4,443)

mit

xorbitsai/inference

5,735 (+4,342)

apache-2.0

argmaxinc/WhisperKit

On-device Speech Recognition for Apple Silicon

4,042 (+3,968)

mit

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

3,901 (+2,488)

bsd-2-clause

abus-aikorea/voice-pro

2,372 (+2,371)

mit

xenova/whisper-web

ML-powered speech recognition directly in your browser

2,661 (+2,251)

mit

Chenyme/Chenyme-AAVT

这是一个全自动（音频）视频翻译项目。利用Whisper识别声音，AI大模型翻译字幕，最后合并字幕视频，生成翻译后的视频。

1,933 (+1,932)

mit

collabora/WhisperLive

A nearly-live implementation of OpenAI's Whisper.

2,219 (+1,919)

mit

PaddlePaddle/PaddleSpeech

11,291 (+1,899)

apache-2.0

CheshireCC/faster-whisper-GUI

faster_whisper GUI with PySide6

1,819 (+1,604)

agpl-3.0

thewh1teagle/vibe

Transcribe on your own!

1,409 (+1,408)

mit

pluja/whishper

Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!

1,726 (+1,325)

agpl-3.0

harry0703/AudioNotes

快速提取音视频内容，整理成一份结构化的markdown笔记

1,135 (+1,134)

mit

ntegrals/aura-voice

Aura is like Siri, but in your browser. An AI voice assistant optimized for low latency responses.

1,164 (+1,123)

mit

Last 12-months (relative gain)

lenML/Speech-AI-Forge

🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.

913 (+12,943%)

agpl-3.0

alexfazio/viral-clips-crew

Your CrewAI Powered Video Editing Assistant

436 (+8,620%)

mit

Bklieger/ScribeWizard

ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3

465 (+7,650%)

mit

transcriptionstream/transcriptionstream

turnkey self-hosted offline transcription and diarization service with llm summary

765 (+5,364%)

gpl-3.0

argmaxinc/WhisperKit

On-device Speech Recognition for Apple Silicon

4,042 (+5,362%)

mit

niedev/RTranslator

Open source real-time translation app for Android that runs locally

6,971 (+5,346%)

apache-2.0

RayFernando1337/MLX-Auto-Subtitled-Video-Generator

Generate accurate transcripts using Apple's MLX framework

343 (+3,711%)

mit

substratusai/kubeai

AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports LLMs, embeddings, and speech-to-text.

590 (+3,588%)

apache-2.0

ntegrals/aura-voice

Aura is like Siri, but in your browser. An AI voice assistant optimized for low latency responses.

1,164 (+2,739%)

mit

voxos-ai/bolna

End-to-end platform for building voice first multimodal agents

396 (+2,729%)

mit

tmoroney/auto-subs

Generate Subtitles & Diarize Speakers in Davinci Resolve using AI.

698 (+2,307%)

mit

JigsawStack/insanely-fast-whisper-api

An API to transcribe audio with OpenAI's Whisper Large v3!

212 (+2,020%)

mit

Woolverine94/biniou

a self-hosted webui for 30+ generative ai

503 (+1,696%)

gpl-3.0

AlexisBalayre/AI-Powered-Meeting-Summarizer

Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.

72 (+1,340%)

mit

shreesha345/AI-short-creator

AI-short-creator is an AI-powered tool that turns long videos into short clips. It works best for videos with multiple speakers and topics, such as interviews and documentaries. AI-short-creator finds...

111 (+1,288%)

mit

aviaryan/voice-writing-electron

A real-time, instant dictation desktop application built on Electron that uses Whisper and GROQ under the hood

46 (+1,050%)

apache-2.0

OwlAIProject/Owl

A personal wearable AI that runs locally

535 (+992%)

mit

microsoft/ai-dev-gallery

An open-source project for Windows developers to learn how to add AI with local models and APIs to Windows apps.

223 (+962%)

mit

j3soon/whisper-to-input

An Android keyboard that performs speech-to-text (STT/ASR) with OpenAI Whisper and input the recognized text; Supports English, Chinese, Japanese, etc. and even mixed languages.

51 (+920%)

alxpez/alts

100% free, local & offline voice assistant with speech recognition

60 (+900%)