Statistics for topic whisper
RepositoryStats tracks 595,857 Github repositories, of these 297 are tagged with the whisper topic. The most common primary language for repositories using this topic is Python (156). Other languages include: TypeScript (32), Jupyter Notebook (19), JavaScript (16), C++ (11)
Stargazers over time for topic whisper
Most starred repositories for topic whisper (view more)
Trending repositories for topic whisper (view more)
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any...
An open-source project for Windows developers to learn how to add AI with local models and APIs to Windows apps.
Transcribe any audio or video file. Edit and view your transcripts in a standalone HTML editor.
Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (ASR...
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
An open-source project for Windows developers to learn how to add AI with local models and APIs to Windows apps.
批量为视频或者音频生成字幕,并可批量将字幕翻译成其它语言。这是一个客户端工具, 跨平台支持 mac 和 windows 系统, 支持百度,火山,deeplx, openai, deepseek, ollama 等多个翻译服务
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downlo...
Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (ASR...
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
An open-source project for Windows developers to learn how to add AI with local models and APIs to Windows apps.
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downlo...
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
A real-time, instant dictation desktop application built on Electron that uses Whisper and GROQ under the hood
Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (ASR...
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downlo...
Open source real-time translation app for Android that runs locally
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.
ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3
Open source real-time translation app for Android that runs locally