Trending repositories for topic whisper
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any...
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
Swift native on-device speech recognition with Whisper for Apple Silicon
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Production First and Production Ready End-to-End Speech Recognition Toolkit
Simply forward a video or voice message in any language to the bot, and it will reply with a translation.
The definitive, open-source Swift framework for interfacing with generative AI.
AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more
Generate subtitles using OpenAI Whisper in Davinci Resolve editing software.
An API to transcribe audio with OpenAI's Whisper Large v3!
Takes your video and generates video title, description, hashtags, transcription, subtitles and more.
Introducing NodeJS Bindings for Whisper - the CPU version of OpenAI's Whisper, as initially crafted in C++ by ggerganov.
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any...
Transcribe and translate voice into LRC file using Whisper and LLMs (GPT, Claude, et,al). 使用whisper和LLM(GPT,Claude等)来转录、翻译你的音频为字幕文件。
This repository contains a Python script that allows users to download the audio from a YouTube video, transcribe it into text, detect the language and save the transcription in txt file automatically...
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any...
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more
Swift native on-device speech recognition with Whisper for Apple Silicon
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
Production First and Production Ready End-to-End Speech Recognition Toolkit
AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more
The definitive, open-source Swift framework for interfacing with generative AI.
Introducing NodeJS Bindings for Whisper - the CPU version of OpenAI's Whisper, as initially crafted in C++ by ggerganov.
Simply forward a video or voice message in any language to the bot, and it will reply with a translation.
🎤📄 An innovative tool that transforms audio or video files into text transcripts and generates concise meeting minutes. Stay organized and efficient in your meetings, and get ready for Phase 2 where...
An API to transcribe audio with OpenAI's Whisper Large v3!
AI-short-creator is an AI-powered tool that turns long videos into short clips. It works best for videos with multiple speakers and topics, such as interviews and documentaries. AI-short-creator finds...
This project is a digital human that can talk and listen to you. It uses OpenAI's GPT-3 to generate responses, OpenAI's Whisper to transcript the audio, Eleven Labs to generate voice and Rhubarb Lip S...
Imagine translating your speech or anybody's speech to any language you want within minutes. check this out...
Whisper Speech-to-Text is a JavaScript library for recording and transcribing user audio into text via OpenAI's Whisper, intended for web applications.
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and test)
💬📝 A small dictation app using OpenAI's Whisper speech recognition model.
Simply forward a video or voice message in any language to the bot, and it will reply with a translation.
turnkey self-hosted offline transcription and diarization service with llm summary
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any...
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
Swift native on-device speech recognition with Whisper for Apple Silicon
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
turnkey self-hosted offline transcription and diarization service with llm summary
AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more
An API to transcribe audio with OpenAI's Whisper Large v3!
The definitive, open-source Swift framework for interfacing with generative AI.
From AI tools to TikTok video creation using FFMPEG, Microsoft Edge read aloud and OpenAI Whisper model
WhisperClip simplifies your life by automatically transcribing audio recordings and saving the text directly to your clipboard. With just a click of a button, you can effortlessly convert spoken words...
Simply forward a video or voice message in any language to the bot, and it will reply with a translation.
This project is a digital human that can talk and listen to you. It uses OpenAI's GPT-3 to generate responses, OpenAI's Whisper to transcript the audio, Eleven Labs to generate voice and Rhubarb Lip S...
Introducing NodeJS Bindings for Whisper - the CPU version of OpenAI's Whisper, as initially crafted in C++ by ggerganov.
Generate subtitles using OpenAI Whisper in Davinci Resolve editing software.
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection
OBS plugin for local speech recognition and captioning using AI
whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any...
Swift native on-device speech recognition with Whisper for Apple Silicon
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model.
turnkey self-hosted offline transcription and diarization service with llm summary
Gp.nvim (GPT prompt) Neovim AI plugin: ChatGPT sessions & Instructable text/code operations & Speech to text [OpenAI]
An all-in-one AI audio playground using Cloudflare AI Workers to transcribe, analyze, summarize, and translate any audio file.
Easily take an entire YouTube playlist and turn it into high quality transcripts using Whisper.
AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any...
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
Swift native on-device speech recognition with Whisper for Apple Silicon
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
ChatGPT Java SDK支持流式输出、Gpt插件、联网。支持OpenAI官方所有接口。ChatGPT的Java客户端。OpenAI GPT-3.5-Turb GPT-4 Api Client for Java
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
🤖 A Telegram bot that integrates with OpenAI's official ChatGPT APIs to provide answers, written in Python
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
Generate subtitles using OpenAI Whisper in Davinci Resolve editing software.
Swift native on-device speech recognition with Whisper for Apple Silicon
Easily take an entire YouTube playlist and turn it into high quality transcripts using Whisper.
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
This repository contains a Python script that allows users to download the audio from a YouTube video, transcribe it into text, detect the language and save the transcription in txt file automatically...
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine
Tero Subtitler is an open source, cross-platform, and free subtitle editing software.
Transcribe and translate voice into LRC file using Whisper and LLMs (GPT, Claude, et,al). 使用whisper和LLM(GPT,Claude等)来转录、翻译你的音频为字幕文件。
Implementation of the paper "Improved DeepFake Detection Using Whisper Features"
You AI companion. ChatGPT and translation for Monocle AR
🎬 Auto Captions for Final Cut Pro Powered by OpenAI's Whisper Model
An API to transcribe audio with OpenAI's Whisper Large v3!
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.