17 results found Sort:
- Filter by Primary Language:
- Python (13)
- Svelte (1)
- Tcl (1)
- TypeScript (1)
- +
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
Created
2024-09-10
13 commits to main branch, last one 3 months ago
A simple, high-quality voice conversion tool focused on ease of use and performance.
Created
2023-08-07
3,450 commits to main branch, last one 2 days ago
High-quality and streaming Speech-to-Speech interactive agent in a single file. 只用一个文件实现的流式全双工语音交互原型智能体!
Created
2024-09-26
46 commits to main branch, last one about a month ago
✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM
Created
2024-11-04
21 commits to main branch, last one about a month ago
A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.
Created
2023-08-16
36 commits to main branch, last one about a year ago
MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction models along with training and inference code, covering but not lim...
Created
2024-08-12
54 commits to master branch, last one about a month ago
This repository has no description...
speech
speech-synthesis
speech-to-speech
text-translation
speech-processing
speech-recognition
speech-translation
machine-translation
speech-to-subtitles
disfluency-detection
punctuation-restoration
simultaneous-translation
cascaded-speech-translation
multimodal-machine-learning
natural-language-processing
multimodal-machine-translation
non-autoregressive-translation
Created
2019-09-18
155 commits to master branch, last one 3 years ago
A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.
Created
2025-01-16
9 commits to main branch, last one 24 days ago
Samantha OS1 is a conversational AI assistant powered by the Realtime API from OpenAI
Created
2024-10-17
13 commits to main branch, last one about a month ago
Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".
Created
2023-10-07
22 commits to main branch, last one 6 months ago
If you've ever had the wish to talk to your AI Waifu using quality characters and voices for character voicing, then I suggest Soul of Waifu. Don't miss the opportunity to touch your dream!
Created
2023-09-02
68 commits to main branch, last one 2 months ago
svelte component for using the openai realtime api
Created
2024-10-04
18 commits to main branch, last one about a month ago
💬 "Realtime" voice transcription and cloning using ElevenLabs's API.
Created
2023-02-22
17 commits to master branch, last one about a year ago
Code for the INTERSPEECH 2023 paper "Learning When to Speak: Latency and Quality Trade-offs for Simultaneous Speech-to-Speech Translation with Offline Models"
Created
2023-01-31
54 commits to main branch, last one about a month ago
An easy-to-use, fast, and easily integrable tool for evaluating audio LLM
Created
2024-11-11
62 commits to main branch, last one a day ago
Speech to text to speech using Elevenlabs
Created
2023-02-05
110 commits to master branch, last one about a year ago
This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming architecture for fluid conversations with immediate responses and na...
Created
2025-01-02
77 commits to main branch, last one a day ago