Statistics for topic voice-assistant
RepositoryStats tracks 579,129 GitHub repositories, of which 125 are tagged with the voice-assistant topic. The most common primary language for repositories using this topic is Python (54). Other languages include JavaScript (11).
Stargazers over time for topic voice-assistant
Most starred repositories for topic voice-assistant
Trending repositories for topic voice-assistant
TEN Agent is the world’s first real-time multimodal agent integrated with the OpenAI Realtime API and RTC, featuring weather checks, web search, vision, and RAG capabilities.
Open Source framework for voice and multimodal conversational AI
A sound cloning tool with a web interface, using your voice or any sound to record audio
Home Assistant custom component that allows you to turn almost any camera and almost any speaker into a local voice assistant
M.I.L.E.S, a GPT-4-Turbo voice assistant, self-adapts its prompts and AI model, can play any Spotify song, adjusts system and Spotify volume, performs calculations, browses the web and internet, searc...
Voice assistant made as an experiment using neural networks for things like STT/TTS/Wake Word/NLU etc.
An open source voice-enabled, compact, empathic AI hardware + software 🤖 framework for companionship, entertainment, education, pediatric care, IoT robotics applications, AI-enhanced robotics applica...
A Python-based virtual assistant using Gemini AI. Features include voice recognition, text-to-speech, weather updates, news retrieval, jokes, Wikipedia info, and music management. Comes with an intera...
Your own personal voice assistant: Voice to Text to LLM to Speech, displayed in a web interface