Trending repositories for topic voice-assistant
TEN Agent is a world-class multimodal AI agent integrated with the OpenAI Realtime API and RTC, featuring weather checks, web search, vision, and RAG.
Open Source framework for voice and multimodal conversational AI
An open source voice-enabled, compact, empathic AI hardware + software 🤖 framework for companionship, entertainment, education, pediatric care, IoT robotics applications, AI-enhanced robotics applica...
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
A sound cloning tool with a web interface, using your voice or any sound to record audio
Home Assistant custom component that allows you to turn almost any camera and almost any speaker into a local voice assistant
M.I.L.E.S, a GPT-4-Turbo voice assistant, self-adapts its prompts and AI model, can play any Spotify song, adjusts system and Spotify volume, performs calculations, browses the web and internet, searc...
Your Own Personal Voice Assistant. It's a mini Python project.
Voice assistant made as an experiment using neural networks for things like STT/TTS/Wake Word/NLU etc.
Mycroft Core, the Mycroft Artificial Intelligence platform.
Virtual control of a computer using hand gestures and voice commands, built with MediaPipe and OpenCV in Python.
Adds support for Yandex Smart Home (Alice voice assistant) and Marusia voice assistant into Home Assistant
Open Voice OS and/or HiveMind installer using Ansible with an intuitive and easy Text-based User Interface
A curated list of awesome things related to artificial intelligence tools
Irina is a Russian voice assistant that works offline. It supports skills through plugins.
A Private and Open Source AI-Powered Voice Assistant & Multisensor for Home Assistant
Starter project for building real-time AI Voice Assistants
A Python-based virtual assistant using Gemini AI. Features include voice recognition, text-to-speech, weather updates, news retrieval, jokes, Wikipedia info, and music management. Comes with an intera...
Real-time Voice Activity Detection (VAD) with example use cases such as a simple voice bot and live (real-time) transcription
Arduinobot is an open-source 3D printed robot arm powered by ROS 2. Its simple design and low cost make it an excellent learning tool, featured in the "Robotics and ROS 2 - Learn by Doing! Manipulator...
Virtual Voice Assistant is a project that utilizes machine learning and natural language processing to enable users to control their devices using voice commands. Technologies used include TensorFlow,...
Your own personal voice assistant: Voice to Text to LLM to Speech, displayed in a web interface
Mili is an artificial assistant that can make your life easier. It can perform a variety of tasks, such as navigating to a place, playing songs, and much more.
Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.
Like ChatGPT's voice conversations with an AI, but entirely offline, private, and trade-secret-friendly, using local AI models such as Llama 2 and Whisper