Trending repositories for topic voice-assistant
TEN Agent is the world’s first real-time multimodal agent integrated with the OpenAI Realtime API, RTC, and features weather checks, web search, vision, and RAG capabilities.
Open Source framework for voice and multimodal conversational AI
A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频
A curated list of awesome things related to artificial intelligence tools
Mycroft Core, the Mycroft Artificial Intelligence platform.
Home Assistant custom component that allows you to turn almost any camera and almost any speaker into a local voice assistant
An open source voice-enabled, compact, empathic AI hardware + software 🤖 framework for companionship, entertainment, education, pediatric care, IoT robotics applications, AI-enhanced robotics applica...
Voice assistant made as an experiment using neural networks for things like STT/TTS/Wake Word/NLU etc.
M.I.L.E.S, a GPT-4-Turbo voice assistant, self-adapts its prompts and AI model, can play any Spotify song, adjusts system and Spotify volume, performs calculations, browses the web and internet, searc...
Home Assistant custom component that allows you to turn almost any camera and almost any speaker into a local voice assistant
TEN Agent is the world’s first real-time multimodal agent integrated with the OpenAI Realtime API, RTC, and features weather checks, web search, vision, and RAG capabilities.
A curated list of awesome things related to artificial intelligence tools
M.I.L.E.S, a GPT-4-Turbo voice assistant, self-adapts its prompts and AI model, can play any Spotify song, adjusts system and Spotify volume, performs calculations, browses the web and internet, searc...
An open source voice-enabled, compact, empathic AI hardware + software 🤖 framework for companionship, entertainment, education, pediatric care, IoT robotics applications, AI-enhanced robotics applica...
Open Source framework for voice and multimodal conversational AI
Voice assistant made as an experiment using neural networks for things like STT/TTS/Wake Word/NLU etc.
A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频
Mycroft Core, the Mycroft Artificial Intelligence platform.
TEN Agent is the world’s first real-time multimodal agent integrated with the OpenAI Realtime API, RTC, and features weather checks, web search, vision, and RAG capabilities.
Open Source framework for voice and multimodal conversational AI
A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频
A curated list of awesome things related to artificial intelligence tools
Mycroft Core, the Mycroft Artificial Intelligence platform.
Home Assistant custom component that allows you to turn almost any camera and almost any speaker into a local voice assistant
An open source voice-enabled, compact, empathic AI hardware + software 🤖 framework for companionship, entertainment, education, pediatric care, IoT robotics applications, AI-enhanced robotics applica...
Voice assistant made as an experiment using neural networks for things like STT/TTS/Wake Word/NLU etc.
M.I.L.E.S, a GPT-4-Turbo voice assistant, self-adapts its prompts and AI model, can play any Spotify song, adjusts system and Spotify volume, performs calculations, browses the web and internet, searc...
Home Assistant custom component that allows you to turn almost any camera and almost any speaker into a local voice assistant
TEN Agent is the world’s first real-time multimodal agent integrated with the OpenAI Realtime API, RTC, and features weather checks, web search, vision, and RAG capabilities.
A curated list of awesome things related to artificial intelligence tools
M.I.L.E.S, a GPT-4-Turbo voice assistant, self-adapts its prompts and AI model, can play any Spotify song, adjusts system and Spotify volume, performs calculations, browses the web and internet, searc...
An open source voice-enabled, compact, empathic AI hardware + software 🤖 framework for companionship, entertainment, education, pediatric care, IoT robotics applications, AI-enhanced robotics applica...
Open Source framework for voice and multimodal conversational AI
Voice assistant made as an experiment using neural networks for things like STT/TTS/Wake Word/NLU etc.
A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频
Mycroft Core, the Mycroft Artificial Intelligence platform.
TEN Agent is the world’s first real-time multimodal agent integrated with the OpenAI Realtime API, RTC, and features weather checks, web search, vision, and RAG capabilities.
Open Source framework for voice and multimodal conversational AI
Voice assistant made as an experiment using neural networks for things like STT/TTS/Wake Word/NLU etc.
An open source voice-enabled, compact, empathic AI hardware + software 🤖 framework for companionship, entertainment, education, pediatric care, IoT robotics applications, AI-enhanced robotics applica...
A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
A curated list of awesome things related to artificial intelligence tools
A Python-based virtual assistant using Gemini AI. Features include voice recognition, text-to-speech, weather updates, news retrieval, jokes, Wikipedia info, and music management. Comes with an intera...
Your Own Personal Voice Assistant. It's a mini python project.
Virtually controlling computer using hand-gestures and voice commands. Using MediaPipe, OpenCV Python.
Ирина - русский голосовой ассистент для работы оффлайн. Поддерживает скиллы через плагины.
TEN Agent is the world’s first real-time multimodal agent integrated with the OpenAI Realtime API, RTC, and features weather checks, web search, vision, and RAG capabilities.
A Python-based virtual assistant using Gemini AI. Features include voice recognition, text-to-speech, weather updates, news retrieval, jokes, Wikipedia info, and music management. Comes with an intera...
An open source voice-enabled, compact, empathic AI hardware + software 🤖 framework for companionship, entertainment, education, pediatric care, IoT robotics applications, AI-enhanced robotics applica...
A Private and Open Source AI-Powered Voice Assistant & Multisensor for Home Assistant
Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)
Open Voice OS and/or HiveMind installer using Ansible with an intuitive and easy Text-based User Interface
Starter project for building real-time AI Voice Assistants
Your Own Personal Voice Assistant. It's a mini python project.
Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.
The ChatGPT Voice Assistant uses a Raspberry Pi (or desktop) to enable spoken conversation with OpenAI large language models. This implementation listens to speech, processes the conversation through ...
A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频
Open Source framework for voice and multimodal conversational AI
TEN Agent is the world’s first real-time multimodal agent integrated with the OpenAI Realtime API, RTC, and features weather checks, web search, vision, and RAG capabilities.
Your own personal voice assistant: Voice to Text to LLM to Speech, displayed in a web interface
An open source voice-enabled, compact, empathic AI hardware + software 🤖 framework for companionship, entertainment, education, pediatric care, IoT robotics applications, AI-enhanced robotics applica...
Open Voice OS and/or HiveMind installer using Ansible with an intuitive and easy Text-based User Interface
A Private and Open Source AI-Powered Voice Assistant & Multisensor for Home Assistant
A Python-based virtual assistant using Gemini AI. Features include voice recognition, text-to-speech, weather updates, news retrieval, jokes, Wikipedia info, and music management. Comes with an intera...
A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频
Open Source framework for voice and multimodal conversational AI
TEN Agent is the world’s first real-time multimodal agent integrated with the OpenAI Realtime API, RTC, and features weather checks, web search, vision, and RAG capabilities.
Voice assistant made as an experiment using neural networks for things like STT/TTS/Wake Word/NLU etc.
Your own personal voice assistant: Voice to Text to LLM to Speech, displayed in a web interface
An open source voice-enabled, compact, empathic AI hardware + software 🤖 framework for companionship, entertainment, education, pediatric care, IoT robotics applications, AI-enhanced robotics applica...
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
Ирина - русский голосовой ассистент для работы оффлайн. Поддерживает скиллы через плагины.
A curated list of awesome things related to artificial intelligence tools
Virtually controlling computer using hand-gestures and voice commands. Using MediaPipe, OpenCV Python.
On-device voice assistant platform powered by deep learning
TEN Agent is the world’s first real-time multimodal agent integrated with the OpenAI Realtime API, RTC, and features weather checks, web search, vision, and RAG capabilities.
M.I.L.E.S, a GPT-4-Turbo voice assistant, self-adapts its prompts and AI model, can play any Spotify song, adjusts system and Spotify volume, performs calculations, browses the web and internet, searc...
Your own personal voice assistant: Voice to Text to LLM to Speech, displayed in a web interface
Mili is an artificial assistant that can make your life easier. It can perform a variety of tasks, such as navigating to a place, playing songs, and much more.
Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)
Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.
Your Own Personal Voice Assistant. It's a mini python project.
Virtual Voice Assistant is a project that utilizes machine learning and natural language processing to enable users to control their devices using voice commands. Technologies used include TensorFlow,...
Like ChatGPT's voice conversations with an AI, but entirely offline/private/trade-secret-friendly, using local AI models such as LLama 2 and Whisper
The ChatGPT Voice Assistant uses a Raspberry Pi (or desktop) to enable spoken conversation with OpenAI large language models. This implementation listens to speech, processes the conversation through ...
Home Assistant custom component that allows you to turn almost any camera and almost any speaker into a local voice assistant
🔊😊 A fastapi voice-assistant framework to quickly prototype LLM-powered voice assistants in <5 minutes.