330 results found Sort:
- Filter by Primary Language:
- Python (229)
- Jupyter Notebook (33)
- TypeScript (6)
- C++ (5)
- JavaScript (5)
- HTML (4)
- Ruby (3)
- C# (2)
- MATLAB (1)
- Markdown (1)
- Go (1)
- Rust (1)
- Shell (1)
- Dart (1)
- +
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
Created
2023-06-04
1,122 commits to master branch, last one 2 days ago
☁️ Build multimodal AI applications with cloud-native stack
Created
2020-02-13
8,639 commits to master branch, last one 24 hours ago
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Created
2023-04-17
460 commits to main branch, last one 7 months ago
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Created
2019-07-23
1,229 commits to master branch, last one 5 days ago
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Created
2019-08-05
7,716 commits to main branch, last one 15 hours ago
build ai agents that have the full context, open source, runs locally, developer friendly. 24/7 screen, mic, keyboard recording and control
Created
2024-06-19
2,845 commits to main branch, last one 16 hours ago
Visualize streams of multimodal data. Free, fast, easy to use, and simple to integrate. Built in Rust.
Created
2022-04-08
5,087 commits to main branch, last one 23 hours ago
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
Created
2019-04-02
3,406 commits to main branch, last one a day ago
Generative AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, cod...
Created
2023-03-19
4,841 commits to v2-dev branch, last one a day ago
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Created
2018-06-27
1,099 commits to main branch, last one about a month ago
This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)
Created
2023-04-02
94 commits to master branch, last one 8 months ago
notes for software engineers getting up to speed on new AI developments. Serves as datastore for https://latent.space writing, and product brainstorming, but has cleaned up canonical references under ...
Created
2022-09-04
1,641 commits to main branch, last one 2 days ago
Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 100+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL...
Created
2023-08-01
1,272 commits to main branch, last one a day ago
Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by atleast 70%
Created
2023-05-21
272 commits to main branch, last one about a month ago
Build real-time multimodal AI applications 🤖🎙️📹
Created
2023-10-19
908 commits to main branch, last one 18 hours ago
Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。
Created
2021-10-28
711 commits to main branch, last one about a year ago
Curated tutorials and resources for Large Language Models, AI Painting, and more.
Created
2023-08-22
188 commits to main branch, last one 8 months ago
🪩 Create Disco Diffusion artworks in one line
Created
2022-06-30
385 commits to main branch, last one about a year ago
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
Created
2021-08-11
309 commits to main branch, last one 10 months ago
TEN Agent is a conversational AI powered by the TEN, integrating Gemini 2.0 Live, OpenAI Realtime, RTC, and more. It delivers real-time capabilities to see, hear, and speak, while being fully compatib...
Created
2024-06-19
646 commits to main branch, last one 2 days ago
OpenMMLab Pre-training Toolbox and Benchmark
Created
2020-07-09
974 commits to main branch, last one about a month ago
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
Created
2023-08-30
249 commits to main branch, last one about a month ago
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, ...
Created
2023-05-08
261 commits to main branch, last one 4 months ago
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
Created
2024-01-26
139 commits to main branch, last one 2 months ago
Foundation Architecture for (M)LLMs
Created
2022-11-17
123 commits to main branch, last one 8 months ago
Represent, send, store and search multimodal data
Created
2021-12-14
1,462 commits to main branch, last one 2 months ago
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Created
2023-09-26
409 commits to main branch, last one 3 days ago
Easily compute clip embeddings and build a clip retrieval system with them
Created
2021-06-07
332 commits to main branch, last one 11 months ago
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Created
2022-01-29
712 commits to main branch, last one about a year ago
SDK for interacting with stability.ai APIs (e.g. stable diffusion inference)
Created
2022-08-22
98 commits to main branch, last one a day ago