330 results found Sort:

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
Created 2023-06-04
1,122 commits to master branch, last one 2 days ago
2.2k
21.2k
apache-2.0
215
☁️ Build multimodal AI applications with cloud-native stack
Created 2020-02-13
8,639 commits to master branch, last one 24 hours ago
2.3k
20.8k
apache-2.0
157
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Created 2023-04-17
460 commits to main branch, last one 7 months ago
2.6k
20.4k
mit
307
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Created 2019-07-23
1,229 commits to master branch, last one 5 days ago
2.6k
12.5k
apache-2.0
210
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Created 2019-08-05
7,716 commits to main branch, last one 15 hours ago
697
11.1k
mit
63
build ai agents that have the full context, open source, runs locally, developer friendly. 24/7 screen, mic, keyboard recording and control
Created 2024-06-19
2,845 commits to main branch, last one 16 hours ago
367
7.3k
apache-2.0
61
Visualize streams of multimodal data. Free, fast, easy to use, and simple to integrate. Built in Rust.
Created 2022-04-08
5,087 commits to main branch, last one 23 hours ago
797
7.2k
apache-2.0
76
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
Created 2019-04-02
3,406 commits to main branch, last one a day ago
1.3k
5.7k
mit
69
Generative AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, cod...
Created 2023-03-19
4,841 commits to v2-dev branch, last one a day ago
936
5.5k
other
114
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Created 2018-06-27
1,099 commits to main branch, last one about a month ago
495
5.5k
unknown
92
This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)
Created 2023-04-02
94 commits to master branch, last one 8 months ago
420
5.3k
mit
165
notes for software engineers getting up to speed on new AI developments. Serves as datastore for https://latent.space writing, and product brainstorming, but has cleaned up canonical references under ...
Created 2022-09-04
1,641 commits to main branch, last one 2 days ago
411
4.7k
apache-2.0
23
Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 100+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL...
Created 2023-08-01
1,272 commits to main branch, last one a day ago
366
4.4k
apache-2.0
52
Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by atleast 70%
Created 2023-05-21
272 commits to main branch, last one about a month ago
477
4.3k
apache-2.0
51
Build real-time multimodal AI applications 🤖🎙️📹
Created 2023-10-19
908 commits to main branch, last one 18 hours ago
379
4.1k
apache-2.0
58
Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。
Created 2021-10-28
711 commits to main branch, last one about a year ago
Curated tutorials and resources for Large Language Models, AI Painting, and more.
Created 2023-08-22
188 commits to main branch, last one 8 months ago
249
3.8k
other
34
🪩 Create Disco Diffusion artworks in one line
Created 2022-06-30
385 commits to main branch, last one about a year ago
345
3.8k
mit
31
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
Created 2021-08-11
309 commits to main branch, last one 10 months ago
339
3.6k
apache-2.0
41
TEN Agent is a conversational AI powered by the TEN, integrating Gemini 2.0 Live, OpenAI Realtime, RTC, and more. It delivers real-time capabilities to see, hear, and speak, while being fully compatib...
Created 2024-06-19
646 commits to main branch, last one 2 days ago
1.1k
3.5k
apache-2.0
30
OpenMMLab Pre-training Toolbox and Benchmark
Created 2020-07-09
974 commits to main branch, last one about a month ago
341
3.4k
bsd-3-clause
60
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
Created 2023-08-30
249 commits to main branch, last one about a month ago
232
3.2k
apache-2.0
44
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, ...
Created 2023-05-08
261 commits to main branch, last one 4 months ago
300
3.2k
mit
53
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
Created 2024-01-26
139 commits to main branch, last one 2 months ago
Foundation Architecture for (M)LLMs
Created 2022-11-17
123 commits to main branch, last one 8 months ago
232
3.0k
apache-2.0
46
Represent, send, store and search multimodal data
Created 2021-12-14
1,462 commits to main branch, last one 2 months ago
159
2.6k
apache-2.0
43
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Created 2023-09-26
409 commits to main branch, last one 3 days ago
Easily compute clip embeddings and build a clip retrieval system with them
Created 2021-06-07
332 commits to main branch, last one 11 months ago
249
2.4k
apache-2.0
21
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Created 2022-01-29
712 commits to main branch, last one about a year ago
SDK for interacting with stability.ai APIs (e.g. stable diffusion inference)
Created 2022-08-22
98 commits to main branch, last one a day ago