317 results found

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
Created 2023-06-04
1,044 commits to master branch, last one 10 hours ago
2.2k
21.1k
apache-2.0
214
☁️ Build multimodal AI applications with cloud-native stack
Created 2020-02-13
8,620 commits to master branch, last one a day ago
2.2k
20.1k
apache-2.0
157
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Created 2023-04-17
460 commits to main branch, last one 5 months ago
2.5k
20.1k
mit
308
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Created 2019-07-23
1,214 commits to master branch, last one 8 days ago
2.5k
12.0k
apache-2.0
205
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Created 2019-08-05
7,458 commits to main branch, last one 10 hours ago
rewind.ai x cursor.com = AI powered by your 24/7 screen & voice local recording.
Created 2024-06-19
2,281 commits to main branch, last one 6 hours ago
792
7.1k
apache-2.0
77
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
Created 2019-04-02
3,343 commits to main branch, last one 21 hours ago
327
6.5k
apache-2.0
62
Visualize streams of multimodal data. Free, fast, easy to use, and simple to integrate. Built in Rust.
Created 2022-04-08
4,773 commits to main branch, last one 12 hours ago
939
5.5k
other
114
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Created 2018-06-27
1,098 commits to main branch, last one 8 days ago
1.3k
5.5k
mit
65
Generative AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, cod...
Created 2023-03-19
3,366 commits to v1-dev branch, last one 9 days ago
487
5.4k
unknown
91
This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)
Created 2023-04-02
94 commits to master branch, last one 6 months ago
417
5.2k
mit
165
Notes for software engineers getting up to speed on new AI developments. Serves as a datastore for https://latent.space writing and product brainstorming, but has cleaned up canonical references under ...
Created 2022-09-04
1,623 commits to main branch, last one 5 days ago
362
4.3k
apache-2.0
51
Plug-and-play implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by at least 70%
Created 2023-05-21
272 commits to main branch, last one 8 days ago
368
4.1k
apache-2.0
23
Use PEFT or Full-parameter to finetune 400+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vis...
Created 2023-08-01
1,144 commits to main branch, last one 18 hours ago
376
4.0k
apache-2.0
58
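Fengshenbang-LM (封神榜大模型) is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language Research Center at the IDEA Research Institute, serving as infrastructure for Chinese AIGC and cognitive intelligence.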
Created 2021-10-28
711 commits to main branch, last one about a year ago
393
3.9k
apache-2.0
45
Build real-time multimodal AI applications 🤖🎙️📹
Created 2023-10-19
807 commits to main branch, last one 2 days ago
Curated tutorials and resources for Large Language Models, AI Painting, and more.
Created 2023-08-22
188 commits to main branch, last one 7 months ago
250
3.8k
other
34
🪩 Create Disco Diffusion artworks in one line
Created 2022-06-30
385 commits to main branch, last one about a year ago
338
3.7k
mit
31
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
Created 2021-08-11
309 commits to main branch, last one 8 months ago
326
3.3k
bsd-3-clause
59
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
Created 2023-08-30
249 commits to main branch, last one 4 days ago
232
3.2k
apache-2.0
43
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, ...
Created 2023-05-08
261 commits to main branch, last one 2 months ago
Foundation Architecture for (M)LLMs
Created 2022-11-17
123 commits to main branch, last one 6 months ago
234
3.0k
apache-2.0
45
Represent, send, store and search multimodal data
Created 2021-12-14
1,462 commits to main branch, last one about a month ago
273
3.0k
mit
49
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
Created 2024-01-26
139 commits to main branch, last one about a month ago
154
2.5k
apache-2.0
43
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
Created 2023-09-26
395 commits to main branch, last one 27 days ago
SDK for interacting with stability.ai APIs (e.g. stable diffusion inference)
Created 2022-08-22
96 commits to main branch, last one 2 days ago
248
2.4k
apache-2.0
21
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Created 2022-01-29
712 commits to main branch, last one about a year ago
Easily compute clip embeddings and build a clip retrieval system with them
Created 2021-06-07
332 commits to main branch, last one 9 months ago
176
2.3k
mit
30
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
Created 2023-04-25
158 commits to main branch, last one 22 days ago