263 results found

2.2k
20.5k
apache-2.0
208
☁️ Build multimodal AI applications with cloud-native stack
Created 2020-02-13
8,545 commits to master branch, last one 13 days ago
2.4k
19.1k
mit
297
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Created 2019-07-23
1,180 commits to master branch, last one about a month ago
1.9k
17.7k
apache-2.0
157
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Created 2023-04-17
460 commits to main branch, last one about a month ago
2.2k
10.6k
apache-2.0
194
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Created 2019-08-05
6,640 commits to main branch, last one 9 hours ago
764
6.8k
apache-2.0
74
The easiest way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Multi-model Inference Graph/Pipelines, LLM/RAG apps, and more!
Created 2019-04-02
3,150 commits to main branch, last one 2 days ago
252
5.6k
apache-2.0
57
Visualize streams of multimodal data. Fast, easy to use, and simple to integrate. Built in Rust using egui.
Created 2022-04-08
3,977 commits to main branch, last one 14 hours ago
925
5.4k
other
115
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Created 2018-06-27
1,097 commits to main branch, last one about a month ago
451
5.0k
unknown
84
This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)
Created 2023-04-02
94 commits to master branch, last one 2 months ago
384
4.8k
mit
150
Notes for software engineers getting up to speed on new AI developments. Serves as a datastore for https://latent.space writing and product brainstorming, but has cleaned up canonical references under ...
Created 2022-09-04
1,578 commits to main branch, last one 8 hours ago
1.1k
4.7k
mit
55
Generative AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, cod...
Created 2023-03-19
3,280 commits to main branch, last one 2 days ago
348
4.1k
apache-2.0
51
Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by at least 70%
Created 2023-05-21
264 commits to main branch, last one about a month ago
369
4.0k
apache-2.0
56
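Fengshenbang-LM (封神榜大模型) is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language Research Center at the IDEA Research Institute, serving as infrastructure for Chinese AIGC and cognitive intelligence.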
Created 2021-10-28
711 commits to main branch, last one 11 months ago
246
3.8k
other
34
🪩 Create Disco Diffusion artworks in one line
Created 2022-06-30
385 commits to main branch, last one about a year ago
Curated tutorials and resources for Large Language Models, AI Painting, and more.
Created 2023-08-22
188 commits to main branch, last one 2 months ago
320
3.4k
mit
30
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
Created 2021-08-11
309 commits to main branch, last one 4 months ago
1.0k
3.3k
apache-2.0
31
OpenMMLab Pre-training Toolbox and Benchmark
Created 2020-07-09
973 commits to main branch, last one 5 months ago
229
3.2k
apache-2.0
43
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, ...
Created 2023-05-08
260 commits to main branch, last one 7 months ago
307
3.0k
bsd-3-clause
60
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
Created 2023-08-30
225 commits to main branch, last one 5 months ago
Foundation Architecture for (M)LLMs
Created 2022-11-17
123 commits to main branch, last one 2 months ago
222
2.8k
apache-2.0
43
Represent, send, store and search multimodal data
Created 2021-12-14
1,452 commits to main branch, last one 16 days ago
SDK for interacting with stability.ai APIs (e.g. stable diffusion inference)
Created 2022-08-22
88 commits to main branch, last one 14 days ago
247
2.4k
apache-2.0
21
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Created 2022-01-29
712 commits to main branch, last one 10 months ago
186
2.3k
mit
35
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
Created 2024-01-26
94 commits to main branch, last one 6 days ago
Easily compute clip embeddings and build a clip retrieval system with them
Created 2021-06-07
332 commits to main branch, last one 5 months ago
203
2.1k
apache-2.0
19
ms-swift: Use PEFT or Full-parameter to finetune 250+ LLMs or 35+ MLLMs. (Qwen2, GLM4, Internlm2, Yi, Llama3, Llava, MiniCPM-V, Deepseek, Baichuan2, Phi3-Vision, ...)
Created 2023-08-01
667 commits to main branch, last one a day ago
159
2.0k
mit
26
mPLUG-Owl & mPLUG-Owl2: Modularized Multimodal Large Language Model
Created 2023-04-25
140 commits to main branch, last one 2 months ago
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
Created 2020-10-13
588 commits to 2024-Version-2.0 branch, last one a day ago
InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.
Created 2023-09-26
331 commits to main branch, last one 13 days ago
21
1.9k
unknown
12
Conversational AI SDK for Android to enable text and voice conversations with actions (Java, Kotlin)
Created 2019-10-01
75 commits to master branch, last one about a month ago
39
1.8k
unknown
10
Conversational AI SDK for Flutter to enable text and voice conversations with actions (iOS and Android)
Created 2020-04-22
76 commits to master branch, last one about a month ago