263 results found
☁️ Build multimodal AI applications with cloud-native stack
Created
2020-02-13
8,545 commits to master branch, last one 13 days ago
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Created
2019-07-23
1,180 commits to master branch, last one about a month ago
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Created
2023-04-17
460 commits to main branch, last one about a month ago
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Created
2019-08-05
6,640 commits to main branch, last one 9 hours ago
The easiest way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Multi-model Inference Graph/Pipelines, LLM/RAG apps, and more!
Created
2019-04-02
3,150 commits to main branch, last one 2 days ago
Visualize streams of multimodal data. Fast, easy to use, and simple to integrate. Built in Rust using egui.
Created
2022-04-08
3,977 commits to main branch, last one 14 hours ago
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Created
2018-06-27
1,097 commits to main branch, last one about a month ago
This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)
Created
2023-04-02
94 commits to master branch, last one 2 months ago
Notes for software engineers getting up to speed on new AI developments. Serves as a datastore for https://latent.space writing and product brainstorming, but has cleaned up canonical references under ...
Created
2022-09-04
1,578 commits to main branch, last one 8 hours ago
Generative AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, cod...
Created
2023-03-19
3,280 commits to main branch, last one 2 days ago
Plug-and-play implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by at least 70%
Created
2023-05-21
264 commits to main branch, last one about a month ago
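Fengshenbang-LM (封神榜大模型) is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language Research Center at the IDEA Research Institute, intended to serve as infrastructure for Chinese AIGC and cognitive intelligence.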
Created
2021-10-28
711 commits to main branch, last one 11 months ago
🪩 Create Disco Diffusion artworks in one line
Created
2022-06-30
385 commits to main branch, last one about a year ago
Curated tutorials and resources for Large Language Models, AI Painting, and more.
Created
2023-08-22
188 commits to main branch, last one 2 months ago
Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20h on one machine.
Created
2021-08-11
309 commits to main branch, last one 4 months ago
OpenMMLab Pre-training Toolbox and Benchmark
Created
2020-07-09
973 commits to main branch, last one 5 months ago
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, ...
Created
2023-05-08
260 commits to main branch, last one 7 months ago
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
Created
2023-08-30
225 commits to main branch, last one 5 months ago
Foundation Architecture for (M)LLMs
Created
2022-11-17
123 commits to main branch, last one 2 months ago
Represent, send, store and search multimodal data
Created
2021-12-14
1,452 commits to main branch, last one 16 days ago
SDK for interacting with stability.ai APIs (e.g. stable diffusion inference)
Created
2022-08-22
88 commits to main branch, last one 14 days ago
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Created
2022-01-29
712 commits to main branch, last one 10 months ago
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
Created
2024-01-26
94 commits to main branch, last one 6 days ago
Easily compute CLIP embeddings and build a CLIP retrieval system with them
Created
2021-06-07
332 commits to main branch, last one 5 months ago
ms-swift: Use PEFT or full-parameter training to fine-tune 250+ LLMs or 35+ MLLMs. (Qwen2, GLM4, Internlm2, Yi, Llama3, Llava, MiniCPM-V, Deepseek, Baichuan2, Phi3-Vision, ...)
Created
2023-08-01
667 commits to main branch, last one a day ago
mPLUG-Owl & mPLUG-Owl2: Modularized Multimodal Large Language Model
Created
2023-04-25
140 commits to main branch, last one 2 months ago
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
Created
2020-10-13
588 commits to 2024-Version-2.0 branch, last one a day ago
InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.
Created
2023-09-26
331 commits to main branch, last one 13 days ago
Conversational AI SDK for Android to enable text and voice conversations with actions (Java, Kotlin)
Created
2019-10-01
75 commits to master branch, last one about a month ago
Conversational AI SDK for Flutter to enable text and voice conversations with actions (iOS and Android)
Created
2020-04-22
76 commits to master branch, last one about a month ago