317 results found
- Filter by Primary Language:
- Python (223)
- Jupyter Notebook (32)
- C++ (5)
- TypeScript (5)
- HTML (4)
- JavaScript (4)
- Ruby (3)
- C# (2)
- Rust (2)
- Markdown (1)
- Go (1)
- Dart (1)
- Shell (1)
- MATLAB (1)
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
Created
2023-06-04
1,044 commits to master branch, last one 10 hours ago
☁️ Build multimodal AI applications with cloud-native stack
Created
2020-02-13
8,620 commits to master branch, last one a day ago
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Created
2023-04-17
460 commits to main branch, last one 5 months ago
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Created
2019-07-23
1,214 commits to master branch, last one 8 days ago
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Created
2019-08-05
7,458 commits to main branch, last one 10 hours ago
rewind.ai x cursor.com = AI powered by your 24/7 screen & voice local recording.
Created
2024-06-19
2,281 commits to main branch, last one 6 hours ago
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
Created
2019-04-02
3,343 commits to main branch, last one 21 hours ago
Visualize streams of multimodal data. Free, fast, easy to use, and simple to integrate. Built in Rust.
Created
2022-04-08
4,773 commits to main branch, last one 12 hours ago
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Created
2018-06-27
1,098 commits to main branch, last one 8 days ago
Generative AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, cod...
Created
2023-03-19
3,366 commits to v1-dev branch, last one 9 days ago
This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)
Created
2023-04-02
94 commits to master branch, last one 6 months ago
Notes for software engineers getting up to speed on new AI developments. Serves as datastore for https://latent.space writing, and product brainstorming, but has cleaned up canonical references under ...
Created
2022-09-04
1,623 commits to main branch, last one 5 days ago
Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by at least 70%
Created
2023-05-21
272 commits to main branch, last one 8 days ago
Use PEFT or Full-parameter to finetune 400+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vis...
Created
2023-08-01
1,144 commits to main branch, last one 18 hours ago
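Fengshenbang-LM (封神榜大模型) is an open-source large-model ecosystem led by the Center for Cognitive Computing and Natural Language at the IDEA Research Institute, intended to serve as infrastructure for Chinese AIGC and cognitive intelligence.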
Created
2021-10-28
711 commits to main branch, last one about a year ago
Build real-time multimodal AI applications 🤖🎙️📹
Created
2023-10-19
807 commits to main branch, last one 2 days ago
Curated tutorials and resources for Large Language Models, AI Painting, and more.
Created
2023-08-22
188 commits to main branch, last one 7 months ago
🪩 Create Disco Diffusion artworks in one line
Created
2022-06-30
385 commits to main branch, last one about a year ago
Easily turn large sets of image URLs into an image dataset. Can download, resize and package 100M URLs in 20h on one machine.
Created
2021-08-11
309 commits to main branch, last one 8 months ago
OpenMMLab Pre-training Toolbox and Benchmark
Created
2020-07-09
974 commits to main branch, last one 5 days ago
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
Created
2023-08-30
249 commits to main branch, last one 4 days ago
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, ...
Created
2023-05-08
261 commits to main branch, last one 2 months ago
Foundation Architecture for (M)LLMs
Created
2022-11-17
123 commits to main branch, last one 6 months ago
Represent, send, store and search multimodal data
Created
2021-12-14
1,462 commits to main branch, last one about a month ago
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
Created
2024-01-26
139 commits to main branch, last one about a month ago
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
Created
2023-09-26
395 commits to main branch, last one 27 days ago
SDK for interacting with stability.ai APIs (e.g. stable diffusion inference)
Created
2022-08-22
96 commits to main branch, last one 2 days ago
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Created
2022-01-29
712 commits to main branch, last one about a year ago
Easily compute CLIP embeddings and build a CLIP retrieval system with them
Created
2021-06-07
332 commits to main branch, last one 9 months ago
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
Created
2023-04-25
158 commits to main branch, last one 22 days ago