71 results found
- Filter by Primary Language:
- Python (55)
- Jupyter Notebook (4)
- C++ (2)
- TypeScript (2)
- Ruby (1)
- Shell (1)
- Markdown (1)
- Go (1)
- Java (1)
- C# (1)
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
Created
2023-06-26
3,796 commits to main branch, last one 22 hours ago
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Created
2023-04-17
460 commits to main branch, last one 8 months ago
SGLang is a fast serving framework for large language models and vision language models.
Created
2024-01-08
1,914 commits to main branch, last one 6 hours ago
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
Created
2023-12-21
36 commits to master branch, last one 6 months ago
An efficient, flexible and full-featured toolkit for fine-tuning LLMs (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Created
2023-07-11
334 commits to main branch, last one 8 days ago
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
Created
2023-08-01
301 commits to main branch, last one 4 days ago
Chinese NLP solutions (large models, data, models, training, inference)
Created
2023-02-05
244 commits to main branch, last one 28 days ago
A C#/.NET library to run LLMs (🦙LLaMA/LLaVA) on your local device efficiently.
Created
2023-05-09
1,828 commits to master branch, last one 9 days ago
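ChatGPT's explosive popularity marked a key step toward AGI. This project aims to collect the open-source alternatives to ChatGPT, including large text models and large multimodal models, as a convenience for everyone.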
Created
2023-04-07
65 commits to main branch, last one about a year ago
Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks
Created
2023-12-01
1,167 commits to main branch, last one 8 hours ago
Build multimodal language agents for fast prototyping and production
Created
2024-07-04
416 commits to main branch, last one 4 days ago
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for ...
Created
2023-05-18
43 commits to main branch, last one 5 months ago
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️
Created
2023-02-21
298 commits to main branch, last one 26 days ago
Tag manager and captioner for image datasets
Created
2023-03-08
559 commits to main branch, last one about a month ago
🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)
Created
2024-04-26
11 commits to main branch, last one 9 months ago
MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.
Created
2024-04-16
197 commits to main branch, last one 24 hours ago
OpenCV+YOLO+LLAVA powered video surveillance system
Created
2024-10-07
14 commits to main branch, last one 2 months ago
A Framework of Small-scale Large Multimodal Models
Created
2024-02-21
223 commits to main branch, last one 2 days ago
👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
Created
2023-10-08
31 commits to master branch, last one 11 months ago
Famous Vision Language Models and Their Architectures
Created
2024-02-15
231 commits to main branch, last one 4 months ago
Eagle Family: Exploring Model Designs, Data Recipes and Training Strategies for Frontier-Class Multimodal LLMs
Created
2024-06-27
102 commits to main branch, last one 2 days ago
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...
Created
2023-07-05
1,046 commits to develop branch, last one 5 days ago
AI-powered assistant to help you with your daily tasks, powered by Llama 3.2. It can recognize your voice, process natural language, and perform various actions based on your commands: summarizing tex...
Created
2024-09-26
91 commits to main branch, last one about a month ago
Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation
Created
2024-01-24
271 commits to main branch, last one 2 months ago
Make Discord your LLM frontend ● Supports any OpenAI compatible API (Ollama, LM Studio, vLLM, OpenRouter, xAI, Mistral, Groq and more)
Created
2023-05-08
367 commits to main branch, last one 3 days ago
RESTai is an AIaaS (AI as a Service) open-source platform. Built on top of LlamaIndex & Langchain. Supports any public LLM supported by LlamaIndex and any local LLM supported by Ollama/vLLM/etc. Precis...
Created
2023-05-18
859 commits to master branch, last one 18 hours ago
An open-source implementation for training LLaVA-NeXT.
Created
2024-05-11
36 commits to master branch, last one 3 months ago
InternEvo is an open-source lightweight training framework that aims to support model pre-training without the need for extensive dependencies.
Created
2024-01-16
500 commits to develop branch, last one 8 days ago
[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts
Created
2023-12-02
44 commits to main branch, last one 6 months ago
LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images, high-resolution images, and videos in an efficient manner.
Created
2025-01-07
8 commits to main branch, last one 17 days ago