Search Results - RepositoryStats

ollama ollama

9.3k

117.2k

mit

684

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

go llm llms phi3 phi4 gemma llama llava gemma2 golang llama2 llama3 ollama mistral deepseek

Created 2023-06-26

3,802 commits to main branch, last one a day ago

LLaVA haotian-liu

2.3k

21.3k

apache-2.0

159

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

gpt-4 llama llava llama2 chatbot chatgpt llama-2 multimodal multi-modality foundation-models instruction-tuning vision-language-model visual-language-learning

Created 2023-04-17

460 commits to main branch, last one 8 months ago

sglang sgl-project

810

8.3k

apache-2.0

74

SGLang is a fast serving framework for large language models and vision language models.

llm moe vlm cuda llama llava llama2 llama3 pytorch deepseek llama3-1 inference deepseek-v3 llm-serving transformer deepseek-llm

Created 2024-01-08

1,938 commits to main branch, last one 12 hours ago

SUPIR Fanghua-Yu

404

4.8k

other

68

SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.

sdxl llava pytorch restoration deep-learning diffusion-models stable-diffusion super-resolution pytorch-lightning

Created 2023-12-21

36 commits to master branch, last one 6 months ago

xtuner InternLM

329

4.2k

apache-2.0

36

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

llm peft phi3 qwen agent llava llama2 llama3 chatbot mixtral msagent baichuan chatglm2 chatglm3 internlm llm-training conversational-ai large-language-models supervised-finetuning

Created 2023-07-11

334 commits to main branch, last one 11 days ago

data-juicer modelscope

198

3.5k

apache-2.0

19

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Created 2023-08-01

301 commits to main branch, last one 7 days ago

zero_nlp yuanzhoulvpi2017

383

3.2k

mit

29

中文nlp解决方案(大模型、数据、模型、训练、推理)

gpt nlp bert clip gpt2 llama llava llama2 pytorch chatglm-6b transformers text-generation huggingface-transformers

Created 2023-02-05

244 commits to main branch, last one about a month ago

LLamaSharp SciSharp

375

2.9k

mit

56

A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.

gpt llm llama llava llama2 llama3 chatbot llamacpp llama-cpp multi-modal semantic-kernel

Created 2023-05-09

1,847 commits to master branch, last one 16 hours ago

FindTheChatGPTer chenking2020

201

2.0k

unknown

56

ChatGPT爆火，开启了通往AGI的关键一步，本项目旨在汇总那些ChatGPT的开源平替们，包括文本大模型、多模态大模型等，为大家提供一些便利

Created 2023-04-07

65 commits to main branch, last one about a year ago

VLMEvalKit open-compass

254

1.8k

apache-2.0

12

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

gpt llm vit vqa clip gpt4 qwen llava claude gemini gpt-4v openai chatgpt pytorch evaluation openai-api multi-modal computer-vision large-language-models

Created 2023-12-01

1,173 commits to main branch, last one 23 hours ago

OmAgent om-ai-lab

144

1.5k

apache-2.0

55

Build multimodal language agents for fast prototype and production

Created 2024-07-04

416 commits to main branch, last one 7 days ago

Video-ChatGPT mbzuai-oryx

111

1.3k

cc-by-4.0

14

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for ...

clip gpt-4 llama llava vicuna chatbot mulit-modal video-chatboat vision-language video-conversation vision-language-pretraining

Created 2023-05-18

43 commits to main branch, last one 5 months ago

uform unum-cloud

63

1.1k

apache-2.0

14

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

Created 2023-02-21

298 commits to main branch, last one 29 days ago

taggui jhc13

40

874

gpl-3.0

15

Tag manager and captioner for image datasets

llava cogvlm pyside6 florence-2 tag-manager image-tagging image-captioning stable-diffusion

Created 2023-03-08

559 commits to main branch, last one about a month ago

LLaVA-pp mbzuai-oryx

62

827

unknown

9

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

llm lmms phi3 llava llama3 llava-phi3 phi3-llava phi-3-llava phi3-vision conversation llama3-llava llava-llama3 phi-3-vision llama-3-llava llama3-vision llama-3-vision vision-language

Created 2024-04-26

11 commits to main branch, last one 9 months ago

mlx-vlm Blaizzy

66

764

mit

9

MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.

llm mlx llava molmo idefics pixtral local-ai florence2 paligemma apple-silicon vision-framework vision-transformer vision-language-model

Created 2024-04-16

198 commits to main branch, last one 14 hours ago

machina PsyChip

34

733

unknown

2

OpenCV+YOLO+LLAVA powered video surveillance system

rtsp yolo llava camera opencv python ollama-api

Created 2024-10-07

14 commits to main branch, last one 2 months ago

TinyLLaVA_Factory TinyLLaVA

79

715

apache-2.0

11

A Framework of Small-scale Large Multimodal Models

nlp llama llava tinyllama transformers vision-language large-multimodal-models

Created 2024-02-21

223 commits to main branch, last one 5 days ago

awesome-vlm-architectures gokayfem

31

596

cc0-1.0

12

Famous Vision Language Models and Their Architectures

vlm blip clip llava cogvlm kosmos awesome qwen-vl internlm multimodal awesome-list text-encoder image-encoder vision-language-model

Created 2024-02-15

231 commits to main branch, last one 4 months ago

awesome-foundation-and-multimodal-models SkalskiP

44

595

unknown

27

👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]

nlp blip clip llava multimodal grounding-dino computer-vision image-captioning segment-anything foundational-models zero-shot-detection open-vocabulary-detection open-vocabulary-segmentation

Created 2023-10-08

31 commits to master branch, last one 11 months ago

EAGLE NVlabs

38

584

unknown

25

Eagle Family: Exploring Model Designs, Data Recipes and Training Strategies for Frontier-Class Multimodal LLMs

llm lmm demo gpt4 lvlm mllm eagle llama llava nvdia llama3 huggingface large-language-models

Created 2024-06-27

102 commits to main branch, last one 4 days ago

PaddleMIX PaddlePaddle

175

476

apache-2.0

22

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...

Created 2023-07-05

1,046 commits to develop branch, last one 8 days ago

llama-assistant nrl-ai

40

475

gpl-3.0

11

AI-powered assistant to help you with your daily tasks, powered by Llama 3.2. It can recognize your voice, process natural language, and perform various actions based on your commands: summarizing tex...

owen llama llava llama3 llama-3-2 moondream private-gpt personal-assistant

Created 2024-09-26

91 commits to main branch, last one about a month ago

ComfyUI_VLM_nodes gokayfem

42

455

apache-2.0

6

Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation

llm vlm mllm llava nodes phi15 joytag siglip comfyui img2sfx img2text custom-nodes image-captioning

Created 2024-01-24

271 commits to main branch, last one 2 months ago

llmcord jakobdylanc

84

442

mit

5

Make Discord your LLM frontend ● Supports any OpenAI compatible API (Ollama, LM Studio, vLLM, OpenRouter, xAI, Mistral, Groq and more)

Created 2023-05-08

367 commits to main branch, last one 5 days ago

restai apocas

76

408

apache-2.0

9

RESTai is an AIaaS (AI as a Service) open-source platform. Built on top of LlamaIndex & Langchain. Supports any public LLM supported by LlamaIndex and any local LLM suported by Ollama/vLLM/etc. Precis...

llm rag llama llava ollama openai python fastapi langchain openaiapi embeddings llamaindex transformers stable-diffusion

Created 2023-05-18

867 commits to master branch, last one 15 hours ago

Open-LLaVA-NeXT xiaoachen98

21

375

unknown

10

An open-source implementation for training LLaVA-NeXT.

gpt-4 gpt4o llama llava llama3 chatbot chatgpt llava-next multimodal multi-modality vision-language-model large-multimodal-models visual-language-learning

Created 2024-05-11

36 commits to master branch, last one 3 months ago

InternEvo InternLM

57

339

apache-2.0

10

InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencies.

910b gemma llava zero3 llama3 pytorch internlm internlm2 multi-modal llm-training llm-framework ring-attention flash-attention deepspeed-ulysses tensor-parallelism transformers-models pipeline-parallelism sequence-parallelism

Created 2024-01-16

500 commits to develop branch, last one 11 days ago

ViP-LLaVA WisconsinAIVision

22

308

apache-2.0

6

[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

clip gpt-4 llama llava llama2 chatbot cvpr2024 multi-modal gpt-4-vision vision-language visual-prompting foundation-models

Created 2023-12-02

44 commits to main branch, last one 6 months ago

LLaVA-Mini ictnlp

13

303

apache-2.0

8

LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images, high-resolution images, and videos in an efficient manner.

gpt4o gpt4v llama llava video vision efficient multimodal large-language-models vision-language-model large-multimodal-models visual-instruction-tuning multimodal-large-language-models

Created 2025-01-07

8 commits to main branch, last one 20 days ago