4 results found Sort:

200
2.4k
mit
26
A community-driven AI automation framework that builds upon the incredible work of the open source community. Our goal is to combine language models with specialized tools for tasks like web search, c...
Created 2025-03-08
75 commits to main branch, last one 5 hours ago
Famous Vision Language Models and Their Architectures
Created 2024-02-15
240 commits to main branch, last one 24 days ago
26
276
apache-2.0
8
A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v etc.
Created 2024-07-20
109 commits to main branch, last one about a month ago
Mark web pages for use with vision-language models
Created 2024-04-29
87 commits to main branch, last one 2 months ago