4 results found Sort:
- Filter by Primary Language:
- Python (2)
- Markdown (1)
- TypeScript (1)
- +
A community-driven AI automation framework that builds upon the incredible work of the open source community. Our goal is to combine language models with specialized tools for tasks like web search, c...
Created
2025-03-08
75 commits to main branch, last one 5 hours ago
Famous Vision Language Models and Their Architectures
Created
2024-02-15
240 commits to main branch, last one 24 days ago
A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v etc.
Created
2024-07-20
109 commits to main branch, last one about a month ago
Mark web pages for use with vision-language models
Created
2024-04-29
87 commits to main branch, last one 2 months ago