4 results found

150
2.2k
apache-2.0
29
GPT4V-level open-source multi-modal model based on Llama3-8B
Created 2024-05-10
86 commits to main branch, last one 4 months ago
39
871
gpl-3.0
15
Tag manager and captioner for image datasets
Created 2023-03-08
559 commits to main branch, last one about a month ago
Famous Vision Language Models and Their Architectures
Created 2024-02-15
231 commits to main branch, last one 4 months ago
Python scripts to use for captioning images with VLMs
Created 2024-03-24
11 commits to main branch, last one 6 months ago