4 results found Sort:

148
2.3k
apache-2.0
29
GPT4V-level open-source multi-modal model based on Llama3-8B
Created 2024-05-10
86 commits to main branch, last one 5 months ago
41
890
gpl-3.0
15
Tag manager and captioner for image datasets
Created 2023-03-08
559 commits to main branch, last one 2 months ago
Famous Vision Language Models and Their Architectures
Created 2024-02-15
237 commits to main branch, last one 10 days ago
Python scripts to use for captioning images with VLMs
Created 2024-03-24
11 commits to main branch, last one 6 months ago