4 results found
Primary language: Python (3), Markdown (1)
GPT4V-level open-source multi-modal model based on Llama3-8B
Created: 2024-05-10
86 commits to main branch, last one 4 months ago
Tag manager and captioner for image datasets
Created: 2023-03-08
559 commits to main branch, last one about a month ago
Famous Vision Language Models and Their Architectures
Created: 2024-02-15
231 commits to main branch, last one 4 months ago
Python scripts to use for captioning images with VLMs
Created: 2024-03-24
11 commits to main branch, last one 6 months ago