3 results found Sort:

2.4k
19.1k
mit
297
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Created 2019-07-23
1,180 commits to master branch, last one about a month ago
Famous Vision Language Models and Their Architectures
Created 2024-02-15
221 commits to main branch, last one 14 days ago
My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"
Created 2023-09-22
29 commits to main branch, last one 7 months ago