7 results found Sort:

Chain together LLMs for reasoning & orchestrate multiple large models for accomplishing complex tasks
Created 2023-03-13
110 commits to main branch, last one about a year ago
👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
Created 2023-10-08
31 commits to master branch, last one 4 months ago
Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR 2023
Created 2022-10-07
12 commits to main branch, last one about a year ago
Famous Vision Language Models and Their Architectures
Created 2024-02-15
221 commits to main branch, last one 14 days ago
Pytorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners
Created 2022-05-20
17 commits to main branch, last one about a year ago
A data discovery and manipulation toolset for unstructured data
Created 2023-03-21
39 commits to main branch, last one 9 months ago
Image captioning using python and BLIP
Created 2023-01-13
32 commits to master branch, last one about a year ago