10 results found Sort:

Famous Vision Language Models and Their Architectures
Created 2024-02-15
237 commits to main branch, last one 11 days ago
👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
Created 2023-10-08
31 commits to master branch, last one 11 months ago
Chain together LLMs for reasoning & orchestrate multiple large models for accomplishing complex tasks
Created 2023-03-13
110 commits to main branch, last one about a year ago
Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR 2023
Created 2022-10-07
12 commits to main branch, last one about a year ago
Pytorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners
Created 2022-05-20
17 commits to main branch, last one 2 years ago
This repository provides an interactive image colorization tool that leverages Stable Diffusion (SDXL) and BLIP for user-controlled color generation. With a retrained model using the ControlNet approa...
Created 2024-08-30
38 commits to main branch, last one 2 months ago
A data discovery and manipulation toolset for unstructured data
This repository has been archived (exclude archived)
Created 2023-03-21
39 commits to main branch, last one about a year ago
Image captioning using python and BLIP
Created 2023-01-13
32 commits to master branch, last one about a year ago
The wiki where you edit a word every 30sec, with 2.1M Wikipedia articles ported to a custom markdown format. Real-time text editing, beautiful UI & more. Vandalize articles today!
Created 2025-01-23
263 commits to main branch, last one 5 days ago
[ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives
Created 2024-04-12
12 commits to master branch, last one 3 months ago