5 results found Sort:

[NeurIPS'24 Spotlight] EVE: Encoder-Free Vision-Language Models
Created 2024-06-14
17 commits to main branch, last one 2 months ago
Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024)
Created 2024-03-20
12 commits to master branch, last one 2 months ago
DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception
Created 2024-07-05
8 commits to main branch, last one 3 days ago
4
91
unknown
4
This repo is a live list of papers on game playing and large multimodality model - "A Survey on Game Playing Agents and Large Models: Methods, Applications, and Challenges".
Created 2024-02-01
7 commits to main branch, last one about a month ago
Open-source code for the paper "Enhancing Remote Sensing Vision-Language Models for Zero-Shot Scene Classification"
Created 2024-08-15
60 commits to main branch, last one 18 days ago