3 results found Sort:

110
1.2k
cc-by-4.0
15
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for ...
Created 2023-05-18
43 commits to main branch, last one 3 months ago
PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models
Created 2023-11-20
8 commits to main branch, last one 11 months ago
Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding
Created 2024-06-13
6 commits to main branch, last one 5 months ago