3 results found

92
1.0k
cc-by-4.0
14
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for ...
Created 2023-05-18
42 commits to main branch, last one 13 days ago
PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models
Created 2023-11-20
8 commits to main branch, last one 5 months ago
Official repository of the paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding
Created 2024-06-13
5 commits to main branch, last one 13 days ago