3 results found Sort:
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for ...
Created
2023-05-18
43 commits to main branch, last one 3 months ago
PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models
Created
2023-11-20
8 commits to main branch, last one 11 months ago
Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding
Created
2024-06-13
6 commits to main branch, last one 5 months ago