5 results found

21
395
unknown
6
Personal Project: MPP-Qwen14B & MPP-Qwen-Next (Multimodal Pipeline Parallel based on Qwen-LM). Supports [video/image/multi-image] {sft/conversations}. Don't let poverty limit your imagination! Train...
Created 2023-10-24
134 commits to master branch, last one about a month ago
This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"
Created 2024-11-19
34 commits to main branch, last one 3 days ago
2
86
apache-2.0
3
[AAAI 2025] VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding
Created 2024-05-17
30 commits to master branch, last one about a month ago
0
63
apache-2.0
3
[ICLR 2025] TRACE: Temporal Grounding Video LLM via Causal Event Modeling
Created 2024-09-29
23 commits to master branch, last one 7 days ago