6 results found Sort:

23
427
unknown
6
Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Train...
Created 2023-10-24
135 commits to master branch, last one 13 days ago
This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"
Created 2024-11-19
38 commits to main branch, last one 29 days ago
2
95
apache-2.0
3
[AAAI 2025] VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding
Created 2024-05-17
30 commits to master branch, last one 3 months ago
0
75
apache-2.0
4
[ICLR 2025] TRACE: Temporal Grounding Video LLM via Casual Event Modeling
Created 2024-09-29
23 commits to master branch, last one 2 months ago
0
63
unknown
1
This is the official implementation of our paper "QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension"
Created 2025-03-04
18 commits to main branch, last one 7 days ago