5 results found Sort:
- Filter by Primary Language:
- Python (3)
- Jupyter Notebook (1)
- +
Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Train...
Created
2023-10-24
134 commits to master branch, last one about a month ago
Awesome papers & datasets specifically focused on long-term videos.
Created
2022-07-11
47 commits to main branch, last one 2 months ago
This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"
Created
2024-11-19
34 commits to main branch, last one 3 days ago
[AAAI 2025] VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding
Created
2024-05-17
30 commits to master branch, last one about a month ago
[ICLR 2025] TRACE: Temporal Grounding Video LLM via Casual Event Modeling
Created
2024-09-29
23 commits to master branch, last one 7 days ago