6 results found Sort:
- Filter by Primary Language:
- Python (4)
- Jupyter Notebook (1)
- +
[ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation
Created
2024-10-09
13 commits to main branch, last one 7 days ago
[CVPR 2025] Source codes for the paper "3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning"
Created
2024-12-09
7 commits to main branch, last one 4 days ago
SpatialLM: Large Language Model for Spatial Understanding
Created
2025-03-14
1 commits to main branch, last one 3 days ago
Code for "StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene Generation", Arxiv 2025.
Created
2025-01-13
2 commits to main branch, last one 2 months ago
[NeurIPS 2024] Official code for HourVideo: 1-Hour Video Language Understanding
evals
gpt-4
reasoning
gemini-pro
navigation
perception
neurips-2024
summarization
visual-reasoning
benchmark-dataset
egocentric-videos
spatial-intelligence
multiple-choice-questions
long-context-understanding
video-language-understanding
multimodal-large-language-models
1-hour-video-language-understanding
long-form-video-language-understanding
Created
2024-11-27
10 commits to main branch, last one 13 days ago
Multimodal datasets for spatial intelligence
Created
2024-08-21
36 commits to main branch, last one 2 days ago