4 results found Sort:
🚀🚀🚀A collection of some wesome public projects about Large Language Model(LLM), Vision Language Model(VLM), Vision Language Action(VLA), AI Generated Content(AIGC), the related Datasets and Applica...
Created
2023-02-15
159 commits to main branch, last one 7 days ago
🔥[ICLR'25] LLaRA: Supercharging Robot Learning Data for Vision-Language Policy
Created
2024-06-07
35 commits to main branch, last one 21 days ago
🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes.
Created
2025-01-29
14 commits to main branch, last one 9 days ago
Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning
Created
2024-12-16
14 commits to main branch, last one about a month ago