4 results found Sort:

[ICLR 2025] This is the official repository of our paper "MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine“
Created 2024-08-06
23 commits to master branch, last one 22 days ago
This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use".
Created 2024-12-12
194 commits to main branch, last one about a month ago
10
53
unknown
3
[CVPR 2025] The official code for "Olympus: A Universal Task Router for Computer Vision Tasks"
Created 2024-12-04
75 commits to main branch, last one 15 days ago
Awesome Reasoning in MLLMs: Papers and Projects about learning to reason with MLLMs, including Chain-of-Thought (CoT), OpenAl o1, and DeepSeek-R1
Created 2025-03-12
13 commits to main branch, last one 2 days ago