5 results found Sort:

230
3.1k
apache-2.0
48
A GUI Agent application based on UI-TARS(Vision-Lanuage Model) that allows you to control your computer using natural language.
Created 2025-01-19
120 commits to main branch, last one a day ago
145
1.3k
apache-2.0
27
Agent S: an open agentic framework that uses computers like a human
Created 2024-10-09
172 commits to main branch, last one a day ago
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
Created 2024-06-28
168 commits to main branch, last one 9 days ago
12
187
unknown
7
[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents
Created 2024-08-02
177 commits to main branch, last one 18 days ago
Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"
Created 2024-06-27
32 commits to main branch, last one 7 months ago