5 results found
A GUI agent application based on UI-TARS (a Vision-Language Model) that lets you control your computer using natural language.
Created 2025-01-19
120 commits to main branch, last one a day ago
Agent S: an open agentic framework that uses computers like a human
Created 2024-10-09
172 commits to main branch, last one a day ago
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
Created 2024-06-28
168 commits to main branch, last one 9 days ago
[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents
Created 2024-08-02
177 commits to main branch, last one 18 days ago
Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"
Created 2024-06-27
32 commits to main branch, last one 7 months ago