7 results found Sort:
[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
Created
2024-10-31
267 commits to main branch, last one about a month ago
A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.
Created
2025-01-05
27 commits to main branch, last one 26 days ago
An open-sourced end-to-end VLM-based GUI Agent
Created
2023-11-28
50 commits to main branch, last one 18 days ago
This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use".
Created
2024-12-12
199 commits to main branch, last one 11 days ago
Create your self-hosted, open-source Operator model.
Created
2025-01-24
11 commits to main branch, last one 2 months ago
Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.
Created
2025-02-12
140 commits to main branch, last one 11 days ago
Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents
Created
2025-04-18
4 commits to main branch, last one 2 days ago