29 results found Sort:

980
12.4k
apache-2.0
116
A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language.
Created 2025-01-19
287 commits to main branch, last one 18 hours ago
Your AI Operator for Web, Android, Automation & Testing.
Created 2024-07-23
470 commits to main branch, last one 9 hours ago
686
7.4k
mit
55
The most reliable AI agent framework that supports MCP.
Created 2024-05-26
1,221 commits to master branch, last one 2 days ago
430
5.3k
apache-2.0
28
Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Operator.
Created 2024-12-31
132 commits to master branch, last one 12 days ago
163
4.4k
mit
27
c/ua is the Docker Container for Computer-Use AI Agents.
Created 2025-01-31
308 commits to main branch, last one 9 hours ago
255
2.4k
apache-2.0
33
Agent S: an open agentic framework that uses computers like a human
Created 2024-10-09
227 commits to main branch, last one 13 hours ago
320
1.5k
other
63
Ui.Vision Open-Source RPA Software with Computer Vision, OCR, Anthropic Computer Use/LLM. Selenium IDE import/export.
Created 2017-08-04
97 commits to master branch, last one about a month ago
Open Source Generative Process Automation (i.e. Generative RPA). AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models
Created 2023-04-12
939 commits to main branch, last one 2 months ago
76
1.2k
apache-2.0
15
[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
Created 2024-10-31
267 commits to main branch, last one about a month ago
77
1.2k
unknown
12
A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.
Created 2025-01-05
27 commits to main branch, last one 26 days ago
139
1.1k
apache-2.0
12
AI computer use powered by open source LLMs and E2B Desktop Sandbox
Created 2024-10-31
108 commits to master branch, last one about a month ago
72
915
apache-2.0
21
An open-sourced end-to-end VLM-based GUI Agent
Created 2023-11-28
50 commits to main branch, last one 18 days ago
132
790
unknown
13
A fork of Anthropic Computer Use that you can run on Mac computers to give Claude and other AI models autonomous access to your computer.
Created 2024-10-24
19 commits to main branch, last one 5 months ago
Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.
Created 2024-07-29
86 commits to main branch, last one 5 months ago
Bytebot is the container for desktop agents.
Created 2025-02-03
115 commits to main branch, last one a day ago
Desktop app powered by Claude’s computer use capability to control your computer
Created 2024-10-25
24 commits to main branch, last one 2 months ago
A framework to enable autonomous android and computer use using any LLM (local or remote)
Created 2024-12-16
44 commits to main branch, last one about a month ago
This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use".
Created 2024-12-12
199 commits to main branch, last one 12 days ago
Build, evaluate and run General Multi-Agent Assistance with ease
Created 2025-03-14
421 commits to main branch, last one 2 days ago
Spongecake is the easiest way to launch computer use agents.
Created 2025-03-08
83 commits to main branch, last one a day ago
Foundation Model Training Using Human Demonstrations
Created 2025-02-20
96 commits to main branch, last one 6 days ago
16
100
apache-2.0
3
A general AI agent framework that can be adapted to various tasks and environments.
Created 2023-05-29
100 commits to main branch, last one 2 months ago
A curated list of awesome resources, tools, research papers, and projects related to the concept of Large Language Model Operating Systems (LLM-OS).
Created 2024-05-13
25 commits to main branch, last one 5 days ago
A zero-installation solution for AI agents to control remote macOS systems. Full desktop capabilities without extra software, using only built-in Screen Sharing. Works with Claude and any MCP client, ...
Created 2025-03-21
48 commits to main branch, last one 7 days ago
✨ Use natural language to control your browser, powered by LLM and playwright
Created 2024-11-08
9 commits to main branch, last one 5 months ago
Mark web pages for use with vision-language models
Created 2024-04-29
90 commits to main branch, last one 23 days ago
Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups
Created 2024-11-04
7 commits to main branch, last one 4 months ago
try Computer Use on your Mac with a few clicks
Created 2024-10-31
33 commits to main branch, last one 4 months ago
Meet WebHive, the AI-powered browser that takes care of tasks for you. No more endless clicks, tell it what you need, and it gets it done.
Created 2025-01-29
297 commits to main branch, last one 2 months ago