22 results found
Let AI be your browser operator.
Created
2024-07-23
374 commits to main branch, last one 19 hours ago
The most reliable AI agent framework that supports MCP.
Created
2024-05-26
1,131 commits to master branch, last one 2 days ago
Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Operator.
Created
2024-12-31
87 commits to master branch, last one a day ago
A GUI Agent application based on UI-TARS (Vision-Language Model) that allows you to control your computer using natural language.
Created
2025-01-19
175 commits to main branch, last one 4 hours ago
Create and run high-performance macOS and Linux VMs on Apple Silicon, with built-in support for AI agents.
Created
2025-01-31
122 commits to main branch, last one a day ago
Ui.Vision Open-Source RPA Software with Computer Vision, OCR, Anthropic Computer Use. Selenium IDE import/export.
Created
2017-08-04
96 commits to master branch, last one about a month ago
Open Source Generative Process Automation (i.e. Generative RPA). AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models
Created
2023-04-12
939 commits to main branch, last one 29 days ago
[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
Created
2024-10-31
267 commits to main branch, last one 7 days ago
A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.
Created
2025-01-05
25 commits to main branch, last one 7 days ago
AI computer use powered by open source LLMs and E2B Desktop Sandbox
Created
2024-10-31
108 commits to master branch, last one 7 days ago
An open-source, end-to-end, VLM-based GUI Agent
Created
2023-11-28
47 commits to main branch, last one 29 days ago
A fork of Anthropic Computer Use that you can run on Mac computers to give Claude and other AI models autonomous access to your computer.
Created
2024-10-24
19 commits to main branch, last one 4 months ago
Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.
Created
2024-07-29
86 commits to main branch, last one 4 months ago
Desktop app powered by Claude’s computer use capability to control your computer
Created
2024-10-25
24 commits to main branch, last one about a month ago
A framework to enable autonomous Android and computer use using any LLM (local or remote)
Created
2024-12-16
44 commits to main branch, last one 21 days ago
This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use".
Created
2024-12-12
194 commits to main branch, last one about a month ago
A general AI agent framework that can be adapted to various tasks and environments.
Created
2023-05-29
100 commits to main branch, last one about a month ago
A curated list of awesome resources, tools, research papers, and projects related to the concept of Large Language Model Operating Systems (LLM-OS).
Created
2024-05-13
24 commits to main branch, last one 17 days ago
✨ Use natural language to control your browser, powered by an LLM and Playwright
Created
2024-11-08
9 commits to main branch, last one 4 months ago
Mark web pages for use with vision-language models
Created
2024-04-29
87 commits to main branch, last one 2 months ago
Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups
Created
2024-11-04
7 commits to main branch, last one 2 months ago
Meet WebHive, the AI-powered browser that takes care of tasks for you. No more endless clicks: tell it what you need, and it gets it done.
Created
2025-01-29
297 commits to main branch, last one about a month ago