19 results found

626
6.5k
mit
54
Upsonic is a reliability-focused agent framework with dockerized, server-client architecture and MCP
Created 2024-05-26
1,024 commits to master branch, last one 8 hours ago
Let AI be your browser operator.
Created 2024-07-23
317 commits to main branch, last one 18 hours ago
189
2.6k
apache-2.0
46
A GUI Agent application based on UI-TARS (Vision-Language Model) that allows you to control your computer using natural language.
Created 2025-01-19
81 commits to main branch, last one 18 hours ago
304
1.4k
other
63
Ui.Vision Open-Source RPA Software with Computer Vision, OCR, Anthropic Computer Use. Selenium IDE import/export.
Created 2017-08-04
96 commits to master branch, last one 20 days ago
Open Source Generative Process Automation (i.e. Generative RPA). AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models
Created 2023-04-12
937 commits to main branch, last one about a month ago
54
951
apache-2.0
14
Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
Created 2024-10-31
232 commits to main branch, last one a day ago
56
779
unknown
9
A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.
Created 2025-01-05
19 commits to main branch, last one about a month ago
120
738
unknown
12
A fork of Anthropic Computer Use that you can run on Mac computers to give Claude and other AI models autonomous access to your computer.
Created 2024-10-24
19 commits to main branch, last one 3 months ago
54
721
apache-2.0
17
An open-sourced end-to-end VLM-based GUI Agent
Created 2023-11-28
45 commits to main branch, last one 17 days ago
90
658
apache-2.0
10
Secure AI computer use powered by E2B Desktop Sandbox
Created 2024-10-31
95 commits to master branch, last one a day ago
Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.
Created 2024-07-29
86 commits to main branch, last one 2 months ago
Desktop app powered by Claude’s computer use capability to control your computer
Created 2024-10-25
24 commits to main branch, last one 16 days ago
A framework to enable autonomous Android and computer use using any LLM (local or remote)
Created 2024-12-16
41 commits to main branch, last one 3 days ago
This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use".
Created 2024-12-12
188 commits to main branch, last one a day ago
16
99
apache-2.0
3
A general AI agent framework that can be adapted to various tasks and environments.
Created 2023-05-29
100 commits to main branch, last one 10 days ago
A curated list of awesome resources, tools, research papers, and projects related to the concept of Large Language Model Operating Systems (LLM-OS).
Created 2024-05-13
22 commits to main branch, last one 4 days ago
✨ Use natural language to control your browser, powered by LLM and Playwright
Created 2024-11-08
9 commits to main branch, last one 2 months ago
Mark web pages for use with vision-language models
Created 2024-04-29
87 commits to main branch, last one about a month ago
Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups
Created 2024-11-04
7 commits to main branch, last one about a month ago