19 results found
Upsonic is a reliability-focused agent framework with dockerized, server-client architecture and MCP
Created
2024-05-26
1,024 commits to master branch, last one 8 hours ago
Let AI be your browser operator.
Created
2024-07-23
317 commits to main branch, last one 18 hours ago
A GUI Agent application based on UI-TARS (Vision-Language Model) that allows you to control your computer using natural language.
Created
2025-01-19
81 commits to main branch, last one 18 hours ago
Ui.Vision Open-Source RPA Software with Computer Vision, OCR, Anthropic Computer Use. Selenium IDE import/export.
Created
2017-08-04
96 commits to master branch, last one 20 days ago
Open Source Generative Process Automation (i.e. Generative RPA). AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models
Created
2023-04-12
937 commits to main branch, last one about a month ago
Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
Created
2024-10-31
232 commits to main branch, last one a day ago
A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.
Created
2025-01-05
19 commits to main branch, last one about a month ago
A fork of Anthropic Computer Use that you can run on Mac computers to give Claude and other AI models autonomous access to your computer.
Created
2024-10-24
19 commits to main branch, last one 3 months ago
An open-source, end-to-end, VLM-based GUI Agent
Created
2023-11-28
45 commits to main branch, last one 17 days ago
Secure AI computer use powered by E2B Desktop Sandbox
Created
2024-10-31
95 commits to master branch, last one a day ago
Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.
Created
2024-07-29
86 commits to main branch, last one 2 months ago
Desktop app powered by Claude’s computer use capability to control your computer
Created
2024-10-25
24 commits to main branch, last one 16 days ago
A framework to enable autonomous Android and computer use using any LLM (local or remote)
Created
2024-12-16
41 commits to main branch, last one 3 days ago
This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use".
Created
2024-12-12
188 commits to main branch, last one a day ago
A general AI agent framework that can be adapted to various tasks and environments.
Created
2023-05-29
100 commits to main branch, last one 10 days ago
A curated list of awesome resources, tools, research papers, and projects related to the concept of Large Language Model Operating Systems (LLM-OS).
Created
2024-05-13
22 commits to main branch, last one 4 days ago
✨ Use natural language to control your browser, powered by LLM and Playwright
Created
2024-11-08
9 commits to main branch, last one 2 months ago
Mark web pages for use with vision-language models
Created
2024-04-29
87 commits to main branch, last one about a month ago
Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups
Created
2024-11-04
7 commits to main branch, last one about a month ago