3 results found
The first open-source Artificial Narrow Intelligence generalist agentic framework, a Computer-Using Agent that fully operates graphical user interfaces (GUIs) using only natural language. Uses Visuali...
Created 2024-01-01
69 commits to main branch, last one 4 days ago
This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use".
Created 2024-12-12
194 commits to main branch, last one 9 hours ago
Mark web pages for use with vision-language models
Created 2024-04-29
87 commits to main branch, last one about a month ago