분류

Browser & computer-use agents — let AI drive web and desktop apps

Browser Use, Skyvern, Claude Computer Use, OpenAI Operator, Manus and more—models that click pages, fill forms, and run workflows.

This lane, “**drive the GUI instead of only calling APIs**”, broke out in 2025. Three shapes are common: **in-browser extensions** (Comet, etc.), **hosted virtual desktops** (Operator, Claude Computer Use), and **open-source self-hosted** (Browser Use, Skyvern, Reworkd). Ask three questions up front: does it run in the user’s live environment, how do wrong actions **roll back and audit**, and how are **credentials** (cookies, 2FA, SSO) scoped down?

편집 / GSC 추가

Browser agents vs RPA

RPA is scripted against fixed widgets/coords. Browser agents plan with an LLM and adapt better but are less deterministic. Use RPA for 100%-fixed flows and browser agents when pages shift or data is semi-structured.

Can I trust Operator/Claude Computer Use with sensitive accounts?

Use ephemeral credentials, least-privilege sessions, and sandboxed VMs. Read retention and training clauses; gate critical actions behind a human confirm.

Who should self-host (Browser Use, Skyvern)?

Teams that need custom policies, keep credentials on-prem, or run high-volume scraping/ticket work. Stability and anti-bot resilience still take engineering.

이 분류의 도구

요약과 공식 링크는 각 상세 페이지에서 확인하고, 같은 분류의 관련 항목도 살펴보세요.