Category

Browser & computer-use agents — let AI drive web and desktop apps

Browser Use, Skyvern, Claude Computer Use, OpenAI Operator, Manus, and more: models that click through pages, fill in forms, and run workflows.

This category, agents that “**drive the GUI instead of only calling APIs**”, took off in 2025. Three form factors are common: **in-browser extensions** (Comet and similar), **hosted virtual desktops** (Operator, Claude Computer Use), and **open-source, self-hosted tools** (Browser Use, Skyvern, Reworkd). Ask three questions up front: does the agent run in the user’s live environment, how are wrong actions **rolled back and audited**, and how are **credentials** (cookies, 2FA, SSO) scoped down?

Browser agents vs RPA

RPA scripts run against fixed widgets or screen coordinates. Browser agents plan each step with an LLM, so they adapt better to change but are less deterministic. Use RPA for flows that never change, and browser agents when page layouts shift or the data is semi-structured.

Can I trust Operator/Claude Computer Use with sensitive accounts?

Use ephemeral credentials, least-privilege sessions, and sandboxed VMs. Read the vendor’s data-retention and model-training clauses, and gate critical actions behind human confirmation.
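
As a rough illustration of the human-confirmation gate described above, here is a minimal Python sketch. It does not use the API of any tool named on this page: `Action`, `CRITICAL_KINDS`, and `gate` are hypothetical names, and the set of action kinds treated as critical is an assumption you would replace with your own policy. Every decision is also appended to an audit log, which touches on the audit question raised in the intro.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical policy: action kinds that must never run without human approval.
CRITICAL_KINDS = {"submit_payment", "delete_record", "send_email"}


@dataclass(frozen=True)
class Action:
    kind: str    # e.g. "click", "type", "submit_payment"
    target: str  # human-readable description of the element or URL


def gate(action: Action,
         confirm: Callable[[Action], bool],
         audit_log: list[str]) -> bool:
    """Return True if the action may run; record every decision for audit."""
    if action.kind not in CRITICAL_KINDS:
        # Non-critical actions pass through, but are still logged.
        audit_log.append(f"auto-approved {action.kind} on {action.target}")
        return True
    # Critical actions wait for an explicit human decision.
    approved = confirm(action)
    verdict = "approved" if approved else "rejected"
    audit_log.append(f"human-{verdict} {action.kind} on {action.target}")
    return approved
```

In practice `confirm` would be whatever approval channel the team already uses (a CLI prompt, a chat message, a ticket); the agent loop calls `gate` before executing each planned step and skips the step when it returns `False`.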

Who should self-host (Browser Use, Skyvern)?

Teams that need custom policies, must keep credentials on-premises, or run high-volume scraping or ticketing work. Stability and anti-bot resilience still take engineering effort.

Tools in this category

See each tool’s detail page for an overview and official links. Related picks in this category are also worth a look.