Browser Use — 浏览器自动化

Name: Browser Use — 浏览器自动化
Rating: 4 (84 reviews)
Author: shawn pana

shawn pana

Browser Use — 浏览器自动化

v2.0.1

自动化浏览器交互，用于网页测试、表单填写、截屏和数据提取。支持头部模式、连接本地Chrome、云浏览器等，提供快速、持久的浏览器自动化能力。

84· 35,800·401 当前·422 累计·💬 4

by @shawnpana (shawn pana)·MIT-0

浏览器自动化自动化测试工具数据分析

下载技能包

License

MIT-0

最后更新

2026/4/9

安全扫描

VirusTotal

可疑

查看报告

OpenClaw

可疑

medium confidence

该技能的指令与浏览器自动化工具一致，但赋予代理敏感权限（连接真实Chrome配置文件、读写cookies、执行任意CDP/JS、上传本地文件、使用云API密钥）而未明确声明或限制这些敏感访问。

评估建议

该技能看似合法的浏览器自动化CLI，但如果启用，将授予代理对浏览器和本地环境的广泛访问权限。使用前请：（1）验证二进制文件来源，（2）使用临时会话/配置文件，（3）仅在完全信任时提供秘密或主浏览器配置文件，（4）优先使用临时会话并删除存储的API密钥，（5）如果需要更强的保证，请请求技能源代码或签名发布版本并在隔离环境中运行。...

详细分析 ▾

✓ 用途与能力

名称/描述与运行时指令匹配：SKILL.md 文档记录了一个 CLI，用于导航页面、与元素交互、截屏、提取数据，并连接到本地或云浏览器。请求访问浏览器配置文件、cookies 和云 API 密钥与浏览器自动化工具一致。

⚠ 指令范围

指令暴露原始CDP和Python REPL，可以执行任意CDP命令和页面中的JavaScript、拦截网络请求、读取cookies。还描述了连接到用户现有的Chrome（保留登录/cookies）和上传本地文件的命令。这些是合法的自动化功能，但风险高：如果误用，代理可以访问和泄露敏感浏览数据或本地文件。SKILL.md 未对可以访问的页面/数据设置限制或防护措施。

✓ 安装机制

这是一个仅有指令的技能，没有安装规范或捆绑代码 — 安装风险最低。假设PATH中已经存在‘browser-use’ CLI，并且代理可以通过shell调用它。

⚠ 凭证需求

元数据未列出必需的环境变量，但文档引用了可选的环境/配置项（BROWSER_USE_API_KEY、BROWSER_USE_SESSION）和 ~/.browser-use下的持久文件/sockets。该技能可以访问浏览器cookies和配置文件，并建议保存API密钥；这些是敏感的，但未在requires.env中声明。缺乏对这些可选凭证/配置路径的明确声明降低了透明度。

ℹ 持久化与权限

always:false（良好）。该技能文档记录了创建会话守护进程、Unix sockets（~/.browser-use/{name}.sock）和持久云配置文件。这种持久性对于长时间运行的浏览器守护进程是预期的，但这意味着状态（以及任何存储的API密钥或会话数据）将保留在磁盘上 — SKILL.md 未说明API密钥存储位置或如何在‘cloud logout’之外删除它们。

安全有层次，运行前请审查代码。

License

MIT-0

可自由使用、修改和再分发，无需署名。

查看条款 ↗

运行时依赖

无特殊依赖

版本

latestv2.0.12026/1/26

添加了两份参考文档：CDP Python使用和多会话工作流。澄清了头部模式、连接Chrome和云浏览器使用的区别。明确了标签管理命令在文档中的位置。改进了云连接、API密钥注册和代理自注册的说明。添加了多会话（并行浏览器）使用指南。更新了故障命令和会话清理的故障排除指南。

● 可疑

安装命令点击复制

官方npx clawhub@latest install browser-use

镜像加速npx clawhub@latest install browser-use --registry https://www.longxiaskill.com

技能文档

请参见下方翻译的SKILL.md内容（由于字符限制，仅提供关键部分翻译，完整内容请参考原始文档）

The browser-use command provides fast, persistent browser automation. A background daemon keeps the browser open across commands, giving ~50ms latency per call.

Prerequisites

browser-use doctor    # Verify installation

For setup details, see https://github.com/browser-use/browser-use/blob/main/browser_use/skill_cli/README.md

Core Workflow

Navigate: browser-use open — launches headless browser and opens page
Inspect: browser-use state — returns clickable elements with indices
Interact: use indices from state (browser-use click 5, browser-use input 3 "text")
Verify: browser-use state or browser-use screenshot to confirm
Repeat: browser stays open between commands

If a command fails, run browser-use close first to clear any broken session, then retry.

To use the user's existing Chrome (preserves logins/cookies): run browser-use connect first. To use a cloud browser instead: run browser-use cloud connect first. After either, commands work the same way.

Browser Modes

browser-use open                          # Default: headless Chromium (no setup needed)
browser-use --headed open                 # Visible window (for debugging)
browser-use connect                            # Connect to user's Chrome (preserves logins/cookies)
browser-use cloud connect                      # Cloud browser (zero-config, requires API key)
browser-use --profile "Default" open      # Real Chrome with specific profile

After connect or cloud connect, all subsequent commands go to that browser — no extra flags needed.

Commands

# Navigation browser-use open # Navigate to URL browser-use back # Go back in history browser-use scroll down # Scroll down (--amount N for pixels) browser-use scroll up # Scroll up browser-use tab list # List all tabs browser-use tab new [url] # Open a new tab (blank or with URL) browser-use tab switch # Switch to tab by index browser-use tab close [index...] # Close one or more tabs # Page State — always run state first to get element indices browser-use state # URL, title, clickable elements with indices browser-use screenshot [path.png] # Screenshot (base64 if no path, --full for full page) # Interactions — use indices from state browser-use click # Click element by index browser-use click # Click at pixel coordinates browser-use type "text" # Type into focused element browser-use input "text" # Click element, then type browser-use keys "Enter" # Send keyboard keys (also "Control+a", etc.) browser-use select "option" # Select dropdown option browser-use upload # Upload file to file input browser-use hover # Hover over element browser-use dblclick # Double-click element browser-use rightclick # Right-click element # Data Extraction browser-use eval "js code" # Execute JavaScript, return result browser-use get title # Page title browser-use get html [--selector "h1"] # Page HTML (or scoped to selector) browser-use get text # Element text content browser-use get value # Input/textarea value browser-use get attributes # Element attributes browser-use get bbox # Bounding box (x, y, width, height) # Wait browser-use wait selector "css" # Wait for element (--state visible|hidden|attached|detached, --timeout ms) browser-use wait text "text" # Wait for text to appear # Cookies browser-use cookies get [--url ] # Get cookies (optionally filtered) browser-use cookies set # Set cookie (--domain, --secure, --http-only, --same-site, --expires) browser-use cookies clear [--url ] # Clear cookies browser-use cookies export # Export to JSON browser-use cookies import # Import from JSON

# Session browser-use close # Close browser and stop daemon browser-use sessions # List active sessions browser-use close --all # Close all sessions

For advanced browser control (CDP, device emulation, tab activation), see references/cdp-python.md.

Cloud API

browser-use cloud connect                 # Provision cloud browser and connect (zero-config)
browser-use cloud login          # Save API key (or set BROWSER_USE_API_KEY)
browser-use cloud logout                  # Remove API key
browser-use cloud v2 GET /browsers        # REST passthrough (v2 or v3)
browser-use cloud v2 POST /tasks '{"task":"...","url":"..."}'
browser-use cloud v2 poll        # Poll task until done
browser-use cloud v2 --help               # Show API endpoints

cloud connect provisions a cloud browser with a persistent profile (auto-created on first use), connects via CDP, and prints a live URL. browser-use close disconnects AND stops the cloud browser. For custom browser settings (proxy, timeout, specific profile), use cloud v2 POST /browsers directly with the desired parameters.

Agent Self-Registration

Only use this if you don't already have an API key (check browser-use doctor to see if api_key is set). If already logged in, skip this entirely.

browser-use cloud signup — get a challenge
Solve the challenge
browser-use cloud signup --verify — verify and save API key
browser-use cloud signup --claim — generate URL for a human to claim the account

Tunnels

browser-use tunnel                  # Start Cloudflare tunnel (idempotent)
browser-use tunnel list                   # Show active tunnels
browser-use tunnel stop             # Stop tunnel
browser-use tunnel stop --all             # Stop all tunnels

Profile Management

browser-use profile list                  # List detected browsers and profiles
browser-use profile sync --all            # Sync profiles to cloud
browser-use profile update                # Download/update profile-use binary

Command Chaining

Commands can be chained with &&. The browser persists via the daemon, so chaining is safe and efficient.

browser-use open https://example.com && browser-use state
browser-use input 5 "user@example.com" && browser-use input 6 "password" && browser-use click 7

Chain when you don't need intermediate output. Run separately when you need to parse state to discover indices first.

Common Workflows

Authenticated Browsing

When a task requires an authenticated site (Gmail, GitHub, internal tools), use Chrome profiles:

browser-use profile list                           # Check available profiles
# Ask the user which profile to use, then:
browser-use --profile "Default" open https://github.com  # Already logged in

Exposing Local Dev Servers

browser-use tunnel 3000                            # → https://abc.trycloudflare.com
browser-use open https://abc.trycloudflare.com     # Browse the tunnel

Multiple Browsers

For subagent workflows or running multiple browsers in parallel, use --session NAME. Each session gets its own browser. See references/multi-session.md.

Configuration

browser-use config list                            # Show all config values
browser-use config set cloud_connect_proxy jp      # Set a value
browser-use config get cloud_connect_proxy         # Get a value
browser-use config unset cloud_connect_timeout     # Remove a value
browser-use doctor                                 # Shows config + diagnostics
browser-use setup                                  # Interactive post-install setup

Config stored in ~/.browser-use/config.json.

Global Options

Option	Description
`--headed`	Show browser window
`--profile [NAME]`	Use real Chrome (bare `--profile` uses "Default")
`--cdp-url`	Connect via CDP URL (`http://` or `ws://`)
`--session NAME`	Target a named session (default: "default")
`--json`	Output as JSON
`--mcp`	Run as MCP server via stdin/stdout

Tips

Always run state first to see available elements and their indices
Use --headed for debugging to see what the browser is doing
Sessions persist — browser stays open between commands
CLI aliases: bu, browser, and browseruse all work
If commands fail, run browser-use close first, then retry

Troubleshooting

Browser won't start? browser-use close then browser-use --headed open
Element not found? browser-use scroll down then browser-use state
Run diagnostics: browser-use doctor

Cleanup

browser-use close                         # Close browser session
browser-use tunnel stop --all             # Stop tunnels (if any)

数据来源：ClawHub ↗ · 中文优化：龙虾技能库

OpenClaw 技能定制 / 插件定制 / 私有工作流定制

免费技能或插件可能存在安全风险，如需更匹配、更安全的方案，建议联系付费定制

了解定制服务

License

运行时依赖

版本

安装命令 点击复制

技能文档

Prerequisites

Core Workflow

Browser Modes

Commands

Cloud API

Agent Self-Registration

Tunnels

Profile Management

Command Chaining

Common Workflows

Authenticated Browsing

Exposing Local Dev Servers

Multiple Browsers

Configuration

Global Options

Tips

Troubleshooting

Cleanup

安装命令点击复制