# 🌐 Claw Browser Skill
This skill gives the agent "Eyes" to browse the web using a real
Chromium browser.
Unlike simple HTTP requests, claw-browser:
- Renders JavaScript
- Handles dynamic content
- Loads SPA applications
- Takes real screenshots
- Can run in visible (non-headless) mode
------------------------------------------------------------------------
# 🛠️ Installation & Setup (READ FIRST)
⚠️ Before using this tool, the agent MUST set up the Python environment and download the browser binaries.
If `venv` does not exist, run this setup command block automatically:
```bash
# 1. Create virtual environment
python3 -m venv venv
# 2. Activate it
source venv/bin/activate
# 3. Install dependencies
pip install -r requirements.txt
# 4. Download Chromium binaries (CRITICAL STEP)
playwright install chromium
```
⚠️ The `playwright install chromium` step is CRITICAL.
Without it, the browser will NOT launch.
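
The "set up only if `venv` is missing" rule can be sketched in Python. This is a minimal illustration, not part of the skill itself: the helper names `needs_setup` and `setup_commands` are hypothetical, and the paths assume the POSIX venv layout (`venv/bin/...`) that the setup block above uses.

```python
from pathlib import Path


def needs_setup(project_dir: str = ".") -> bool:
    """Return True when no virtual environment exists yet (hypothetical helper)."""
    # On POSIX systems, `python3 -m venv venv` creates venv/bin/python.
    return not (Path(project_dir) / "venv" / "bin" / "python").exists()


def setup_commands() -> list[list[str]]:
    """The documented setup steps as argument lists for subprocess.run."""
    return [
        ["python3", "-m", "venv", "venv"],
        # Calling the venv's own interpreter stands in for `source venv/bin/activate`.
        ["venv/bin/python", "-m", "pip", "install", "-r", "requirements.txt"],
        ["venv/bin/python", "-m", "playwright", "install", "chromium"],
    ]
```

An agent could call `needs_setup()` and, when it returns `True`, pass each entry of `setup_commands()` to `subprocess.run(cmd, check=True)` so that a failed step stops the sequence.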
------------------------------------------------------------------------
# 🔧 Tools & Usage
`browser.py` arguments:

| Argument | Description |
|---|---|
| `--url` | The website to visit (required) |
| `--action` | Action to perform: `read` or `screenshot` |
| `--visible` | (Optional) Runs the browser in visible mode (not headless). Use if the user wants to see the browser. |
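
For orientation, the documented arguments could be declared with `argparse` roughly as follows. This is a sketch under assumptions, not the actual contents of `browser.py`: the function name `build_parser` is hypothetical, and the `read` default for `--action` is an assumption, since the source does not state a default.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the documented arguments."""
    parser = argparse.ArgumentParser(prog="browser.py")
    parser.add_argument("--url", required=True, help="The website to visit")
    parser.add_argument(
        "--action",
        choices=["read", "screenshot"],
        default="read",  # assumption: the source does not document a default
        help="Action to perform",
    )
    parser.add_argument(
        "--visible",
        action="store_true",
        help="Run the browser in visible (non-headless) mode",
    )
    return parser


args = build_parser().parse_args(["--url", "https://example.com", "--action", "read"])
print(args.url, args.action, args.visible)
```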
------------------------------------------------------------------------
# 🚀 Execution Examples
⚠️ Always activate the virtual environment first.
------------------------------------------------------------------------
1️⃣ Read a Page (Headless - Default)
```bash
source venv/bin/activate && python browser.py --url "https://github.com/trending" --action read
```
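
If the agent drives the tool from Python rather than a shell, the same invocation might be assembled like this. It is a minimal sketch: `build_command` and `run_browser` are hypothetical helpers, and `run_browser` assumes `browser.py` writes its result to standard output, which the source does not confirm.

```python
import subprocess


def build_command(url: str, action: str = "read", visible: bool = False) -> list[str]:
    """Build the argument list for browser.py (hypothetical helper)."""
    if action not in ("read", "screenshot"):
        raise ValueError(f"unsupported action: {action!r}")
    # Using the venv's interpreter directly has the same effect as activating first.
    cmd = ["venv/bin/python", "browser.py", "--url", url, "--action", action]
    if visible:
        cmd.append("--visible")
    return cmd


def run_browser(url: str, action: str = "read", visible: bool = False) -> str:
    """Run browser.py and return whatever it printed (assumed to be the result)."""
    result = subprocess.run(
        build_command(url, action, visible),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError on a non-zero exit code
    )
    return result.stdout
```

Passing an argument list instead of a shell string avoids quoting problems when a URL contains characters such as `&` or `?`.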
------------------------------------------------------------------------
2️⃣ Take a Screenshot of a News Site
```bash
source venv/bin/activate && python browser.py --url "https://news.ycombinator.com" --action screenshot
```
This saves the screenshot as `evidence.png`.
------------------------------------------------------------------------
3️⃣ Visible Mode (Shows Browser UI - Great for Demos)
```bash
source venv/bin/activate && python browser.py --url "https://google.com" --action read --visible
```
This will launch a real Chromium window on screen.
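
To make the headless and visible modes concrete, here is one way the `read` and `screenshot` actions could map onto Playwright's synchronous Python API. This is an illustrative sketch, not the actual `browser.py`: the function names, the `networkidle` wait, and reading text from `body` are all assumptions.

```python
def launch_options(visible: bool) -> dict:
    """Translate the --visible flag into Playwright launch options."""
    # Playwright launches headless by default; visible mode turns that off.
    return {"headless": not visible}


def browse(url: str, action: str = "read", visible: bool = False) -> str:
    """Hypothetical equivalent of browser.py's two actions."""
    # Imported lazily so launch_options() works without Playwright installed.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.launch(**launch_options(visible))
        try:
            page = browser.new_page()
            # Assumption: wait for network activity to settle so JS-rendered content loads.
            page.goto(url, wait_until="networkidle")
            if action == "screenshot":
                page.screenshot(path="evidence.png")
                return "evidence.png"
            # Return the rendered, visible text of the page.
            return page.inner_text("body")
        finally:
            browser.close()
```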
------------------------------------------------------------------------
4️⃣ Visible Mode (WSL / Linux Fix) ⭐️
Use this if you get "Missing X server" or "Display not found" errors:
```bash
export DISPLAY=:0 && source venv/bin/activate && python browser.py --url "https://google.com" --action read --visible
```
This sets the `DISPLAY` variable so the browser can open on your screen.
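
When the tool is launched from Python, the same fix amounts to passing an environment that contains `DISPLAY`. The sketch below uses a hypothetical helper, `env_with_display`. Unlike the `export` above, which always overwrites the variable, it fills in `:0` only when `DISPLAY` is not already set.

```python
import os


def env_with_display(base_env=None, default: str = ":0") -> dict:
    """Return a copy of the environment with DISPLAY set if it was missing."""
    env = dict(os.environ if base_env is None else base_env)
    # Keep an existing DISPLAY (e.g. from X forwarding); otherwise fall back to the default.
    env.setdefault("DISPLAY", default)
    return env
```

The result can be passed as the `env=` argument of `subprocess.run` when starting `browser.py` with `--visible`.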
------------------------------------------------------------------------
# 🧠 When Should the Agent Use This Skill?
Use `claw-browser` when:
- The page requires JavaScript rendering
- The site is dynamic (React, Vue, Angular, etc.)
- Screenshots are required
- The user explicitly asks to "open" or "see" a website
- Traditional HTTP requests fail
------------------------------------------------------------------------
# ⚡ Summary
`claw-browser` transforms your OpenClaw agent from a simple API caller into a real browser-powered assistant with visual capabilities.
It enables:
- Dynamic page interaction
- Visual verification
- Demo-ready browsing
- Real-world automation
------------------------------------------------------------------------
Made with ❤️ by @canbirlik