📦 Web Scraper — 智能网页抓取

v1.0.0

支持 JavaScript 渲染的网页抓取技能,可用 CSS 选择器、XPath 或 AI 智能提取,一键获取结构化数据。

0· 162·0 当前·0 累计
下载技能包
最后更新
2026/3/24
0
安全扫描
VirusTotal
无害
查看报告
OpenClaw
可疑
high confidence
The skill's description promises JS-rendering scraping (Playwright/Puppeteer) and shows commands that run scripts/scrape.py, but the package contains no code, no install steps, and declares no dependencies — the parts don't line up.
评估建议
This skill is incomplete and ambiguous: it documents commands that run scripts/scrape.py and references Playwright/Puppeteer, but the package contains no code, no install instructions, and no trusted source URL. Before installing or enabling it: 1) ask the publisher for the source code or a real homepage/README and a dependency list (Python version, required pip packages or npm packages, Playwright/browser binaries); 2) require an explicit install spec or packaged binary from a trusted host (Git...
详细分析 ▾
用途与能力
The skill claims JavaScript rendering support (Playwright/Puppeteer) and crawling features, but declares no required binaries, no environment variables, and provides no code or install spec. A scraping tool that needs browser automation would normally list Node/Python packages, a browser driver, or an install step; those are missing, which is disproportionate and incoherent.
指令范围
SKILL.md instructs the agent to run 'python3 scripts/scrape.py' with various flags (rendering, crawling, AI extraction). There is no scripts/scrape.py in the bundle. The instructions therefore point to executing local code that doesn't exist. The doc also implies use of heavy runtime components (Playwright/Puppeteer) but gives no guidance on installing or sandboxing them.
安装机制
There is no install spec. Given the stated features (JS rendering, Playwright/Puppeteer), an installation step is expected (pip/npm installs, browser binaries). The absence of an install mechanism leaves ambiguity about where the code would come from and how dependencies would be provisioned — increasing risk if an agent tries to fetch/install packages at runtime.
凭证需求
The skill declares no environment variables or credentials, which is consistent with a simple, local scraper. However, it also omits declaring expected system binaries or package requirements (python, node, playwright browsers). If the scraper needs authentication for target sites, those credentials aren't declared. The lack of declared dependencies is the main proportionality issue.
持久化与权限
always is false and there are no claims of modifying other skills or agent-wide config. Autonomous invocation is allowed (default) but that alone is not a red flag.
安全有层次,运行前请审查代码。

运行时依赖

无特殊依赖

版本

latestv1.0.02026/3/24

Initial release of jpeng-web-scraper. - Supports web scraping for static and JavaScript-rendered pages. - Flexible data extraction using CSS selectors, XPath, or regex. - Includes AI-powered extraction for structured information. - Allows crawling multiple pages with rate limiting and robots.txt respect. - Provides simple command-line usage examples for various scenarios.

无害

安装命令

点击复制
官方npx clawhub@latest install jpeng-web-scraper
镜像加速npx clawhub@latest install jpeng-web-scraper --registry https://cn.longxiaskill.com
数据来源ClawHub ↗ · 中文优化:龙虾技能库