首页龙虾技能列表 › Pangolinfo Scrape (Legacy) — Pangolin 搜索与抓取

Pangolinfo Scrape (Legacy) — Pangolin 搜索与抓取

v1.0.1

使用 Pangolin APIs 搜索 Google 和抓取 Amazon,支持 AI Mode 搜索(Google AI Overview + 多轮对话)、标准 SERP 与 AI Overview 提取、Amazon 产品数据抓取和截图捕获。适用于程序化搜索 Google、获取 AI 摘要、抓取 SERP 结果、执行多轮 Google AI 搜索或提取 Amazon 产品数据。需要 Pangolin 账户和 PANGOLIN_TOKEN 或 PANGOLIN_EMAIL + PANGOLIN_PASSWORD 凭证。

0· 289·1 当前·1 累计
by @tammy-hash (Pangolinfo & AgentGo)·MIT-0
下载技能包
License
MIT-0
最后更新
2026/4/14
安全扫描
VirusTotal
无害
查看报告
OpenClaw
可疑
medium confidence
技能代码和 SKILL.md 与所述的抓取/搜索目的匹配,但注册表元数据缺少所需的 Pangolin 凭证,且来源不明 — 使用前应验证服务可靠性,对提供邮箱/密码或长期令牌保持谨慎。
评估建议
该包实现了宣称的 Pangolin 抓取功能,包含的 Python 客户端代码可读且无依赖。然而存在以下问题:(1) 注册表元数据错误列出无需环境变量,但脚本实际需要 PANGOLIN_TOKEN 或 PANGOLIN_EMAIL + PANGOLIN_PASSWORD,应将此不匹配视为红旗并要求发布者/注册表更正;(2) 脚本会将邮箱/密码发送至 https://scrapeapi.pangolinfo.com/auth 获取 bearer 令牌,并将令牌保存至 ~/.pangolin_token(文件权限设为 600),建议优先提供现有 PANGOLIN_TOKEN 而非账户密码;(3) 来源/主页未知,需在提供凭证前验证发布者和 scrapeapi.pangolinfo.com 的合法性;(4) 如决定安装使用,应在受限环境中运行(或手动检查/运行脚本),并考虑创建专用 Pangolin 账户或短期令牌以避免暴露主要凭证。...
详细分析 ▾
用途与能力
名称/描述承诺通过 Pangolin APIs 进行 Google SERP 和 Amazon 抓取,包含的 Python 客户端调用 scrapeapi.pangolinfo.com 端点来执行这些操作——能力与目的相符。然而,注册表元数据声称无需环境变量或主要凭证,而 SKILL.md 和脚本明确需要 PANGOLIN_TOKEN 或 PANGOLIN_EMAIL+PANGOLIN_PASSWORD。这种元数据遗漏是不一致的,应予以更正/澄清。
指令范围
SKILL.md 和脚本提供明确的 CLI 用法,仅指示对 Pangolin API 端点的网络调用、令牌缓存和 JSON 解析。指令不请求无关的系统文件或其他凭证。脚本写入令牌缓存(~/.pangolin_token)并使用环境变量;这些操作在 SKILL.md 和参考文档中有记录。
安装机制
无安装规范(仅提供说明和包含的 Python 脚本)。不执行外部下载或包安装。脚本零依赖(使用 Python 标准库),因此没有隐藏的安装程序风险。
凭证需求
要求 PANGOLIN_TOKEN 或 PANGOLIN_EMAIL+PANGOLIN_PASSWORD 与对外部 Pangolin API 进行身份验证是相称的。问题是注册表元数据列出无需环境变量(矛盾),且技能会将 bearer 令牌缓存在用户主目录(~/.pangolin_token)。如果提供邮箱+密码,这些凭证会被发送到外部认证端点;请确保您信任该服务,并尽可能优先使用令牌而非在环境变量中存储密码。
持久化与权限
技能不请求 'always: true' 或其他提升的平台权限。它将令牌缓存到其拥有的文件(~/.pangolin_token),这对于 CLI 客户端来说是典型的;它不会修改其他技能或系统范围的代理设置。
安全有层次,运行前请审查代码。

License

MIT-0

可自由使用、修改和再分发,无需署名。

运行时依赖

无特殊依赖

版本

latestv1.0.12026/3/7

文档更新:添加了新技能的规范链接。AI SERP:https://clawhub.ai/pangolinfo/pangolinfo-ai-serp;Amazon Scraper:https://clawhub.ai/pangolinfo/pangolinfo-amazon-scraper。无功能变更。

● 无害

安装命令 点击复制

官方npx clawhub@latest install pangolinfo-scrape-legacy
镜像加速npx clawhub@latest install pangolinfo-scrape-legacy --registry https://cn.clawhub-mirror.com

技能文档

New canonical (AI SERP): https://clawhub.ai/pangolinfo/pangolinfo-ai-serp Amazon skill: https://clawhub.ai/pangolinfo/pangolinfo-amazon-scraper

Search Google and scrape Amazon programmatically via Pangolin APIs. Extract AI overviews, organic search results, Amazon product data, and page screenshots.

Prerequisites

  • Python 3.6+ (uses only standard library)
  • Pangolin account at pangolinfo.com
  • Environment variables (one of):
- PANGOLIN_TOKEN -- existing bearer token - PANGOLIN_EMAIL + PANGOLIN_PASSWORD -- for automatic login

Quick Start

AI Mode Search (Google AI Overview)

python3 scripts/pangolin.py --q "what is quantum computing" --mode ai-mode

Standard SERP with AI Overview

python3 scripts/pangolin.py --q "how does java work" --mode serp --screenshot

Multi-Turn Dialogue

python3 scripts/pangolin.py --q "python web frameworks" --mode ai-mode \
  --follow-up "compare flask vs django" \
  --follow-up "which is better for beginners"

Amazon Product Detail

python3 scripts/pangolin.py --url "https://www.amazon.com/dp/B0DYTF8L2W" --mode amazon

Amazon Keyword Search

python3 scripts/pangolin.py --q "wireless mouse" --mode amazon --parser amzKeyword

Workflow

  • Authenticate -- Token resolved from env var, cache (~/.pangolin_token), or fresh login
  • Choose API mode -- ai-mode | serp | amazon
  • Execute -- Script builds the request, calls the API with retry logic
  • Parse output -- Structured JSON to stdout

Usage

AI Mode (--mode ai-mode)

Uses parserName: "googleAISearch" with udm=50 to get Google AI Mode results.

python3 scripts/pangolin.py --q "explain machine learning" --mode ai-mode

Output includes ai_overview with content paragraphs and source references.

AI Overview SERP (--mode serp)

Uses parserName: "googleSearch" for standard SERP results with AI overview extraction.

python3 scripts/pangolin.py --q "best programming languages 2025" --mode serp

Output includes both organic_results and optional ai_overview.

Multi-Turn Follow-Up

Add follow-up questions to an AI Mode search. Keep to 5 or fewer for optimal performance (more is allowed but slower):

python3 scripts/pangolin.py --q "kubernetes" --mode ai-mode \
  --follow-up "how to deploy" \
  --follow-up "monitoring tools" \
  --follow-up "cost optimization"

Amazon (--mode amazon)

Scrape Amazon product data using various parsers.

Product detail by URL:

python3 scripts/pangolin.py --url "https://www.amazon.com/dp/B0DYTF8L2W" --mode amazon

Keyword search:

python3 scripts/pangolin.py --q "mechanical keyboard" --mode amazon --parser amzKeyword

Best sellers:

python3 scripts/pangolin.py --url "https://www.amazon.com/gp/bestsellers/electronics" \
  --mode amazon --parser amzBestSellers

With custom zipcode and raw HTML:

python3 scripts/pangolin.py --url "https://www.amazon.com/dp/B0DYTF8L2W" \
  --mode amazon --zipcode 90210 --format rawHtml

Available Amazon parsers:

ParserUse Case
amzProductDetailSingle product page (default)
amzKeywordKeyword search results
amzProductOfCategoryCategory listing
amzProductOfSellerSeller's products
amzBestSellersBest sellers ranking
amzNewReleasesNew releases ranking
amzFollowSellerProduct variants / other sellers

Authentication Only

python3 scripts/pangolin.py --auth-only

Raw API Response

python3 scripts/pangolin.py --q "test" --mode ai-mode --raw

All CLI Options

--q QUERY          Search query
--url URL          Target URL (for Amazon product pages, category pages, etc.)
--mode MODE        ai-mode (default) | serp | amazon
--screenshot      Capture page screenshot (Google only)
--follow-up TEXT   Follow-up question (repeatable, ai-mode only)
--num N            Number of results (default: 10, Google only)
--parser PARSER    Amazon parser name (default: amzProductDetail)
--zipcode CODE     Amazon zipcode (default: 10041)
--format FMT       Amazon response format: json (default) | rawHtml | markdown
--auth-only        Authenticate and show token info
--raw              Output raw API response

Choosing the Right API

FeatureAI ModeSERPAmazon
ParsergoogleAISearchgoogleSearchamz* (7 types)
Input--q--q--url or --q
Primary outputAI-generated answerOrganic results + AI overviewProduct data
Multi-turnYes (via --follow-up)NoNo
ScreenshotYesYesNo
Best forAI answersSearch results with AI contextProduct & market data
Cost2 credits2 credits1 credit (json) / 0.75 (raw)

Output Format

Google (ai-mode / serp)

{
  "success": true,
  "task_id": "...",
  "results_num": 1,
  "ai_overview_count": 1,
  "ai_overview": [{"content": ["..."], "references": [{"title": "...", "url": "...", "domain": "..."}]}],
  "organic_results": [{"title": "...", "url": "...", "text": "..."}],
  "screenshot": "https://..."
}

Amazon

{
  "success": true,
  "task_id": "...",
  "url": "https://www.amazon.com/dp/...",
  "results_count": 1,
  "product": {"asin": "...", "title": "...", "price": "...", "star": "...", "rating": "..."}
}

See reference files for full response schemas.

Exit Codes

CodeMeaning
0Success
1API error (non-zero response code)
2Usage error (invalid arguments)
3Network error
4Authentication error

Troubleshooting

ProblemSolution
Auth failsCheck PANGOLIN_EMAIL and PANGOLIN_PASSWORD env vars
Empty AI overviewNot all queries trigger AI overview; try informational queries
Token invalid (1004)Script auto-refreshes; ensure email/password env vars are set
Insufficient credits (2001)Top up at pangolinfo.com
TimeoutScript retries 3x with backoff; check network
Amazon returns emptyVerify the URL and parser match (e.g. product URL + amzProductDetail)
See references/error-codes.md for the full error code reference.

Deep-Dive Documentation

ReferenceContent
references/ai-mode-api.mdAI Mode API schema, multi-turn dialogue mechanism
references/ai-overview-serp-api.mdAI Overview SERP API schema, organic result structure
references/amazon-api.mdAmazon Scrape API, all parser types, product fields
references/error-codes.mdError codes, auth lifecycle, credit management
数据来源:ClawHub ↗ · 中文优化:龙虾技能库
OpenClaw 技能定制 / 插件定制 / 私有工作流定制

免费技能或插件可能存在安全风险,如需更匹配、更安全的方案,建议联系付费定制

了解定制服务