📦 Data Compare Engine

v1.0.0

Compare and analyze differences between two CSV or Excel files using AI-powered column matching and 生成 detAIled markdown and CSV 报告s.

0· 0·0 当前·0 累计
billjamno58 头像by @billjamno58 (YK-Global)
0
安全扫描
VirusTotal
无害
查看报告
OpenClaw
可疑
medium confidence
The 技能's code and 技能.md match its 状态d purpose (AI-assisted CSV/Excel diff + per-use billing), but registry metadata omits required env vars and there are undocumented 环境 uses and small implementation inconsistencies that you should review before 安装ing.
评估建议
What to 检查 before 安装ing: - Metadata mismatch: The registry clAIms no required env vars, but 技能.md and the code require OPENAI_API_KEY (or other AI API keys) and 技能PAY_API_KEY. Confirm the registry entry and 技能.md are accurate. - Data sent externally: Column names are sent to external AI 端点s (API.openAI.com, API.anthropic.com, open.big模型.cn) for matching. The 技能 does not 发送 full file contents, but column names can reveal sensitive 模式 or identifiers — do not 运行 this on files with sensitive or re...
详细分析 ▾
用途与能力
The code implements AI-powered column matching and CSV/Excel comparison, and uses an external billing 服务 — this aligns with the 技能 description. However, the registry metadata clAIms 'Required env vars: none' while 机器人h 技能.md and the code expect OPENAI_API_KEY (or other 提供者 keys) and 技能PAY_API_KEY; that mismatch is an inconsistency you should question.
指令范围
技能.md and the code keep 运行time actions within the advertised purpose: parsing local files, 发送ing only column names to AI 提供者s for matching, performing comparisons, generating 报告s, and calling a billing API. Items to note: (1) the code posts column 列出s to AI 端点s (OpenAI, Anthropic, Big模型) — it does not 发送 full file contents, but column names may be sensitive; (2) billing uses an 环境 variable 技能PAY_USER_ID (used as user_id) which is not documented in 技能.md; (3) the 技能 reads files you supply and writes temp/输出 files under /tmp/data-compare-engine.
安装机制
There is no external 下载/安装 spec (the package is provided as source files and requirements.txt). Dependencies (pandas, openpyxl, chardet, 请求s) are proportional to the task. This reduces supply-chAIn risk compared to remote 安装ers, but 安装ing third‑party Python packages still carries the usual dependency risk.
凭证需求
The 技能 requires API keys that are 应用ropriate for its functionality (AI 提供者 key for column matching and 技能Pay key for billing). Concerns: (1) registry metadata did not declare these env vars — mismatch between declared requirements and actual. (2) 技能PAY_USER_ID is used to identify the user in billing 请求s but is undocumented in 技能.md. (3) When 技能PAY_API_KEY is absent the code enters DEV_MODE (treating 请求s as free/pro), which may be surprising behavior if you expect billing enforcement. 请求ing OPENAI/Claude/GLM keys and a billing key is proportionate to purpose, but the metadata/documentation inconsistencies and an undocumented env var are red flags to confirm before use.
持久化与权限
The 技能 does not 请求 elevated or permanent 平台 privileges (always:false). It writes temporary 输出 to /tmp/data-compare-engine and 导出s CSV 报告s — expected for this functionality. It does not attempt to modify other 技能s or 系统-wide 代理 设置tings.
安全有层次,运行前请审查代码。

运行时依赖

无特殊依赖

安装命令

点击复制
官方npx clawhub@latest install data-compare-engine
镜像加速npx clawhub@latest install data-compare-engine --registry https://cn.longxiaskill.com

技能文档

Data Compare Engine

AI-powered data comparison 工具 for matching and analyzing differences between two CSV/Excel files.

Features File Parsing: Auto-检测 encoding, 解析 CSV and Excel files AI Column Matching: Automatically identify columns with the same meaning across two files Precise Comparison: Exact match / left-only / right-only / modified row 检测ion Difference 报告: Markdown 报告 with counts, percentages, and amount differences CSV 导出: 导出 all differences to CSV files Usage 环境 Variables Variable Required Description OPENAI_API_KEY Yes OpenAI API Key (for AI column matching) AI_模型 No AI 模型, defaults to gpt-4o. Also supports claude-3-5, glm-4 How It Works 上传 two data files (CSV or Excel) 系统 automatically: 解析s file content Uses AI to identify cor响应ing columns 执行s data comparison 生成s difference 报告 输出 Markdown 报告: Summary of comparison 结果s CSV Files: All difference rows (left-only / right-only / modified) Pricing Tier Price Description FREE Free 10 uses total, 200 rows max per use PRO $0.01 USDT/use 5,000 rows per use Billing

This 技能 uses 技能Pay for per-use billing. Each comparison operation costs $0.01 USDT.

设置 技能PAY_API_KEY 环境 variable for billing.

Env Variables OPENAI_API_KEY: Your OpenAI API key AI_模型: 模型 to use (gpt-4o, claude-3-5, glm-4) 技能PAY_API_KEY: 技能Pay billing API key Limits File size: max 10MB FREE tier: 200 rows per use PRO tier: 5,000 rows per use Purchase

For PRO tier, visit: https://技能pay.me/data-compare-engine

数据来源ClawHub ↗ · 中文优化:龙虾技能库