\u270d\ufe0f Scribe Slop Detector — AI文档去噪

v1.0.0

扫描文档与文章，检测AI生成痕迹，输出“slop密度”评分与改进建议，一键提升内容可信度。

0· 23·1 当前·1 累计

by @athola

文档工具 AI模型访问数据分析安全数据处理

下载技能包项目主页

最后更新

2026/4/20

安全扫描

VirusTotal

Pending

查看报告

OpenClaw

Error

NULL

安全有层次，运行前请审查代码。

运行时依赖

无特殊依赖

版本

latestv1.0.02026/4/20

- 发布初版slop-detector技能，用于识别文档与文章中的AI生成痕迹。 - 检测词汇、短语、结构与奉承模式等常见生成文本特征。 - 基于检测到的痕迹计算slop密度评分，并提供可操作的改进建议。 - 支持语言检测与文化校准，可按语言匹配特定模式。 - 输出详细markdown报告，高亮高置信度痕迹、结构问题及改进建议。

● Pending

安装命令

点击复制

官方npx clawhub@latest install nm-scribe-slop-detector

镜像加速npx clawhub@latest install nm-scribe-slop-detector --registry https://cn.longxiaskill.com 镜像可用

需要定制？告诉我你的需求 →

技能文档

Night Market Skill — ported from claude-night-market/scribe. For the full experience with agents, hooks, and commands, install the Claude Code plugin. # AI 垃圾检测 AI 垃圾通过使用模式而非单个词汇来识别。单独一个 “delve” 或许可以接受，但若紧邻 “tapestry” 或 “embark” 等标记，则表明文本为生成内容。我们分析每 100 词中这些标记的密度、聚集程度，以及整体语气是否符合文档类型。 ## 执行工作流首先识别目标文件，并将其归类为技术文档、叙述性散文或代码注释，以便在分析时进行上下文感知评分。 ### 语言检测 - 利用功能词频率自动检测文本语言 - 使用显式 --lang 参数覆盖（en、de、fr、es） - 从 data/languages/{lang}.yaml 加载语言专属模式 - 检测置信度低时回退到英语 - 详见 modules/language-support.md 了解文化校准 ### 词汇与短语检测加载：@modules/vocabulary-patterns.md 我们将标记按置信度分为三级。一级词在 AI 文本中出现频率极高，如 “delve”、“multifaceted”、“leverage”。二级涵盖依赖上下文的过渡词如 “moreover” 或 “subsequently”，三级则识别空洞短语，例如 “In today's fast-paced world” 或 “cannot be overstated”。 | 词汇 | 上下文 | 人工替代 | |------|---------|-------------------| | delve | "delve into" | explore, examine, look at | | tapestry | "rich tapestry" | mix, combination, variety | | realm | "in the realm of" | in, within, regarding | | embark | "embark on a journey" | start, begin | | beacon | "a beacon of" | example, model | | spearheaded | formal attribution | led, started | | multifaceted | describing complexity | complex, varied | | comprehensive | describing scope | thorough, complete | | pivotal | importance marker | key, important | | nuanced | sophistication signal | subtle, detailed | | meticulous/meticulously | care marker | careful, detailed | | intricate | complexity marker | detailed, complex | | showcasing | display verb | showing, displaying | | leveraging | business jargon | using | | streamline | optimization verb | simplify, improve | ### 二级：中等置信度标记（每项 2 分）常见但依赖上下文： | 类别 | 词汇 | |----------|-------| | 过渡滥用 | moreover, furthermore, indeed, notably, subsequently | | 强度聚集 | significantly, substantially, fundamentally, profoundly | | 模糊堆叠 | potentially, typically, often, might, perhaps | | 动作夸大 | revolutionize, transform, unlock, unleash, elevate | | 空洞强调 | crucial, vital, essential, paramount | ### 三级：短语模式（每项 2–4 分） | 短语 | 分数 | 问题 | |--------|-------|-------| | "In today's fast-paced world" | 4 | 空洞开场 | | "It's worth noting that" | 3 | 填充语 | | "At its core" | 2 | 位置拐杖 | | "Cannot be overstated" | 3 | 空洞强调 | | "A testament to" | 3 | 归因陈词 | | "Navigate the complexities" | 4 | 商业腔 | | "Unlock the potential" | 4 | 营销腔 | | "Treasure trove of" | 3 | 过度比喻 | | "Game changer" | 3 | 流行词 | | "Look no further" | 4 | 销售口号 | | "Nestled in the heart of" | 4 | 旅游写作陈词 | | "Embark on a journey" | 4 | 夸张戏剧 | | "Ever-evolving landscape" | 4 | 技术陈词 | | "Hustle and bustle" | 3 | 填充语 | ## 第三步：结构模式检测加载：@modules/structural-patterns.md ### 破折号过度使用统计每 1000 词的破折号（—）数量： - 0–2：正常人类范围 - 3–5：升高，需审查用法 - 6+：强烈 AI 信号 ``bash # Count em dashes in file grep -o '—' file.md | wc -l ` ### 三列结构检测 AI 偏爱三项并列且押头韵： - "fast, efficient, and reliable" - "clear, concise, and compelling" - "robust, reliable, and resilient" 模式：adjective, adjective, and adjective 且发音相似。 ### 列表-散文比例统计 bullet 点数与段落句数： - >60% bullet：AI 倾向 - Emoji 引导的 bullet：技术文档中强烈 AI 信号 ### 句子长度一致性测量句子长度标准差： - 低方差（SD < 5 词）：AI 单调 - 高方差（SD > 10 词）：人类变化 ### 段落对称性 AI 生成“块状”文本，段落长度均一。检查段落是否集中在相同词数附近。 ## 第四步：谄媚模式检测尤其适用于对话或教学类内容： | 短语 | 问题 | |--------|-------| | "I'd be happy to" | 卑屈开场 | | "Great question!" | 空洞肯定 | | "Absolutely!" | 过度附和 | | "That's a wonderful point" | 奉承 | | "I'm glad you asked" | 填充 | | "You're absolutely right" | 谄媚 | 这些短语不增加信息，标志生成内容。 ## 第五步：计算垃圾密度得分 ` slop_score = (tier1_count 3 + tier2_count 2 + phrase_count avg_phrase_score) / word_count 100 ` | 得分 | 评级 | 操作 | |-------|--------|--------| | 0-1.0 | 干净 | 无需操作 | | 1.0-2.5 | 轻微 | 局部修复 | | 2.5-5.0 | 中等 | 建议重写段落 | | 5.0+ | 严重 | 需全文审查 | ## 第六步：生成报告输出格式： `markdown ## 垃圾检测报告：[filename] 总体得分：X.X / 10（评级）词数：N 词发现标记：N 处 ### 高置信度标记 - 第 23 行："delve into" -> 建议："explore" - 第 45 行："rich tapestry" -> 建议："variety" ### 结构问题 - 破折号密度：8/1000 词（高） - bullet 比例：72%（升高） - 句子长度 SD：3.2 词（低方差） ### 短语模式 - 第 12 行："In today's fast-paced world"（空洞开场） - 第 89 行："cannot be overstated"（空洞强调） ### 建议 1. 将 [特定词] 替换为 [替代词] 2. 将第 34–56 行的 bullet 列表改为散文 3. 在第 3–5 段中变化句式结构 ` ## 模块参考 - 见 modules/fiction-patterns.md 了解叙述专属垃圾标记 - 见 modules/remediation-strategies.md 了解修复建议 ## 与修复集成检测后，调用 Skill(scribe:doc-generator) 并加 --remediate` 标志自动修复，或依据报告手动编辑。 ## 退出条件 - 扫描所有目标文件 - 计算密度得分 - 生成带可行建议的报告 - 将高严重项标为立即处理

运行时依赖

版本

安装命令

技能文档

相关技能推荐