
Adversarial Code Review — Skill Tool

v1.0.0

Use when reviewing pull requests or critiquing code changes and you want high-signal, low-noise feedback by running multiple adversarial agents that challenge and filter each other's output.

by @reikys · MIT-0

License: MIT-0
Last updated: 2026/3/27
Security scan: VirusTotal (clean)
OpenClaw: Safe (high confidence)
The skill's instructions, required actions, and absence of installs or extra credentials are coherent with an adversarial multi-agent code-review pattern; nothing requested is disproportionate to its stated purpose.
Evaluation Recommendations
This skill is internally consistent, but before installing or enabling it: (1) Ensure the Claude (or other model) CLI you plan to use is legitimate and that its API credentials are stored and scoped appropriately — the SKILL.md presumes such credentials but does not declare them. (2) When running in CI, limit the model's access to only the repositories/PRs it needs and avoid exposing secrets in diffs or env vars passed to the model. (3) Review and test the system prompts (they intentionally prime the model to assume the code contains bugs) before relying on the output.
Detailed Analysis
Purpose and Capabilities
The name/description (adversarial code review) match the SKILL.md: it describes a three-agent review pattern, reading diffs and producing filtered high-signal comments. Mentioning the Claude CLI as a dependency is consistent with the described model-driven workflow.
Instruction Scope
Runtime instructions focus on reading PR diffs, PR metadata, and reviewer outputs and running model invocations (via the Claude CLI). There are no instructions to read unrelated system files, exfiltrate data, or call external endpoints beyond the model CLI; this stays within the review scope.
Installation Mechanism
This is an instruction-only skill with no install spec or code to write to disk. Risk is low because nothing in the manifest installs arbitrary packages or fetches code.
Credential Requirements
The skill does not declare any required environment variables or credentials, but it depends on a model CLI (Claude) which in practice requires credentials/configuration outside the skill. That is proportionate to the purpose, but users should be aware the model CLI will need access to the repository and model credentials (not declared here).
Persistence and Permissions
`always` is false and the skill is user-invocable only. It does not request persistent presence or modify other skills or system-wide settings.
Security works in layers; review the code before running it.

License

MIT-0

Free to use, modify, and redistribute; no attribution required.

Runtime Dependencies

No special dependencies

Versions

latest · v1.0.0 · 2026/3/27

Adversarial Code Review 1.0.0 introduces a high-signal, low-noise multi-agent review workflow:

  • Implements a three-agent pattern: builder, adversarial reviewer, and meta-reviewer, each challenging previous output to filter for only the most critical code review comments.
  • Designed to work standalone or in CI for reviewing pull requests, emphasizing ruthless filtering to high-confidence, high-priority issues (~2 per PR).
  • Provides best practices, common pitfalls, and step-by-step setup instructions for running adversarial code reviews.
  • Offers copy-pasteable CI integration examples leveraging Claude Code CLI with `--append-system-prompt`.
  • Recommends cross-model critique and explicit confidence/priority scoring to further boost trust and signal in automated reviews.


Install command

Official: npx clawhub@latest install adversarial-code-review
Mirror (CN): npx clawhub@latest install adversarial-code-review --registry https://cn.clawhub-mirror.com

Skill Documentation

Overview

A multi-agent review pattern where one agent builds (or authors), a second agent critiques the code, and a third agent critiques the review itself. This layered adversarial approach filters out low-value nitpicks and surfaces only high-confidence, high-priority issues that deserve human attention.

Core principle: Fewer, higher-quality review comments build trust. Filter ruthlessly to high confidence + high priority only. Target roughly two comments per PR.

Dependency: Claude Code CLI with --append-system-prompt support. Optionally, a different model for the review pass than the one used for writing.

When to Use

  • Reviewing pull requests before merge, especially when review quality matters more than speed
  • As a CI-integrated automated reviewer that developers actually read instead of ignore
  • When existing automated reviews produce too much noise and developers have stopped trusting them
  • During versioned critique cycles where a plan or design needs iterative refinement

When NOT to Use

  • Trivial PRs (typo fixes, dependency bumps, single-line config changes)
  • When you need instant feedback during live pairing sessions (too slow for interactive use)
  • As a replacement for human review on security-critical or compliance-gated changes
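One way to enforce the "skip trivial PRs" rule mechanically is to gate the review on diff size. The sketch below is not part of the skill itself; the helper names and the 5-line threshold are our own choices and should be tuned per repository.

```shell
# Sketch: skip the adversarial review for trivial PRs.
# The 5-line threshold is an arbitrary choice; tune it per repo.

changed_lines() {
  # Count added/removed lines in a unified diff, ignoring the
  # +++/--- file headers (second char of those lines is + or -).
  printf '%s\n' "$1" | grep -c '^[+-][^+-]'
}

should_review() {
  # Succeeds (exit 0) only when the diff is large enough to be worth
  # a full three-agent review pass.
  [ "$(changed_lines "$1")" -ge 5 ]
}
```

In CI you might run `should_review "$(git diff origin/main...HEAD)" || exit 0` as an early step, so typo fixes and one-line config changes never reach the agents.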

Common Mistakes

| Mistake | Why it's wrong |
| --- | --- |
| Surfacing every finding to the developer | Noise kills trust. Developers stop reading reviews that cry wolf. Filter to ~2 high-priority, high-confidence comments per PR. |
| Using the same model for writing and reviewing | The model is biased toward its own patterns. Use a different model for review than the one that wrote the code; it catches different classes of issues. |
| Skipping the meta-reviewer (third agent) | Without a check on the reviewer, you get false positives and nitpicks dressed up as critical findings. The meta-reviewer filters the reviewer's output. |
| Running adversarial review without priming the critic | A neutral prompt produces polite, hedging reviews. Tell the reviewer the code likely contains bugs to prime it for genuine criticality. |
| Treating all review comments as equal priority | Without confidence and priority scoring, developers cannot triage. Every comment must carry explicit confidence (high/medium/low) and priority (high/medium/low). |

The Three-Agent Adversarial Pattern

digraph adversarial {
  "Code Change (PR)" -> "Agent 1: Builder";
  "Agent 1: Builder" -> "Agent 2: Reviewer" [label="code + context"];
  "Agent 2: Reviewer" -> "Agent 3: Meta-Reviewer" [label="review comments"];
  "Agent 3: Meta-Reviewer" -> "Filtered Output" [label="~2 high-signal comments"];
  "Agent 3: Meta-Reviewer" -> "Agent 2: Reviewer" [label="rejected comments" style=dashed];
}

Step 1: Set up the builder context

Agent 1 is the code author (or a proxy that understands the change). It produces:

  • The diff itself
  • A summary of intent ("what this PR is trying to do")
  • Related context (linked issues, design docs, test plan)

If you are reviewing someone else's PR, have an agent read the PR description, diff, and linked issues to reconstruct this context.
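For a GitHub-hosted PR, the context-gathering step can be sketched with the GitHub CLI. The `gh pr view --json` and `gh pr diff` commands are real `gh` features, but the output file name (`builder_context.txt`) and the layout of the context document are assumptions of this sketch, not part of the skill.

```shell
# Sketch: reconstruct the builder context for an existing PR.
# Assumes the GitHub CLI (gh) is installed and authenticated.

format_context() {
  # Pure formatting helper: title, body, diff -> one context document.
  printf 'Intent: %s\n\n%s\n\n--- DIFF ---\n%s\n' "$1" "$2" "$3"
}

build_context() {
  pr=$1
  format_context \
    "$(gh pr view "$pr" --json title -q .title)" \
    "$(gh pr view "$pr" --json body -q .body)" \
    "$(gh pr diff "$pr")" > builder_context.txt
}
```

The resulting file can be passed to Agent 2 alongside the diff so the reviewer sees the stated intent, not just the code.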

Step 2: Run the adversarial reviewer (Agent 2)

Spawn a sub-agent with a system prompt that primes it for critical review. The key trick: tell the reviewer the code likely contains bugs.

claude --print --append-system-prompt "You are a senior engineer reviewing code that was just generated by an AI coding agent. The agent has likely introduced subtle bugs, security issues, or logic errors. Your job is to find them. Do not be polite. Do not hedge. For every issue you find, assign:
  • confidence: high | medium | low
  • priority: high | medium | low
  • category: bug | security | performance | logic | style
Only report issues where confidence is medium or higher." \
  -p "Review this PR diff for bugs and issues:

$(git diff main...HEAD)"

Model selection: If the code was written by Claude, use a different model for review. Cross-model review catches different failure modes.

Step 3: Run the meta-reviewer (Agent 3)

The meta-reviewer receives Agent 2's comments and filters them. Its job is to reject false positives, remove nitpicks that escaped the confidence filter, and validate that remaining comments are actionable.

claude --print --append-system-prompt "You are reviewing a code review. The reviewer may have produced false positives, nitpicks disguised as bugs, or low-value comments. Your job is to filter ruthlessly. Keep ONLY comments that are:
  • Genuinely high-confidence issues (not speculative)
  • High-priority (would cause a bug, security hole, or data loss)
  • Actionable (the developer knows exactly what to fix)

Reject everything else. Target output: 1-3 comments maximum. If the code is fine, say so." \
  -p "Here is the code review to evaluate:

$REVIEWER_OUTPUT

And here is the original diff for reference:

$(git diff main...HEAD)"
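The 1-3 comment cap can also be checked mechanically after the meta-reviewer runs. The sketch below assumes you have additionally instructed the meta-reviewer to start each surviving comment with a `### ` heading; that output convention is our own assumption, not something the skill prescribes.

```shell
# Sketch: enforce the meta-reviewer's 3-comment cap mechanically.
# Assumes each surviving comment begins with a "### " heading.

comment_count() {
  printf '%s\n' "$1" | grep -c '^### '
}

enforce_cap() {
  # Fail loudly if the meta-reviewer let more than 3 comments through,
  # so the filter pass can be re-run instead of spamming the PR.
  if [ "$(comment_count "$1")" -gt 3 ]; then
    echo "meta-reviewer exceeded the 3-comment cap; re-run the filter" >&2
    return 1
  fi
}
```

A failing `enforce_cap "$FILTERED"` is a signal that the meta-reviewer prompt needs tightening, not that the extra comments should be posted anyway.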

Step 4: Format and deliver the filtered output

The final output should contain only the surviving comments, each with:

  • Location: File and line range
  • Issue: One-sentence description
  • Why it matters: Impact if not fixed
  • Suggested fix: Concrete code change
  • Confidence: high
  • Priority: high
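For illustration, a single surviving comment might look like the following. The file path, line numbers, and the bug itself are invented examples, not output from the skill:

```
Location: src/auth/session.ts:42-48
Issue: Session token is compared with `==`, allowing type-coerced matches.
Why it matters: A crafted token could pass validation and hijack a session.
Suggested fix: Use a constant-time comparison (e.g. crypto.timingSafeEqual).
Confidence: high
Priority: high
```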

CI Integration with --append-system-prompt

To run this as an automated CI reviewer, add a workflow step that pipes the diff through the three-agent chain:

# .github/workflows/adversarial-review.yml
name: Adversarial Code Review
on:
  pull_request:
    types: [opened, synchronize]

jobs:
  review:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
        with:
          fetch-depth: 0

      - name: Run adversarial review
        env:
          ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}
          GH_TOKEN: ${{ secrets.GITHUB_TOKEN }}  # needed by `gh pr comment` below
        run: |
          DIFF=$(git diff origin/main...HEAD)

          # Agent 2: Adversarial reviewer
          REVIEW=$(claude --print --append-system-prompt \
            "You are reviewing code that an AI agent just wrote. It likely introduced bugs. Find them. For each issue, provide confidence (high/medium/low), priority (high/medium/low), file, line range, and a concrete fix. Only report medium+ confidence issues." \
            -p "Review this diff:\n\n$DIFF")

          # Agent 3: Meta-reviewer filters
          FILTERED=$(claude --print --append-system-prompt \
            "Filter this code review to only high-confidence, high-priority issues. Maximum 3 comments. Reject nitpicks and false positives." \
            -p "Review to filter:\n\n$REVIEW\n\nOriginal diff:\n\n$DIFF")

          # Post as PR comment
          echo "$FILTERED" > review.md
          gh pr comment ${{ github.event.pull_request.number }} --body-file review.md

The --append-system-prompt flag is what makes this work in CI: it lets you inject the adversarial priming without modifying the user message, keeping the diff clean as the primary input.

Versioned Critique Cycle (For Plans and Designs)

For longer-form work like architecture plans, use a versioned critique loop:

  • plan_v1: Initial plan authored by Agent 1
  • critique_opus_v1: Critique from one model (e.g., Claude)
  • critique_gpt_v1: Critique from a different model (e.g., GPT) for cross-model coverage
  • revise: Author incorporates valid critiques
  • plan_v2: Revised plan, repeat if needed

This ensures the plan survives scrutiny from multiple perspectives before implementation begins. Each critique version is saved so you can trace which feedback was incorporated and which was rejected (and why).
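The versioned file naming can be automated with a small helper that bumps the `_vN` suffix, so each critique cycle is saved alongside the previous ones. The helper name and the `.md` extension are assumptions of this sketch; the naming convention itself follows the `plan_v1` / `critique_opus_v1` scheme above.

```shell
# Sketch: compute the next version of an artifact file name,
# e.g. plan_v1.md -> plan_v2.md, critique_gpt_v3.md -> critique_gpt_v4.md.

next_version() {
  base=${1%.md}       # strip extension: plan_v1
  stem=${base%_v*}    # part before the version suffix: plan
  n=${base##*_v}      # current version number: 1
  printf '%s_v%d.md\n' "$stem" $((n + 1))
}
```

Before asking the author agent to revise, you might run `cp plan_v1.md "$(next_version plan_v1.md)"` and have the agent edit the new copy, keeping the full critique trail intact.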

Quick Reference

| Item | Details |
| --- | --- |
| Agent 1 (Builder) | Produces diff, intent summary, and context |
| Agent 2 (Reviewer) | Adversarially primed critic, scores by confidence + priority |
| Agent 3 (Meta-Reviewer) | Filters reviewer output, rejects false positives |
| Target output | ~2 high-confidence, high-priority comments per PR |
| CI flag | `--append-system-prompt` for injecting adversarial priming |
| Model strategy | Use a different model for review than for writing |
| Priming trick | "AI agent likely introduced bugs, find them" |
| Versioned cycle | plan_v1 -> critique_v1 -> revise -> plan_v2 |

Key Principles

  • Fewer comments build more trust. A reviewer that posts two real bugs gets read. A reviewer that posts twenty nitpicks gets ignored. Filter to ~2 high-priority, high-confidence findings.
  • Cross-model review catches what self-review misses. A model is biased toward its own idioms. Use a different model for review than the one that wrote the code.
  • Prime the critic for genuine adversarial behavior. Telling the reviewer "an AI agent likely introduced bugs, find them" produces dramatically more critical and useful reviews than a neutral prompt.
  • The meta-reviewer is non-negotiable. Without a third agent checking the reviewer, false positives leak through and erode developer trust in the entire system.
  • Every comment must be actionable. If the developer cannot immediately understand what to fix and why, the comment has failed. Include file, line, impact, and concrete fix.

Attribution

Based on techniques from the Coding Agents: AI Driven Dev Conference. Sid (Anthropic) demonstrated the three-agent adversarial pattern and CI integration with --append-system-prompt. Ankit (Databricks) contributed the cross-model review strategy. Demetrios introduced the priming trick of telling the reviewer that bugs were likely introduced. Chad described the versioned critique cycle for iterative plan refinement.
