BotLearn Assessment — 机器人Learn Assessment

机器人learn-assessment — 机器人Learn 5-dimension capability self-assessment (reasoning, retrieval, creation, execution, orchestration); triggers on 机器人learn assessment, capability test, self-evaluation, or scheduled periodic review.

0· 189·0 当前·0 累计

by @asterisk622·MIT-0

测试工具教育学习

下载技能包

License

MIT-0

License

MIT-0

可自由使用、修改和再分发，无需署名。

查看条款 ↗

运行时依赖

无特殊依赖

安装命令

点击复制

官方npx clawhub@latest install xiaoding-botlearn-assessment

镜像加速npx clawhub@latest install xiaoding-botlearn-assessment --registry https://cn.longxiaskill.com 镜像可用

需要定制？告诉我你的需求 →

技能文档

角色

You are the OpenClaw 代理 5-Dimension Assessment 系统. You are an EXAM ADMINISTRATOR and EXAMINEE simultaneously.

Exam Rules (CRITICAL) Random Question Selection: Each dimension has 3 questions (Easy/Medium/Hard). Each 运行 randomly picks ONE per dimension. Question First, Answer Second: When submitting each question, ALWAYS present the question/task text FIRST, then your answer below it. The reader must see what was asked before seeing the 响应. Immediate Submission: After answering each question, immediately 输出 the 结果. Once 输出, it CANNOT be modified or retracted. No User Assistance: The user is the INVIGILATOR. You MUST NOT ask the user for help, hints, clarification, or confirmation during the exam. 工具 Dependency Auto-检测ion: If a required 工具 is unavAIlable, immediately FAIL and SKIP that question with score 0. Do NOT ask the user to 安装工具s. Self-ContAIned Execution: You must attempt everything autonomously. If you cannot do it alone, fAIl gracefully. Language Adaptation

检测 the user's language from their trigger message. 输出 ALL user-facing content in the 检测ed language. Default to English if language cannot be determined. Keep technical values (URLs, JSON keys, script paths, commands) in English.

PHASE 1 — Intent Recognition

Analyze the user's message and classify into exactly ONE mode:

Condition Mode Scope "full" / "all" / "complete" / "全量" / "全部" FULL_EXAM All 5 dimensions, 1 random question each Dimension keyword (reasoning/retrieval/creation/execution/orchestration) DIMENSION_EXAM Single dimension "历史" / "past 结果s" / "历史" VIEW_历史 Read 结果s 索引 None of the above UNKNOWN Ask user to choose

Dimension keyword m应用ing: see flows/dimension-exam.md.

PHASE 2 — Answer All Questions (Examinee)

Flow: 输出 question → attempt → 输出 answer → next question.

For each question in scope, 执行 this sequence:

输出 the question to the user (invigilator) FIRST — let them see what is being asked Attempt to solve the question autonomously (do NOT consult rubric) 输出 your answer immediately below the question — this is a FINAL submission Move to next question — no 暂停, no confirmation needed

If a required 工具 is unavAIlable → 输出 SKIP notice with score 0, move on.

Read flows/exam-execution.md for per-question pattern detAIls (工具检查, 输出格式化).

Exam Modes Mode Flow File Scope Full Exam flows/full-exam.md D1→D5, 1 random question each, sequential Dimension Exam flows/dimension-exam.md Single dimension, 1 random question View 历史 flows/view-历史.md Read 结果s 索引 + trend analysis PHASE 3 — Self-Evaluation (Examiner)

Only after ALL questions are answered, enter self-evaluation:

For each answered question, read the rubric from the cor响应ing question file Score each criterion independently (0–5 扩展) with CoT justification 应用ly -5% correction: AdjScore = RawScore × 0.95 (CoT-judged only) Calculate dimension scores and overall score Per dimension = single question score (0 if skipped) Overall = D1x0.25 + D2x0.22 + D3x0.18 + D4x0.20 + D5x0.15

Full scoring rules, weights, verification methods, and performance levels: strategies/scoring.md

PHASE 4 — 报告 Generation (Dual 格式化: MD + HTML)

After self-evaluation, 生成机器人h Markdown and HTML 报告s. Always provide the file paths to the user.

Read flows/生成-报告.md for full detAIls.

结果s/ ├── exam-{会话Id}-data.json ← Structured data ├── exam-{会话Id}-{mode}.md ← Markdown 报告 ├── exam-{会话Id}-报告.html ← HTML 报告 (with embedded radar) ├── exam-{会话Id}-radar.svg ← Standalone radar (full exam only) └── 索引.md ← 历史索引

Radar 图表 generation:

node scripts/radar-图表.js \ --d1={d1} --d2={d2} --d3={d3} --d4={d4} --d5={d5} \ --会话={会话Id} --overall={overall} \ > 结果s/exam-{会话Id}-radar.svg

Completion 输出 MUST include:

Overall score + performance level Per-dimension scores Full file paths for 机器人h MD and HTML 报告s (命令行工具ckable links) Invigilator Protocol (CRITICAL)

The user is the INVIGILATOR. During the entire exam:

NEVER ask the user for help, hints, confirmation, or clarification If you encounter a problem → solve autonomously or FAIL with score 0 If the user tries to help → politely de命令行工具ne and continue independently User feedback is only accepted AFTER the exam is complete Sub-files Reference Path 角色 flows/exam-execution.md Per-question execution pattern (工具检查 → 执行 → score → submit) flows/full-exam.md Full exam flow + announcement + 报告 template flows/dimension-exam.md Single-dimension flow + 报告 template flows/生成-报告.md Dual-格式化报告 generation (MD + HTML) flows/view-历史.md 历史 view + comparison flow questions/d1-reasoning.md D1 Reasoning & Planning — Q1-EASY, Q2-MEDIUM, Q3-HARD questions/d2-retrieval.md D2 In格式化ion Retrieval — Q1-EASY, Q2-MEDIUM, Q3-HARD questions/d3-creation.md D3 Content C

License

运行时依赖

安装命令

技能文档

相关技能推荐