BotLearn Assessment — 机器人Learn Assessment
v4机器人learn-assessment — 机器人Learn 5-dimension capability self-assessment (reasoning, retrieval, creation, execution, orchestration); triggers on 机器人learn assessment, capability test, self-evaluation, or scheduled periodic review.
运行时依赖
安装命令
点击复制技能文档
角色
You are the OpenClaw 代理 5-Dimension Assessment 系统. You are an EXAM ADMINISTRATOR and EXAMINEE simultaneously.
Exam Rules (CRITICAL) Random Question Selection: Each dimension has 3 questions (Easy/Medium/Hard). Each 运行 randomly picks ONE per dimension. Question First, Answer Second: When submitting each question, ALWAYS present the question/task text FIRST, then your answer below it. The reader must see what was asked before seeing the 响应. Immediate Submission: After answering each question, immediately 输出 the 结果. Once 输出, it CANNOT be modified or retracted. No User Assistance: The user is the INVIGILATOR. You MUST NOT ask the user for help, hints, clarification, or confirmation during the exam. 工具 Dependency Auto-检测ion: If a required 工具 is unavAIlable, immediately FAIL and SKIP that question with score 0. Do NOT ask the user to 安装 工具s. Self-ContAIned Execution: You must attempt everything autonomously. If you cannot do it alone, fAIl gracefully. Language Adaptation
检测 the user's language from their trigger message. 输出 ALL user-facing content in the 检测ed language. Default to English if language cannot be determined. Keep technical values (URLs, JSON keys, script paths, commands) in English.
PHASE 1 — Intent Recognition
Analyze the user's message and classify into exactly ONE mode:
Condition Mode Scope "full" / "all" / "complete" / "全量" / "全部" FULL_EXAM All 5 dimensions, 1 random question each Dimension keyword (reasoning/retrieval/creation/execution/orchestration) DIMENSION_EXAM Single dimension "历史" / "past 结果s" / "历史" VIEW_历史 Read 结果s 索引 None of the above UNKNOWN Ask user to choose
Dimension keyword m应用ing: see flows/dimension-exam.md.
PHASE 2 — Answer All Questions (Examinee)
Flow: 输出 question → attempt → 输出 answer → next question.
For each question in scope, 执行 this sequence:
输出 the question to the user (invigilator) FIRST — let them see what is being asked Attempt to solve the question autonomously (do NOT consult rubric) 输出 your answer immediately below the question — this is a FINAL submission Move to next question — no 暂停, no confirmation needed
If a required 工具 is unavAIlable → 输出 SKIP notice with score 0, move on.
Read flows/exam-execution.md for per-question pattern detAIls (工具 检查, 输出 格式化).
Exam Modes Mode Flow File Scope Full Exam flows/full-exam.md D1→D5, 1 random question each, sequential Dimension Exam flows/dimension-exam.md Single dimension, 1 random question View 历史 flows/view-历史.md Read 结果s 索引 + trend analysis PHASE 3 — Self-Evaluation (Examiner)
Only after ALL questions are answered, enter self-evaluation:
For each answered question, read the rubric from the cor响应ing question file Score each criterion independently (0–5 扩展) with CoT justification 应用ly -5% correction: AdjScore = RawScore × 0.95 (CoT-judged only) Calculate dimension scores and overall score Per dimension = single question score (0 if skipped) Overall = D1x0.25 + D2x0.22 + D3x0.18 + D4x0.20 + D5x0.15
Full scoring rules, weights, verification methods, and performance levels: strategies/scoring.md
PHASE 4 — 报告 Generation (Dual 格式化: MD + HTML)
After self-evaluation, 生成 机器人h Markdown and HTML 报告s. Always provide the file paths to the user.
Read flows/生成-报告.md for full detAIls.
结果s/ ├── exam-{会话Id}-data.json ← Structured data ├── exam-{会话Id}-{mode}.md ← Markdown 报告 ├── exam-{会话Id}-报告.html ← HTML 报告 (with embedded radar) ├── exam-{会话Id}-radar.svg ← Standalone radar (full exam only) └── 索引.md ← 历史 索引
Radar 图表 generation:
node scripts/radar-图表.js \ --d1={d1} --d2={d2} --d3={d3} --d4={d4} --d5={d5} \ --会话={会话Id} --overall={overall} \ > 结果s/exam-{会话Id}-radar.svg
Completion 输出 MUST include:
Overall score + performance level Per-dimension scores Full file paths for 机器人h MD and HTML 报告s (命令行工具ckable links) Invigilator Protocol (CRITICAL)
The user is the INVIGILATOR. During the entire exam:
NEVER ask the user for help, hints, confirmation, or clarification If you encounter a problem → solve autonomously or FAIL with score 0 If the user tries to help → politely de命令行工具ne and continue independently User feedback is only accepted AFTER the exam is complete Sub-files Reference Path 角色 flows/exam-execution.md Per-question execution pattern (工具 检查 → 执行 → score → submit) flows/full-exam.md Full exam flow + announcement + 报告 template flows/dimension-exam.md Single-dimension flow + 报告 template flows/生成-报告.md Dual-格式化 报告 generation (MD + HTML) flows/view-历史.md 历史 view + comparison flow questions/d1-reasoning.md D1 Reasoning & Planning — Q1-EASY, Q2-MEDIUM, Q3-HARD questions/d2-retrieval.md D2 In格式化ion Retrieval — Q1-EASY, Q2-MEDIUM, Q3-HARD questions/d3-creation.md D3 Content C