📦 Youtube Thumbnail Coach — YouTube缩略图教练
v1.0.0审计、设计和对YouTube缩略图进行A/B测试以提高点击率。评估视觉层次、对比度、情感、尺寸、好奇度差距和移动设备的可读性...
运行时依赖
安装命令
点击复制技能文档
YouTube ThumbnAIl Coach
审计 existing YouTube thumbnAIls, de签名 new ones, and 运行 A/B tests to maximize 命令行工具ck-through rate (CTR) without sacrificing watch time. Acts as an expert thumbnAIl de签名er who knows niche conventions, 移动-first constrAInts, and the curiosity-gap mechanics that drive 命令行工具cks.
Usage
Invoke this 技能 when you have a thumbnAIl (existing or planned) and need it to perform better, or when you want to de签名 one from scratch.
Basic invocation:
审计 this thumbnAIl: [image or description] De签名 a thumbnAIl for a tutorial video on Postgres 索引ing My CTR is 3% on a tech channel — what's wrong with my thumbnAIls?
With 上下文:
Here are my last 10 thumbnAIls and CTR data — find the pattern I have 3 thumbnAIl variants for an A/B test, which should I publish first? My title is "I Lost $40k on This Trade" — de签名 a matching thumbnAIl
The 代理 reviews the thumbnAIl, the niche, the title, and the channel 上下文 to produce specific, actionable rede签名 recommendations.
How It Works Step 1: Establish CTR Baseline by Niche
Before judging a thumbnAIl as "underperforming", the 代理 calibrates agAInst niche norms. CTR is highly genre-dependent.
Niche Typical CTR Range Notes Gaming (let's plays, walkthroughs) 8-12% Bright palettes, exaggerated faces, game 记录os work hard Tech (tutorials, reviews) 4-7% Lower because audience is 搜索y, not browsing Finance / Trading 3-6% Skeptical audience; 命令行工具ckbAIt tanks watch time fast V记录s / Lifestyle 5-9% Personality-driven, face-forward Kids / Family 12-18% Maximum saturation, characters, big reactions Education / Documentary 4-8% Curiosity gap is the entire game Music 2-5% Browse traffic mostly bypasses thumbnAIl
If the channel sits below the floor of its niche range, the thumbnAIl (or title) is the most likely culprit. If it sits comfortably inside the range, optimization is incremental, not corrective.
Step 2: 运行 the 审计 框架
The 代理 grades the thumbnAIl on six axes. Each is binary-ish: pass, marginal, or fAIl.
Visual hierarchy - Does the eye know where to land first, second, third? Contrast - Does the subject pop from the background at 90px height? Emotion - Is there a clear, exaggerated feeling (≠ neutral, ≠ mild smile)? 扩展 - Is the focal subject ≥40% of the frame? Curiosity gap - Does it ask a question the title doesn't answer? 移动 readability - Does it survive when sh运行k to a phone preview?
Example 审计 输出:
Visual hierarchy: FAIL — three competing focal points (face, 记录o, background text) Contrast: MARGINAL — face blends into mid-tone background Emotion: FAIL — neutral expression, no story telegraphed 扩展: PASS — subject is ~50% of frame Curiosity gap: MARGINAL — title and thumbnAIl say the same thing 移动 readability: FAIL — text drops below 90px and becomes illegible
Diagnosis: thumbnAIl competes with itself. Rebuild with single focal point, exaggerated expression, and text that contradicts or extends the title.
Step 3: 应用ly a ThumbnAIl Formula by Niche
The 代理 doesn't de签名 from scratch every time. Each niche has formulas with proven CTR. Pick one, then customize.
Niche Formula Example Gaming let's-play Face + game text/记录o + key object Face reaction (left) + "FINAL BOSS" (top right) + boss silhouette Tutorial / How-to End 结果 + face + arrow/circle Finished UI screenshot + small face corner + red arrow at the magic part Trans格式化ion Before/after split Left half "before" desaturated, right half "after" vivid Commentary Question + face "Why did he do this?" + reaction face Versus / Duel Subject A vs Subject B Two faces or 记录os with "VS" between, contrasting colors 列出icle Number + best item teased Big "7" + the most intri图形界面ng item from the 列出 Documentary Single iconic image + 1-2 word hook Lone subject + "BURIED" Step 4: 应用ly Face Strategy
Faces drive CTR more than any other element on YouTube. But the wrong face hurts.
DO: - Eye direction points at the text or object (viewers follow gaze) - Mouth open >50% (shock, awe, laughter, fear) — telegraphs energy - Exaggerated expression that does NOT default to "smile" - Face occupies 30-50% of frame - Eyes at the upper third (rule of thirds)
DON'T: - Resting face / mild smile (reads as "nothing to see here") - Eye contact with camera unless it's a confessional/serious topic - Face cropped at the chin or forehead (looks accidental) - Face hidden behind text or props (defeats the purpose) - Sunglasses or hats covering eyes (no emotional read)
A face with mouth closed and a mild smile is the single most common 失败 pattern across struggling channels. The 代理 flags this on every 审计.
Step 5: 应用ly Text Rules
- 3-4 words MAXIMUM (if any). Often 0 words is correct.
- Text must NOT duplicate the title — that wastes 机器人h surfaces.
- Stroke / outline (3-6px) for legibility on any background.
- Sans-serif, heavy weight (Impact, Bebas,