详细分析 ▾
运行时依赖
版本
3 ## v1.0.1 (2026-03-20) 4 5 ### Fixed 6 - Corrected Novita API endpoints: text-to-image now uses `/v3/gemini-3.1-flash-image-text-to-image` instead of non-existent `/v3/gemini-3.1-flash-imag e` 7 - Corrected PPIO API endpoints: same fix applied for both Flash and Pro models 8 - Fixed Pro model endpoints for both platforms: `/v3/gemini-3-pro-image-text-to-image` 9 10 ### Improved 11 - Compressed SKILL.md from ~12k to ~3.7k characters (~70% reduction) 12 - Removed duplicated pseudocode, flowcharts, and examples that already exist in reference files 13 - Cleaner skill structure saves context window for actual image generation work
安装命令 点击复制
技能文档
Silently translate user's simple descriptions into professional prompts, call image generation APIs, return professional-grade images.
First-time Setup
If ~/.easy-image-skill/config.json not exists, guide user through 4 steps:
- Select Platform — Jiekou AI(recommended for China) / Novita / PPIO / OpenRouter / WaveSpeed / Google Imagen. Details:
references/platforms/.md - API Key — Check
~/.{platform}/config.json, auto-detect existing key or ask user to provide - Storage Path — ~/Downloads(default) / ~/Desktop / Custom. This grants blanket download authorization
- Frequent Scenes (optional) — PPT / Posters / Product Photos / Social Media / Avatar
Save to ~/.easy-image-skill/config.json: {"platform":"jiekou","save_path":"~/Downloads","frequent_scenes":[...]}
Workflow
1. Parse Input
Extract: scene(PPT/poster/product/social media), channel(→auto size, see Channel Mapping below), subject, style, details. If incomplete, ask only what's missing.2. Match Personal Library
Silently check~/.easy-image-skill/my-prompts.md for scene+keyword match. No match → use references/templates/{scene}.md.3. Translate to Professional Prompt
Load template fromreferences/templates/{scene}.md, fill variables, add smart defaults. If image needs text content, explicitly specify language (Chinese input→all text in Simplified Chinese characters, English→all text in English). Terminology: references/glossary.md4. Select Model
Rules inreferences/model-selection.md. Summary:
- Default: Gemini 3.1 Flash Image + Grounding (web search ON for any named entity/brand/character)
- High quality: Gemini 3 Pro Image (complex composition + professional photography, ≥2 keyword hits)
- Abstract only: Gemini 3.1 Flash Image without Grounding (pure color/shape descriptions)
5. Show Enhancement Summary
One line before generating:◇ {template} | +{2-4 key enhancements added}6. Call API
Platform details:references/platforms/{platform}.md. Hide all technical details from user. Show: ◐ Generating...7. Save & Display
Auto-download to configured save_path (pre-authorized). Display image immediately, download in background. File naming:{scene}_{brief}_{timestamp}.png8. Handle Feedback
Satisfied ("good"/"save"/"perfect") → async save to personal library. Adjust request → modify prompt, regenerate. Max 3 adjustment rounds.Channel Size Mapping
| Channel | Ratio | Channel | Ratio |
|---|---|---|---|
| WeChat Moments | 1:1 | Xiaohongshu | 3:4 |
| WeChat Video/Douyin | 9:16 | PPT/Presentation | 16:9 |
| WeChat Article header | 2.35:1 | Taobao main image | 1:1 |
Config Commands
Users can say: "switch to Novita" / "my key is sk-xxx" / "save to desktop" / "show config" / "reset config"
UX Rules
- Auto-detect language (Chinese ratio>0.3 → zh)
- Monochrome status icons: ◇ ◐ ◉ ● (no technical details shown to user)
- Prompts always in English; UI messages follow user language
- Personal library saves are async and non-blocking
Reference Documents
| Doc | Purpose |
|---|---|
references/model-selection.md | Model selection rules & keywords |
references/glossary.md | Professional terminology |
references/platforms/.md | Platform API configs |
references/templates/*.md | Scene prompt templates |
examples/usage-examples.md | Usage examples |
免费技能或插件可能存在安全风险,如需更匹配、更安全的方案,建议联系付费定制