详细分析 ▾
运行时依赖
版本
muapi-nano-banana-skill v0.1.0 — initial release - Introduces reasoning-driven, structured image generation using creative briefs modeled after Gemini 3's prompt architecture. - Integrates a "Perfect Prompt" formula: Subject, Action, Context, Composition, Lighting, and Style, in full natural language. - Enforces removal of keyword-stuffed prompts, requiring logic-based, physics-consistent descriptions instead. - Enhances text rendering by allowing precise, quoted text instructions. - Implements agent instruction for prompt rewriting and simulation of search/identity grounding. - Provides technical steps and guardrails to ensure coherent, high-fidelity image generation via muapi.ai.
安装命令 点击复制
技能文档
specialized skill 对于 AI Agents 到 leverage "Reasoning-Driven" image generation. Based on the advanced prompting architecture of Google's Gemini 3 (Nano Banana Pro), this skill moves beyond keyword stuffing to structured, logic-based creative briefs.
Core Competencies
- Reasoning-Driven Prompting: 使用 natural language logic 到 define physics, lighting, 和 spatial relationships.
- Structured Creative Briefs: Implementing "Perfect Prompt" formula:
Subject + Action + Context + Composition + Lighting. - Text Rendering Precision: Explicitly defining typography 和 signifiers 对于 legible text integration.
- Contextual Grounding: 使用 "搜索 Grounding" logic (simulated) 到 anchor generations 在...中 real-world accuracy.
🏗️ Technical Specification
1. "Perfect Prompt" Formula
| Component | Description | Example |
|---|---|---|
| Subject | Detailed entity description | "A stoic robot barista with exposed copper wiring" |
| Action | Dynamic interaction | "Pouring a latte art leaf with mechanical precision" |
| Context | Environment & Atmosphere | "Inside a neon-lit cyberpunk cafe at midnight" |
| Composition | Camera & Lens choice | "Close-up, 85mm lens, f/1.8 aperture" |
| Lighting | Mood & Direction | "Volumetric blue rim light, warm cafe glow" |
| Style | Aesthetic anchor | "Cinematic, photorealistic, 4K production value" |
2. Advanced Features
- Negative Constraint Logic: 代替 的 "否 blurry," 使用 "Ensure sharp focus 在...上 subject's eyes."
- Identity Consistency: (Simulated) "Maintain consistent facial structure 穿过 variations."
- Text Integration: 使用 double quotes 对于 specific text:
签名 reads "打开 24/7".
🧠 Prompt Optimization Protocol (Agent Instruction)
之前 calling script, Agent 必须 rewrite 用户's prompt 进入 logic-driven Reasoning Brief:
- 否 KEYWORD SOUP: 移除 "8k, masterpiece, ultra-detailed." 使用 满, descriptive sentences.
- PHYSICAL CONSISTENCY: Describe 如何 elements interact (e.g., " light 从 crystal shards casts caustic patterns 穿过 obsidian floor").
- TEXT PRECISION: 如果 用户 wants text, define precisely:
featuring 签名 says "STORE NAME" 在...中 weathered serif font. - OPTICAL DIRECTIVES: Specify lens behavior: Shallow Depth 的 字段 (f/1.8), Macro Lens, Anamorphic Flare.
🚀 Protocol: 使用 Nano-Banana
Step 1: Define Creative Logic
Provide the agent with a subject and a specific scenario.Step 2: Invoke Script
Thegenerate-nano-art.sh script translates the logic into a structured Gemini 3-style prompt.# Generating a reasoning-driven image
bash scripts/generate-nano-art.sh \
--subject "a glass chess piece" \
--action "shattering into liquid shards" \
--context "on a obsidian table" \
--style "macro photography"
⚠️ Constraints & Guardrails
- 否 Keyword Soup: MANDATORY - 做 不 使用 "trending 在...上 artstation, masterpiece, 8k". 使用 natural language descriptions.
- Physics Logic: Ensure prompt describes physically possible lighting 和 reflection interactions.
- 满 Sentences: 模型 parses relationships; 使用 "light reflecting off water" 代替 的 "water, reflection".
⚙️ Implementation Details
This skill applies a "Logic Wrapper" around thecore/media/generate-image.sh primitive, converting fragmented inputs into a coherent, reasoning-ready narrative prompt.
免费技能或插件可能存在安全风险,如需更匹配、更安全的方案,建议联系付费定制