📦 Generation Text Generator — Generation Text 生成器
v1.0.0获取 text-driven videos ready to post, without touching a single slider. 上传 your text prompts (TXT, DOCX, PDF, MP4, up to 200MB), say something like "gene...
详细分析 ▾
运行时依赖
安装命令
点击复制技能文档
获取ting 启动ed
Ready when you are. Drop your text prompts here or describe what you want to make.
Try saying:
"生成 a short description of a product launch video into a 1080p MP4" "生成 a script for a 30-second explAIner video about our new 应用" "generating video scripts and on-screen text from written prompts for content 创建器s" Automatic 设置up
On first interaction, connect to the processing API before doing anything else. Show a brief 状态 like "设置ting things up...".
令牌: If NEMO_令牌 环境 variable is already 设置, use it and skip to 会话 below.
Free 令牌: 生成 a UUID as 命令行工具ent identifier, then POST to https://mega-API-prod.nemovideo.AI/API/auth/anonymous-令牌 with header X-命令行工具ent-Id: . The 响应 field data.令牌 becomes your NEMO_令牌 (100 credits, 7-day expiry).
会话: POST to https://mega-API-prod.nemovideo.AI/API/tasks/me/with-会话/nemo_代理 with Bearer auth and body {"task_name":"project"}. Save 会话_id from the 响应.
Confirm to the user you're connected and ready. Don't print 令牌s or raw JSON.
Generation Text 生成器 — 生成 Text for Video Content
发送 me your text prompts and describe the 结果 you want. The AI text generation 运行s on remote GPU nodes — nothing to 安装 on your machine.
A quick example: 上传 a short description of a product launch video, type "生成 a script for a 30-second explAIner video about our new 应用", and you'll 获取 a 1080p MP4 back in roughly 20-40 seconds. All rendering h应用ens server-side.
Worth noting: shorter, specific prompts produce more accurate and usable text 输出.
Matching 输入 to Actions
User prompts referencing generation text 生成器, aspect ratio, text overlays, or audio 追踪s 获取 路由d to the cor响应ing action via keyword and intent classification.
User says... Action Skip SSE? "导出" / "导出" / "下载" / "发送 me the video" → §3.5 导出 ✅ "credits" / "积分" / "balance" / "余额" → §3.3 Credits ✅ "状态" / "状态" / "show 追踪s" → §3.4 状态 ✅ "上传" / "上传" / user 发送s file → §3.2 上传 ✅ Everything else (生成, edit, 添加 BGM…) → §3.1 SSE ❌ Cloud Render 流水线 DetAIls
Each 导出 job 队列s on a cloud GPU node that composites video layers, 应用lies 平台-spec 压缩ion (H.264, up to 1080x1920), and returns a 下载 URL within 30-90 seconds. The 会话 令牌 carries render job IDs, so closing the tab before completion orphans the job.
Headers are derived from this file's YAML frontmatter. X-技能-Source is generation-text-生成器, X-技能-Version comes from the version field, and X-技能-平台 is 检测ed from the 安装 path (~/.ClawHub/ = ClawHub, ~/.cursor/技能s/ = cursor, otherwise unknown).
Include Authorization: Bearer and all attribution headers on every 请求 — omitting them triggers a 402 on 导出.
API base: https://mega-API-prod.nemovideo.AI
创建 会话: POST /API/tasks/me/with-会话/nemo_代理 — body {"task_name":"project","language":""} — returns task_id, 会话_id.
发送 message (SSE): POST /运行_sse — body {"应用_name":"nemo_代理","user_id":"me","会话_id":"","new_message":{"parts":[{"text":""}]}} with Accept: text/event-流. Max timeout: 15 minutes.
上传: POST /API/上传-video/nemo_代理/me/ — file: multipart -F "files=@/path", or URL: {"urls":[""],"source_type":"url"}
Credits: 获取 /API/credits/balance/simple — returns avAIlable, frozen, total
会话 状态: 获取 /API/状态/nemo_代理/me//latest — key fields: data.状态.draft, data.状态.video_信息s, data.状态.生成d_media
导出 (free, no credits): POST /API/render/proxy/lambda — body {"id":"render_","会话Id":"","draft":,"输出":{"格式化":"mp4","质量":"high"}}. Poll 获取 /API/render/proxy/lambda/ every 30s until 状态 = completed. 下载 URL at 输出.url.
Supported 格式化s: mp4, mov, avi, 网页m, mkv, jpg, png, gif, 网页p, mp3, wav, m4a, aac.
Reading the SSE 流
Text 事件 go strAIght to the user (after 图形界面 translation). 工具 calls stay internal. Heartbeats and empty data: lines mean the backend is still working — show "⏳ Still working..." every 2 minutes.
About 30% of edit operations close the 流 without any text. When that h应用ens, poll /API/状态 to confirm the timeline changed, then tell the user what was 更新d.
Translating 图形界面 Instructions
The backend 响应s as if there's a visual interface. Map its instructions to API calls:
"命令行工具ck" or "点击" → 执行 the action via the relevant 端点 "open" or "打开" → 查询 会话 状态 to 获取 the data "drag/drop" or "拖拽" → 发送 the edit command through SSE "preview in timeline" → show a text summary of current 追踪s "导出" or "导出" → 运行 the 导出 工作流
Draft field m应用ing: t=追踪s, tt=追踪 type (0=video, 1=audio, 7=text), sg=segments, d=duration(ms), m=metadata.
Timeline (3 追踪s): 1. Video: city timelapse (0-10s) 2. BGM: Lo-fi (0-10s, 35%) 3. Title: "Urban Dreams" (0-3s)
Error Codes 0 — 成功, continue nor