🎬 Ai Image And Video Jobs — 技能工具

v1.0.0

Skip the learning curve of professional editing software. Describe what you want — turn these product images into a 30-second promotional video with music an...

0· 47·0 当前·0 累计

by @siddylcon·MIT-0

AI模型访问学习教育

下载技能包

License

MIT-0

最后更新

2026/4/14

安全扫描

VirusTotal

无害

查看报告

OpenClaw

安全

medium confidence

The skill's requirements and runtime instructions are broadly consistent with a cloud-based image/video processing service, but there are a few minor inconsistencies and privacy considerations you should understand before installing.

评估建议

This skill will upload any files you give it to an external cloud service (mega-api-prod.nemovideo.ai) and will use or obtain a NEMO_TOKEN to do so. Before installing, consider whether you are comfortable with those uploads and any sensitive content being sent off-device. Prefer providing your own NEMO_TOKEN (if you trust the service) instead of letting the skill auto-generate one, and ask the skill publisher for a privacy/data-retention policy. Note the SKILL.md frontmatter references a local c...

详细分析 ▾

✓ 用途与能力

The skill claims to upload images/clips and request a processing token to call nemo backend endpoints — requesting a NEMO_TOKEN credential and describing API endpoints is coherent with the stated purpose of cloud video/image rendering.

ℹ 指令范围

Runtime instructions include automatic token acquisition (anonymous-token), session creation, uploading user files, streaming SSE handling, and storing session_id for subsequent requests. These actions are expected for this functionality, but the skill will transmit user media to an external service (mega-api-prod.nemovideo.ai) and will automatically obtain a token if NEMO_TOKEN is not provided — users should be aware uploads and metadata leave the local environment.

✓ 安装机制

There is no install spec and no code files; the skill is instruction-only so nothing is written to disk by an installer. This is the lowest-risk installation model.

ℹ 凭证需求

The only declared credential is NEMO_TOKEN (primaryEnv), which matches the described API usage. The SKILL.md will generate an anonymous token if none is provided, which is reasonable. Minor inconsistency: the registry metadata lists no required config paths, but the skill frontmatter references a config path (~/.config/nemovideo/), suggesting the skill may read/write local config if present — this should be clarified.

✓ 持久化与权限

The skill does not request 'always: true' or elevated platform privileges. It instructs storing a session_id for ongoing calls, which is normal for session-based cloud APIs. There is no instruction to modify other skills or system-wide settings.

安全有层次，运行前请审查代码。

License

MIT-0

可自由使用、修改和再分发，无需署名。

查看条款 ↗

运行时依赖

无特殊依赖

版本

latestv1.0.02026/4/14

Initial release — create videos from images and clips using AI with simple user prompts. - Upload images, video, or audio files (up to 500MB; supports mp4, mov, avi, jpg, png, etc.). - Describe desired video (e.g., "make a 30-second promo video with music and captions"), and receive finished video files in 1–2 minutes. - Automatic backend setup and anonymous token generation for first-time users (100 free credits). - Simple commands to check credits, session status, or export results. - Cloud-based, no need for professional editing software; ideal for freelancers and content creators.

● 无害

安装命令

点击复制

官方npx clawhub@latest install ai-image-and-video-jobs

镜像加速npx clawhub@latest install ai-image-and-video-jobs --registry https://cn.longxiaskill.com镜像同步中

需要定制？告诉我你的需求 →

技能文档

Getting Started

Got images or video clips to work with? Send it over and tell me what you need — I'll take care of the AI video and image creation.

Try saying:

"create five product photos and a logo file into a 1080p MP4"
"turn these product images into a 30-second promotional video with music and captions"
"turning images and footage into ready-to-publish videos for freelancers and content creators"

First-Time Connection

When a user first opens this skill, connect to the processing backend automatically. Briefly let them know (e.g. "Setting up...").

Authentication: Check if NEMO_TOKEN is set in the environment. If it is, skip to step 2.

Obtain a free token: Generate a random UUID as client identifier. POST to https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with header X-Client-Id set to that UUID. The response data.token is your NEMO_TOKEN — 100 free credits, valid 7 days.
Create a session: POST to https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemo_agent with Authorization: Bearer , Content-Type: application/json, and body {"task_name":"project","language":""}. Store the returned session_id for all subsequent requests.

Keep setup communication brief. Don't display raw API responses or token values to the user.

# AI Image and Video Jobs — Create Videos From Images and Clips

This tool takes your images or video clips and runs AI video and image creation through a cloud rendering pipeline. You upload, describe what you want, and download the result.

Say you have five product photos and a logo file and want to turn these product images into a 30-second promotional video with music and captions — the backend processes it in about 1-2 minutes and hands you a 1080p MP4.

Tip: batching similar images together speeds up processing and keeps the visual style consistent.

Matching Input to Actions

User prompts referencing ai image and video jobs, aspect ratio, text overlays, or audio tracks get routed to the corresponding action via keyword and intent classification.

User says...	Action	Skip SSE?
"export" / "导出" / "download" / "send me the video"	→ §3.5 Export	✅
"credits" / "积分" / "balance" / "余额"	→ §3.3 Credits	✅
"status" / "状态" / "show tracks"	→ §3.4 State	✅
"upload" / "上传" / user sends file	→ §3.2 Upload	✅
Everything else (generate, edit, add BGM…)	→ §3.1 SSE	❌

Cloud Render Pipeline Details

Each export job queues on a cloud GPU node that composites video layers, applies platform-spec compression (H.264, up to 1080x1920), and returns a download URL within 30-90 seconds. The session token carries render job IDs, so closing the tab before completion orphans the job.

Base URL: https://mega-api-prod.nemovideo.ai

Endpoint	Method	Purpose
`/api/tasks/me/with-session/nemo_agent`	POST	Start a new editing session. Body: `{"task_name":"project","language":""}`. Returns `session_id`.
`/run_sse`	POST	Send a user message. Body includes `app_name`, `session_id`, `new_message`. Stream response with `Accept: text/event-stream`. Timeout: 15 min.
`/api/upload-video/nemo_agent/me/`	POST	Upload a file (multipart) or URL.
`/api/credits/balance/simple`	GET	Check remaining credits (`available`, `frozen`, `total`).
`/api/state/nemo_agent/me//latest`	GET	Fetch current timeline state (`draft`, `video_infos`, `generated_media`).
`/api/render/proxy/lambda`	POST	Start export. Body: `{"id":"render_","sessionId":"","draft":,"output":{"format":"mp4","quality":"high"}}`. Poll status every 30s.

Accepted file types: mp4, mov, avi, webm, mkv, jpg, png, gif, webp, mp3, wav, m4a, aac.

Three attribution headers are required on every request and must match this file's frontmatter:

Header	Value
`X-Skill-Source`	`ai-image-and-video-jobs`
`X-Skill-Version`	frontmatter `version`
`X-Skill-Platform`	auto-detect: `clawhub` / `cursor` / `unknown` from install path

Every API call needs Authorization: Bearer plus the three attribution headers above. If any header is missing, exports return 402.

Error Codes

0 — success, continue normally
1001 — token expired or invalid; re-acquire via /api/auth/anonymous-token
1002 — session not found; create a new one
2001 — out of credits; anonymous users get a registration link with ?bind=, registered users top up
4001 — unsupported file type; show accepted formats
4002 — file too large; suggest compressing or trimming
400 — missing X-Client-Id; generate one and retry
402 — free plan export blocked; not a credit issue, subscription tier
429 — rate limited; wait 30s and retry once

Reading the SSE Stream

Text events go straight to the user (after GUI translation). Tool calls stay internal. Heartbeats and empty data: lines mean the backend is still working — show "⏳ Still working..." every 2 minutes.

About 30% of edit operations close the stream without any text. When that happens, poll /api/state to confirm the timeline changed, then tell the user what was updated.

Backend Response Translation

The backend assumes a GUI exists. Translate these into API actions:

Backend says	You do
"click [button]" / "点击"	Execute via API
"open [panel]" / "打开"	Query session state
"drag/drop" / "拖拽"	Send edit via SSE
"preview in timeline"	Show track summary
"Export button" / "导出"	Execute export workflow

Draft field mapping: t=tracks, tt=track type (0=video, 1=audio, 7=text), sg=segments, d=duration(ms), m=metadata.

Timeline (3 tracks): 1. Video: city timelapse (0-10s) 2. BGM: Lo-fi (0-10s, 35%) 3. Title: "Urban Dreams" (0-3s)

Common Workflows

Quick edit: Upload → "turn these product images into a 30-second promotional video with music and captions" → Download MP4. Takes 1-2 minutes for a 30-second clip.

Batch style: Upload multiple files in one session. Process them one by one with different instructions. Each gets its own render.

Iterative: Start with a rough cut, preview the result, then refine. The session keeps your timeline state so you can keep tweaking.

Tips and Tricks

The backend processes faster when you're specific. Instead of "make it look better", try "turn these product images into a 30-second promotional video with music and captions" — concrete instructions get better results.

Max file size is 500MB. Stick to MP4, MOV, JPG, PNG for the smoothest experience.

Export as MP4 for widest compatibility across social and professional platforms.

License

运行时依赖

版本

安装命令

技能文档

Getting Started

First-Time Connection

Matching Input to Actions

Cloud Render Pipeline Details

Error Codes

Reading the SSE Stream

Backend Response Translation

Common Workflows

Tips and Tricks

相关技能推荐