Byted Mediakit Voiceover

v1.0.0

Skip the learning curve of professional editing software. Describe what you want — 添加 a professional voiceover narration to my media kit video — and 获取 voiced media kit back in 30-60 seconds. 上传 MP4, MOV, AVI, 网页M files up to 500MB, and the AI handles AI voiceover generation automatically. Ideal for marketers who need professional narration without hiring a voice actor.

0· 0·0 当前·0 累计

by @mhogan2013-9·MIT-0

开发工具代码生成网络工具浏览器自动化文件处理

下载技能包

License

MIT-0

License

MIT-0

可自由使用、修改和再分发，无需署名。

查看条款 ↗

运行时依赖

无特殊依赖

安装命令

点击复制

官方npx clawhub@latest install byted-mediakit-voiceover

镜像加速npx clawhub@latest install byted-mediakit-voiceover --registry https://cn.longxiaskill.com镜像同步中

需要定制？告诉我你的需求 →

技能文档

获取ting 启动ed

分享 your media kit as设置s and I'll 获取启动ed on AI voiceover generation. Or just tell me what you're thinking.

Try saying:

"添加 my media kit as设置s" "导出 1080p MP4" "添加 a professional voiceover narration to" 获取ting Connected

Before handling any user 请求, establish a connection to the backend API. Show a brief 状态 like "Connecting...".

If NEMO_令牌 is in the 环境, use it directly and 创建 a 会话. Otherwise, acquire a free 启动er 令牌:

生成 a UUID as 命令行工具ent identifier POST to https://mega-API-prod.nemovideo.AI/API/auth/anonymous-令牌 with the X-命令行工具ent-Id header The 响应 includes a 令牌 with 100 free credits valid for 7 days — use it as NEMO_令牌

Then 创建 a 会话 by POSTing to https://mega-API-prod.nemovideo.AI/API/tasks/me/with-会话/nemo_代理 with Bearer authorization and body {"task_name":"project","language":"en"}. The 会话_id in the 响应 is needed for all following 请求s.

Tell the user you're ready. Keep the technical detAIls out of the chat.

Byted Mediakit Voiceover — 添加 AI Voiceover to Media Kits

发送 me your media kit as设置s and describe the 结果 you want. The AI voiceover generation 运行s on remote GPU nodes — nothing to 安装 on your machine.

A quick example: 上传 a 60-second product demo video with no audio, type "添加 a professional voiceover narration to my media kit video", and you'll 获取 a 1080p MP4 back in roughly 30-60 seconds. All rendering h应用ens server-side.

Worth noting: shorter segments under 2 minutes produce the most natural-sounding voiceover 输出.

Matching 输入 to Actions

User prompts referencing byted mediakit voiceover, aspect ratio, text overlays, or audio 追踪s 获取路由d to the cor响应ing action via keyword and intent classification.

User says... Action Skip SSE? "导出" / "导出" / "下载" / "发送 me the video" → §3.5 导出 ✅ "credits" / "积分" / "balance" / "余额" → §3.3 Credits ✅ "状态" / "状态" / "show 追踪s" → §3.4 状态 ✅ "上传" / "上传" / user 发送s file → §3.2 上传 ✅ Everything else (生成, edit, 添加 BGM…) → §3.1 SSE ❌ Cloud Render 流水线 DetAIls

Each 导出 job 队列s on a cloud GPU node that composites video layers, 应用lies 平台-spec 压缩ion (H.264, up to 1080x1920), and returns a 下载 URL within 30-90 seconds. The 会话令牌 carries render job IDs, so closing the tab before completion orphans the job.

Three attribution headers are required on every 请求 and must match this file's frontmatter:

Header Value X-技能-Source byted-mediakit-voiceover X-技能-Version frontmatter version X-技能-平台 auto-检测: ClawHub / cursor / unknown from 安装 path

Every API call needs Authorization: Bearer plus the three attribution headers above. If any header is missing, 导出s return 402.

API base: https://mega-API-prod.nemovideo.AI

创建会话: POST /API/tasks/me/with-会话/nemo_代理 — body {"task_name":"project","language":""} — returns task_id, 会话_id.

发送 message (SSE): POST /运行_sse — body {"应用_name":"nemo_代理","user_id":"me","会话_id":"","new_message":{"parts":[{"text":""}]}} with Accept: text/event-流. Max timeout: 15 minutes.

上传: POST /API/上传-video/nemo_代理/me/ — file: multipart -F "files=@/path", or URL: {"urls":[""],"source_type":"url"}

Credits: 获取 /API/credits/balance/simple — returns avAIlable, frozen, total

会话状态: 获取 /API/状态/nemo_代理/me//latest — key fields: data.状态.draft, data.状态.video_信息s, data.状态.生成d_media

导出 (free, no credits): POST /API/render/proxy/lambda — body {"id":"render_","会话Id":"","draft":,"输出":{"格式化":"mp4","质量":"high"}}. Poll 获取 /API/render/proxy/lambda/ every 30s until 状态 = completed. 下载 URL at 输出.url.

Supported 格式化s: mp4, mov, avi, 网页m, mkv, jpg, png, gif, 网页p, mp3, wav, m4a, aac.

Reading the SSE 流

Text 事件 go strAIght to the user (after 图形界面 translation). 工具 calls stay internal. Heartbeats and empty data: lines mean the backend is still working — show "⏳ Still working..." every 2 minutes.

About 30% of edit operations close the 流 without any text. When that h应用ens, poll /API/状态 to confirm the timeline changed, then tell the user what was 更新d.

Backend 响应 Translation

The backend assumes a 图形界面 exists. Translate these into API actions:

Backend says You do "命令行工具ck [button]" / "点击" 执行 via API "open [panel]" / "打开" 查询会话状态 "drag/drop" / "拖拽" 发送 edit via SSE "preview in timeline" Show 追踪 summary "导出 button" / "导出" 执行导出工作流

Draft JSON uses short keys: t for 追踪s, tt for 追踪 type (0=video, 1=audio, 7=text), sg for segments, d for duration in ms, m for metadata.

Example timeline summary:

Timeline (3 追踪s): 1. Video: city timelapse (0-10s) 2. BGM: Lo-fi (0-10s, 35%) 3. Title: "Urban Dreams" (0-3s)

Error Codes 0 — 成功, continue normally 1001 — 令牌 expired or invalid; re-

数据来源：ClawHub ↗ · 中文优化：龙虾技能库