Free Audio Editor

Name: Free Audio Editor
Author: udnerc


v1.0.0

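Use this free audio editor skill to edit audio files (MP3, WAV, AAC, M4A, up to 200MB) and produce clean audio, rendered as video, through cloud AI processing. It suits podcasters, content creators, and students, and supports background noise removal, silence trimming, and volume normalization. Processing takes only 30-60 seconds and outputs a 1080p MP4 file.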


Audio processing · Cloud services · AI model access


License

MIT-0

Last updated

2026/4/11

Security scan

VirusTotal: benign

OpenClaw: suspicious (medium confidence)

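The skill broadly matches a front end for a cloud audio editing service, but its runtime instructions contain mismatches and scope creep (automatic token creation, filesystem probing of install paths, and inconsistent metadata), so caution is warranted.

Assessment recommendations

This skill uploads your audio to an external service (mega-api-prod.nemovideo.ai) and, if NEMO_TOKEN is not set, automatically requests or creates an anonymous token. Before installing, consider:

(1) Do you trust this external service to process your audio? Avoid uploading sensitive recordings.
(2) The skill may make automatic background network calls when first opened and store a session identifier, so expect some session state to persist.
(3) SKILL.md asks the agent to probe install paths and read its frontmatter metadata. Ask the publisher why filesystem reads are needed and why the metadata listings are inconsistent (the config paths do not match).
(4) There is no install script, so nothing is written at install time, but runtime behavior includes network uploads.

If you need stronger guarantees, request the source code from the skill owner, review the implementation, or test with non-sensitive audio first.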

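Detailed analysis

ℹ Purpose and capabilities

The name and description (audio editing via cloud rendering) are consistent with the described API endpoints and upload/export flow. Requiring NEMO_TOKEN is reasonable. However, the SKILL.md frontmatter declares a config path (~/.config/nemovideo/), while the registry metadata lists no required config paths; this inconsistency should be explained. Probing the user's install path to set X-Skill-Platform (~/.clawhub, ~/.cursor) is not necessary for core audio editing and appears incidental.

⚠ Instruction scope

The instructions direct the agent to obtain an anonymous token (a POST to an external endpoint) and to connect automatically on first open without explicit user consent. They also direct it to read the YAML frontmatter and detect install paths on the user's filesystem to populate attribution headers; both involve local file or path reads beyond the user's audio upload. Uploading user audio to an external API is expected, but automatic background network calls and filesystem probing are scope expansions users should be aware of.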

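✓ Installation mechanism

There is no install spec and there are no code files; the skill is instruction-only, so nothing is downloaded or written to disk by an installer. This minimizes install-time risk.

ℹ Credential requirements

Only NEMO_TOKEN is declared as required (primaryEnv). That is proportionate for a cloud editing service. However, SKILL.md describes automatically generating and storing an anonymous token, which means the skill will run a network authentication flow and persist a session_id for later calls; users should be aware that temporary credentials and session identifiers will be created and used. The mismatch between the registry metadata (no configPaths) and the SKILL.md frontmatter (which mentions ~/.config/nemovideo/) deserves clarification.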

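ℹ Persistence and permissions

The skill does not request 'always: true' and is user-invocable, so it is not force-included. It asks to store session_id for subsequent requests and to reconnect automatically on first open. This is reasonable for a session-based client, but it means the skill makes autonomous network requests on first activation. It does not request to modify other skills or system-wide settings.

Security is layered; review the code before running.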

License

MIT-0

Free to use, modify, and redistribute; no attribution required.

Runtime dependencies

No special dependencies

Version

latest: v1.0.0 (2026/4/11)

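Initial release of the Free Audio Editor skill.
- Edit and clean up audio files (MP3, WAV, AAC, M4A, up to 200MB) using cloud AI processing.
- Output processed audio as 1080p MP4 or other supported formats through a fast online pipeline.
- Automatic setup with a free authentication token carrying 100 credits and valid for 7 days; seamless session management.
- Simple commands to upload, edit, check credits, and export results.
- Background noise removal, silence trimming, and volume normalization in under a minute.
- Clear error messages that guide users on file formats, token issues, and export limits.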


Install command

Official: npx clawhub@latest install free-audio-editor

Mirror: npx clawhub@latest install free-audio-editor --registry https://cn.clawhub-mirror.com

Skill documentation


Getting Started

Share your audio files and I'll get started on AI audio editing. Or just tell me what you're thinking.

Try saying:

"edit my audio files"
"export 1080p MP4"
"remove background noise, trim silence, and normalize the volume"

First-Time Connection

When a user first opens this skill, connect to the processing backend automatically. Briefly let them know (e.g. "Setting up...").

Authentication: Check if NEMO_TOKEN is set in the environment. If it is, skip to step 2.

1. Obtain a free token: Generate a random UUID as client identifier. POST to https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with header X-Client-Id set to that UUID. The response data.token is your NEMO_TOKEN (100 free credits, valid 7 days).
2. Create a session: POST to https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemo_agent with Authorization: Bearer <NEMO_TOKEN>, Content-Type: application/json, and body {"task_name":"project","language":""}. Store the returned session_id for all subsequent requests.

Keep setup communication brief. Don't display raw API responses or token values to the user.
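
The setup steps above can be sketched as plain request descriptions. This is a minimal illustration, not the service's client: the function names and the dict layout (`method`, `url`, `headers`, `body`) are assumptions made for the example, and nothing is sent over the network. Only the URLs, header names, and JSON body come from the text above.

```python
import json
import os
import uuid

API_BASE = "https://mega-api-prod.nemovideo.ai"


def anonymous_token_request(client_id: str) -> dict:
    """Describe the POST that requests an anonymous token (hypothetical helper)."""
    return {
        "method": "POST",
        "url": f"{API_BASE}/api/auth/anonymous-token",
        "headers": {"X-Client-Id": client_id},
        "body": None,
    }


def create_session_request(token: str) -> dict:
    """Describe the POST that opens a session with the given token (hypothetical helper)."""
    return {
        "method": "POST",
        "url": f"{API_BASE}/api/tasks/me/with-session/nemo_agent",
        "headers": {
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        "body": json.dumps({"task_name": "project", "language": ""}),
    }


def first_request(env=None) -> dict:
    """Pick the first setup request: skip token creation when NEMO_TOKEN is set."""
    env = os.environ if env is None else env
    token = env.get("NEMO_TOKEN")
    if token:
        return create_session_request(token)
    # No token yet: identify this client with a fresh random UUID.
    return anonymous_token_request(str(uuid.uuid4()))
```

Each dict can be handed to whichever HTTP client the agent uses. After the token request succeeds, the token would be read from `data.token` in the response and passed to `create_session_request`.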

# Free Audio Editor — Edit and Export Clean Audio

This tool takes your audio files and runs AI audio editing through a cloud rendering pipeline. You upload, describe what you want, and download the result.

Say you have a 3-minute podcast recording with background noise and want to remove background noise, trim silence, and normalize the volume — the backend processes it in about 30-60 seconds and hands you a 1080p MP4.

Tip: shorter audio clips under 2 minutes process significantly faster.

Matching Input to Actions

User prompts referencing free audio editor, aspect ratio, text overlays, or audio tracks get routed to the corresponding action via keyword and intent classification.

| User says... | Action | Skip SSE? |
| --- | --- | --- |
| "export" / "导出" / "download" / "send me the video" | → §3.5 Export | ✅ |
| "credits" / "积分" / "balance" / "余额" | → §3.3 Credits | ✅ |
| "status" / "状态" / "show tracks" | → §3.4 State | ✅ |
| "upload" / "上传" / user sends file | → §3.2 Upload | ✅ |
| Everything else (generate, edit, add BGM…) | → §3.1 SSE | ❌ |
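
A keyword-only version of the routing table might look like the sketch below. It covers just the keyword half of "keyword and intent classification"; the function name, the action labels, and the top-to-bottom precedence when several keywords match are assumptions for illustration.

```python
from typing import Tuple

# Keyword groups copied from the table; order defines precedence (an assumption).
ROUTES = [
    ("export", ("export", "导出", "download", "send me the video")),
    ("credits", ("credits", "积分", "balance", "余额")),
    ("state", ("status", "状态", "show tracks")),
    ("upload", ("upload", "上传")),
]


def route(prompt: str, has_file: bool = False) -> Tuple[str, bool]:
    """Return (action, skip_sse) for a user prompt."""
    if has_file:
        # "user sends file" maps to Upload regardless of the text.
        return ("upload", True)
    text = prompt.lower()
    for action, keywords in ROUTES:
        if any(keyword in text for keyword in keywords):
            return (action, True)
    # Everything else (generate, edit, add BGM...) goes through SSE.
    return ("sse", False)
```

Plain substring matching will misfire on prompts that merely mention a keyword, which is presumably why the skill pairs keywords with intent classification.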

Cloud Render Pipeline Details

Each export job queues on a cloud GPU node that composites video layers, applies platform-spec compression (H.264, up to 1080x1920), and returns a download URL within 30-90 seconds. The session token carries render job IDs, so closing the tab before completion orphans the job.

All requests must include: Authorization: Bearer <NEMO_TOKEN>, X-Skill-Source, X-Skill-Version, X-Skill-Platform. Missing attribution headers will cause export to fail with 402.

Skill attribution — read from this file's YAML frontmatter at runtime:

X-Skill-Source: free-audio-editor
X-Skill-Version: from frontmatter version
X-Skill-Platform: detect from install path (~/.clawhub/ → clawhub, ~/.cursor/skills/ → cursor, else unknown)
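
As a rough sketch, the attribution headers could be assembled as follows. The helper names are hypothetical, the version is passed in rather than parsed from the frontmatter, and matching the install path by substring is an assumption about what "detect from install path" means.

```python
def detect_platform(install_path: str) -> str:
    """Map an install path to a platform label using the rules listed above."""
    # Normalize separators and guarantee a trailing slash for matching.
    path = install_path.replace("\\", "/").rstrip("/") + "/"
    if "/.clawhub/" in path:
        return "clawhub"
    if "/.cursor/skills/" in path:
        return "cursor"
    return "unknown"


def build_headers(token: str, version: str, install_path: str) -> dict:
    """Build the headers that every request must carry."""
    return {
        "Authorization": f"Bearer {token}",
        "X-Skill-Source": "free-audio-editor",
        "X-Skill-Version": version,
        "X-Skill-Platform": detect_platform(install_path),
    }
```

For example, `build_headers(token, "1.0.0", "~/.clawhub/skills/free-audio-editor")` yields `X-Skill-Platform: clawhub`.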

API base: https://mega-api-prod.nemovideo.ai

Create session: POST /api/tasks/me/with-session/nemo_agent — body {"task_name":"project","language":""} — returns task_id, session_id.

Send message (SSE): POST /run_sse — body {"app_name":"nemo_agent","user_id":"me","session_id":"","new_message":{"parts":[{"text":""}]}} with Accept: text/event-stream. Max timeout: 15 minutes.

Upload: POST /api/upload-video/nemo_agent/me/ — file: multipart -F "files=@/path", or URL: {"urls":[""],"source_type":"url"}

Credits: GET /api/credits/balance/simple — returns available, frozen, total

Session state: GET /api/state/nemo_agent/me//latest — key fields: data.state.draft, data.state.video_infos, data.state.generated_media

Export (free, no credits): POST /api/render/proxy/lambda — body {"id":"render_","sessionId":"","draft":,"output":{"format":"mp4","quality":"high"}}. Poll GET /api/render/proxy/lambda/ every 30s until status = completed. Download URL at output.url.

Supported formats: mp4, mov, avi, webm, mkv, jpg, png, gif, webp, mp3, wav, m4a, aac.
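
The export polling described above could be sketched like this. The status fetch is injected as a callable so that no URL has to be guessed; `wait_for_export`, `max_polls`, and the assumption that `status` and `output.url` sit at the top level of the response are illustrative, not confirmed by the API description.

```python
import time
from typing import Callable


def wait_for_export(
    fetch_status: Callable[[], dict],
    interval_s: float = 30.0,
    max_polls: int = 30,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Poll until the render job reports status 'completed', then return its download URL."""
    for attempt in range(max_polls):
        job = fetch_status()
        if job.get("status") == "completed":
            return job["output"]["url"]
        if attempt < max_polls - 1:
            sleep(interval_s)  # wait 30s between polls by default
    raise TimeoutError(f"export not completed after {max_polls} polls")
```

Passing `sleep` as a parameter keeps the loop testable without real delays. A production version would also need to stop early on a failed job, whose status value is not documented here.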

Error Handling

| Code | Meaning | Action |
| --- | --- | --- |
| 0 | Success | Continue |
| 1001 | Bad/expired token | Re-auth via anonymous-token (tokens expire after 7 days) |
| 1002 | Session not found | New session §3.0 |
| 2001 | No credits | Anonymous: show registration URL with `?bind=` (get from create-session or state response when needed). Registered: "Top up credits in your account" |
| 4001 | Unsupported file | Show supported formats |
| 4002 | File too large | Suggest compress/trim |
| 400 | Missing X-Client-Id | Generate Client-Id and retry (see §1) |
| 402 | Free plan export blocked | Subscription tier issue, NOT credits. "Register or upgrade your plan to unlock export." |
| 429 | Rate limit (1 token/client/7 days) | Retry in 30s once |
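
One way to keep this table actionable in code is a small lookup, sketched below. The action labels and the fallback for unlisted codes are hypothetical names chosen for the example; only the codes and their meanings come from the table.

```python
# Error code -> next step, following the table above (labels are illustrative).
ERROR_ACTIONS = {
    0: "continue",
    1001: "reauthenticate",                # bad or expired token
    1002: "create_session",                # session not found
    2001: "prompt_for_credits",            # no credits left
    4001: "show_supported_formats",        # unsupported file
    4002: "suggest_compress_or_trim",      # file too large
    400: "generate_client_id_and_retry",   # missing X-Client-Id
    402: "prompt_register_or_upgrade",     # plan tier issue, not credits
    429: "retry_once_after_30s",           # rate limit
}


def action_for(code: int) -> str:
    """Return the next step for an error code, with a fallback for unlisted codes."""
    return ERROR_ACTIONS.get(code, "report_unknown_error")
```

Keeping 2001 and 402 as separate entries preserves the distinction the table draws between running out of credits and hitting a plan restriction.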

Translating GUI Instructions

The backend responds as if there's a visual interface. Map its instructions to API calls:

"click" or "点击" → execute the action via the relevant endpoint
"open" or "打开" → query session state to get the data
"drag/drop" or "拖拽" → send the edit command through SSE
"preview in timeline" → show a text summary of current tracks
"Export" or "导出" → run the export workflow

Reading the SSE Stream

Text events go straight to the user (after GUI translation). Tool calls stay internal. Heartbeats and empty data: lines mean the backend is still working — show "⏳ Still working..." every 2 minutes.

About 30% of edit operations close the stream without any text. When that happens, poll /api/state to confirm the timeline changed, then tell the user what was updated.
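
The stream handling above can be approximated with a small line filter. This is a simplified sketch: it treats every non-empty `data:` line as one opaque payload, does not join multi-line events, and does not separate text events from tool calls because the payload schema is not given here. The function names are hypothetical.

```python
from typing import Iterable, Iterator


def iter_sse_payloads(lines: Iterable[str]) -> Iterator[str]:
    """Yield non-empty data payloads; skip heartbeats, comments, and empty data lines."""
    for raw in lines:
        line = raw.rstrip("\r\n")
        if not line.startswith("data:"):
            continue  # comment lines, event names, blank separators
        payload = line[len("data:"):].strip()
        if payload:
            yield payload


def needs_state_poll(lines: Iterable[str]) -> bool:
    """True when the stream closed without any payload, so session state should be polled."""
    return next(iter_sse_payloads(lines), None) is None
```

When `needs_state_poll` returns true, the agent would fall back to the session state endpoint to see what changed.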

Draft field mapping: t=tracks, tt=track type (0=video, 1=audio, 7=text), sg=segments, d=duration(ms), m=metadata.

Timeline (3 tracks):
1. Video: city timelapse (0-10s)
2. BGM: Lo-fi (0-10s, 35%)
3. Title: "Urban Dreams" (0-3s)
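
A text summary like the one above could be produced from the draft roughly as follows. The nesting is an assumption: the sketch supposes `t` is a list of tracks, each with a `tt` type code and an `sg` list of segments carrying `d` in milliseconds, and that segment durations add up to the track length. Clip names and volume would come from the metadata field `m`, whose layout is not specified, so they are left out.

```python
TRACK_TYPES = {0: "Video", 1: "Audio", 7: "Text"}


def summarize_draft(draft: dict) -> str:
    """Render a short text timeline from a draft (assumed shape, see lead-in)."""
    tracks = draft.get("t", [])
    lines = [f"Timeline ({len(tracks)} tracks):"]
    for index, track in enumerate(tracks, start=1):
        label = TRACK_TYPES.get(track.get("tt"), "Unknown")
        # Sum segment durations (ms) and convert to seconds.
        total_ms = sum(segment.get("d", 0) for segment in track.get("sg", []))
        lines.append(f"{index}. {label}: {total_ms / 1000:g}s")
    return "\n".join(lines)
```
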

Tips and Tricks

The backend processes faster when you're specific. Instead of "make it look better", try "remove background noise, trim silence, and normalize the volume" — concrete instructions get better results.

Max file size is 200MB. Stick to MP3, WAV, AAC, M4A for the smoothest experience.
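
A pre-upload check based on these limits might look like the sketch below. Reading "200MB" as 200 × 1024 × 1024 bytes is an assumption, as are the function name and the wording of the messages.

```python
from pathlib import PurePath

MAX_BYTES = 200 * 1024 * 1024  # assumes "200MB" means 200 MiB
RECOMMENDED_SUFFIXES = {".mp3", ".wav", ".aac", ".m4a"}


def check_upload(filename: str, size_bytes: int) -> list:
    """Return a list of problems; an empty list means the file looks fine to upload."""
    problems = []
    if PurePath(filename).suffix.lower() not in RECOMMENDED_SUFFIXES:
        problems.append("not a recommended audio format (use MP3, WAV, AAC, or M4A)")
    if size_bytes > MAX_BYTES:
        problems.append("file exceeds 200MB (compress or trim it first)")
    return problems
```

Checking locally avoids a wasted upload that would come back as error 4001 or 4002.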

Export as MP4 for widest compatibility across platforms.

Common Workflows

Quick edit: Upload → "remove background noise, trim silence, and normalize the volume" → Download MP4. Takes 30-60 seconds for a 30-second clip.

Batch style: Upload multiple files in one session. Process them one by one with different instructions. Each gets its own render.

Iterative: Start with a rough cut, preview the result, then refine. The session keeps your timeline state so you can keep tweaking.

Data source: ClawHub · Chinese localization: 龙虾技能库
