首页龙虾技能列表 › Qwen3-TTS VoiceDesign — 自然语言文本到语音,自定义语音设计

Qwen3-TTS VoiceDesign — 自然语言文本到语音,自定义语音设计

v1.0.0

Qwen3-TTS VoiceDesign 实现文本到语音的功能,允许通过自然语言描述和基于种子(seed)的音调固定设计自定义语音。包含 OpenAI 兼容的 API 服务器、一键式设置和批量种子探索工具,适用于语音生成、语音设计和集成到 OpenClaw。

0· 521·0 当前·0 累计
by @xiaoyaner0201·MIT-0
下载技能包
License
MIT-0
最后更新
2026/4/10
安全扫描
VirusTotal
可疑
查看报告
OpenClaw
安全
medium confidence
该包和脚本与自托管 TTS 服务器一致:安装 Python 依赖项、下载 TTS 模型并提供客户端/服务器脚本。文件中没有请求无关凭据或隐藏数据泄露端点,但安装前应审查一些操作风险。
评估建议
["预计大型下载 (~3.5GB) 和安装许多包(包括 torch/CUDA);建议在受控环境或 VM/容器中运行。","服务器启动时清除 HTTP(S)_PROXY 环境变量;如果需要代理,请绑定到 127.0.0.1 或配置防火墙。","验证信任 ModelScope/HuggingFace 和指定模型仓库。","避免传递不受信任的文本到客户端 shell 脚本。","如果暴露服务器,确保安全(防火墙、反向代理、认证)"]...
详细分析 ▾
用途与能力
Name/description (Qwen3-TTS VoiceDesign TTS server + client tools) matches the included scripts: a FastAPI server, client helpers, setup script and seed-batching tooling. The declared behavior (model download, one-click setup, OpenAI-compatible API) is consistent with the code.
指令范围
SKILL.md instructs running setup.sh which creates a venv, pip-installs dependencies, downloads the model (ModelScope or Hugging Face), and runs the server; the runtime scripts only reference their .env and local files. Notable scope items: the server code clears proxy environment variables at start (potentially bypassing a corporate proxy), and the docs show guidance to register scheduled tasks or systemd units (these are only instructions, not executed automatically). The client scripts build JSON bodies via shell interpolation (potential for malformed input/escaping issues if used with untrusted text).
安装机制
There is no platform install spec, but setup.sh will pip-install packages (qwen-tts, soundfile, pydub, uvicorn, fastapi, numpy and possibly modelscope and torch from the official PyTorch index). It downloads the ~3.5GB model via ModelScope or Hugging Face. These are expected for a local TTS runtime but do involve network access and large binary downloads; the sources used (ModelScope/HuggingFace, PyTorch wheel index) are standard release hosts rather than arbitrary shorteners.
凭证需求
The skill requests no credentials and exposes only environment variables relevant to running a local TTS server (seed, instruct, model path, host/port, format). The only surprising behavior is that the server explicitly clears HTTP(S) proxy environment variables at startup, which may affect network routing on hosts that rely on proxies; this is operational (not credential) behavior and not an attempt to read secrets.
持久化与权限
The skill is not always-enabled and does not attempt to change other skills' config. setup.sh suggests how to create systemd units or a Windows scheduled task, but it does not automatically create system-level services or elevate privileges. You must run setup/start manually, so persistence is user-controlled.
安全有层次,运行前请审查代码。

License

MIT-0

可自由使用、修改和再分发,无需署名。

运行时依赖

无特殊依赖

版本

latestv1.0.02026/2/25

初始发布:通过自然语言和种子固定的 VoiceDesign 声音设计,OpenAI 兼容 API 服务器,一键式设置和批量种子探索工具。

● 可疑

安装命令 点击复制

官方npx clawhub@latest install qwen3-tts-voicedesign
镜像加速npx clawhub@latest install qwen3-tts-voicedesign --registry https://cn.clawhub-mirror.com

技能文档

自然语言文本到语音 + 基于种子的音调固定设计... (中间代码块和 Markdown 格式保持不变,仅翻译了非代码部分)

数据来源:ClawHub ↗ · 中文优化:龙虾技能库
OpenClaw 技能定制 / 插件定制 / 私有工作流定制

免费技能或插件可能存在安全风险,如需更匹配、更安全的方案,建议联系付费定制

了解定制服务