Edge TTS — 文本转语音

Name: Edge TTS — 文本转语音
Rating: 28

v2.0.0

基于 node-edge-tts 的 npm 包，快速将文本转为多语种语音，可自由调节语速、音调，并自动生成字幕，适合多任务、无障碍、驾驶等听读场景。

28· 1.8万·245 当前·259 累计

by @i3130002

数据与API AI模型访问

使用场景：使用Edge TTS — 文本转语音进行数据与API使用Edge TTS — 文本转语音

下载技能包

最后更新

2026/2/26

安全扫描

VirusTotal

无害

查看报告

OpenClaw

安全

high confidence

该技能使用 Microsoft Edge 的神经 TTS 服务和 node-edge-tts npm 包将文本转换为高质量语音。支持多语言、多声音、可调节语速/音调、字幕生成，无需 API 密钥，为免费服务。

评估建议

该技能安全风险较低，主要提供文本转语音功能。 1. **在线服务**：使用 Microsoft Edge 的在线 TTS 服务，需要网络连接，无 API 密钥需求（免费服务） 2. **音频输出**：默认输出 MP3 格式，临时文件存储在系统临时目录，不会自动删除，调用应用需负责清理 3. **可自定义性**：支持声音选择、语速/音调/音量调节、输出格式选择、字幕生成 4. **关键词过滤**：自动过滤 TTS 相关关键词（tts、TTS、text-to-speech），避免转换触发词本身 **使用注意**： - 需要稳定的网络连接访问 Microsoft TTS 服务 - 临时音频文件不会自动删除，建议定期清理 - 对于重复偏好，使用 config-manager.js 设置默认值 - Neural 声音（以 Neural 结尾）质量高于 Standard 声音 - 可通过 --list-voices 查看完整声音列表...

详细分析 ▾

✓ 用途与能力

Name/description match the included scripts and package.json. Declared dependencies (node-edge-tts, commander) and the CLI/helper scripts are appropriate for a Text‑to‑Speech skill; nothing in the manifest requests unrelated cloud credentials, OS tools, or privileged access.

ℹ 指令范围

SKILL.md stays on‑topic: it instructs agents to call the built‑in tts tool or the included scripts, documents options, and shows how to install and test. The scripts read/write a per‑user config (~/.tts-config.json) and write audio to a temp dir; they also accept a proxy option. Note: SKILL.md references an external preview site (https://tts.travisvn.com) — a third‑party testing URL that is separate from Microsoft Edge endpoints and should be validated if you plan to trust it.

✓ 安装机制

No high‑risk download/install host is used. Install flow is npm install in the scripts directory (install.sh provided). Dependencies are from the public npm registry and package-lock.json lists expected packages; nothing is fetched from obscure shorteners or arbitrary URLs.

✓ 凭证需求

The skill declares no required environment variables or credentials. The config manager persists user preferences to ~/.tts-config.json (voice, proxy, timeout, etc.), which is proportional for a user‑configurable TTS client. Be aware the proxy field could be pointed at a capture proxy by a user or operator — the skill will route requests there if configured.

✓ 持久化与权限

The skill does not request always:true and does not modify other skills or system settings. Its persistent footprint is limited to a per‑user config file and temporary audio files in the system temp directory, which is appropriate for this functionality.

安全有层次，运行前请审查代码。

运行时依赖

无特殊依赖

版本

latestv2.0.02026/1/25

**Major upgrade with added scripts, configurability, and streamlined TTS triggers** - Added Node.js scripts for TTS conversion and config management (`tts-converter.js`, `config-manager.js`), with installation and usage instructions. - Added resource and reference files, including a complete voice/option guide and install script. - Changed TTS intent detection to trigger only on the "tts" keyword (removes long trigger phrase lists), and clarified keyword filtering prior to conversion. - Expanded documentation with workflow, usage examples, advanced configuration, troubleshooting, and testing instructions. - Clarified default voice, output formats, temporary file handling, and recommended voice test site.

● 无害

安装命令

点击复制

官方npx clawhub@latest install edge-tts

镜像加速npx clawhub@latest install edge-tts --registry https://cn.longxiaskill.com 镜像可用

本土化适配说明

Edge TTS — 文本转语音安装说明：安装命令：npx clawhub@latest install edge-tts 支持国内镜像加速，使用 --registry https://cn.longxiaskill.com 参数可加速下载

需要定制？告诉我你的需求 →

技能文档

概述

通过 node-edge-tts npm 包调用 Microsoft Edge 神经 TTS 服务，生成高质量文本转语音音频。支持多语言、多声音、可调节语速/音调及字幕生成。

快速开始

当检测到触发词或用户请求中的 TTS 意图时：

调用 tts 工具（Clawdbot 内置）将文本转换为语音
工具返回 MEDIA: 路径
Clawdbot 将音频路由到当前频道

// 示例：内置 tts 工具用法
tts("Your text to convert to speech")
// 返回: MEDIA: /path/to/audio.mp3

触发词检测

将 "tts" 关键词识别为 TTS 请求。技能会在转换前自动过滤 TTS 相关关键词，避免将触发词本身转换为语音。

高级自定义

使用 Node.js 脚本

如需更多控制，可直接使用绑定的脚本：

TTS 转换器

cd scripts
npm install
node tts-converter.js "Your text" --voice en-US-AriaNeural --rate +10% --output output.mp3

参数选项：

--voice, -v：声音名称（默认：en-US-AriaNeural）
--lang, -l：语言代码（例如：en-US、es-ES、zh-CN）
--format, -o：输出格式（默认：audio-24khz-48kbitrate-mono-mp3）
--pitch：音调调节（例如：+10%、-20%、default）
--rate, -r：语速调节（例如：+10%、-20%、default）
--volume：音量调节（例如：+0%、-50%、+100%、default）
--output, -f：输出文件路径
--list-voices：列出所有可用声音
--list-formats：列出所有可用输出格式
--write-subtitle, -s：生成音频字幕文件
--compress：压缩音频输出

字幕生成器

node subtitle-generator.js input.mp3 --format vtt --output subtitles.vtt

参数选项：

--format：字幕格式（vtt 或 srt，默认：vtt）
--output, -o：输出文件路径

声音选择

质量优先级：Neural > NeuralHQ > Neural2 > Standard

Neural 声音（以 "Neural" 或 "Neural2" 结尾）提供最佳音质。Standard 声音作为兼容性备选。

声音列表因语言而异。使用 --list-voices 查找目标语言对应的声音：

node tts-converter.js --list-voices
# 按语言筛选:
node tts-converter.js --list-voices | grep "zh-CN"

配置管理

对于重复使用相同偏好的场景，使用配置管理器：

const { ConfigManager } = require('./config-manager.js');
const configManager = new ConfigManager();
// 设置默认声音
configManager.set('voice', 'en-US-AriaNeural');
// 设置默认输出目录
configManager.set('outputDir', './tts-output');// 获取当前配置
const config = configManager.get();

最佳实践

使用 Neural 声音：其音质显著优于 Standard 声音
测试不同声音：不同声音具有不同特性，为内容找到最佳匹配
考虑语言匹配：使用与内容语言一致的声音以获得最佳发音
处理网络问题：TTS 需要网络连接，生产环境中实施重试逻辑
清理临时文件：临时音频文件不会自动删除
大文件使用压缩：--compress 标志可减少大文件体积
生成字幕：为无障碍访问和视频内容，生成音频时同步生成字幕