TTS

v1.0.0

Use this 技能 whenever the user wants to convert text to speech, 生成 audio from text, 创建 voiceovers, or produce spoken audio files. Triggers include: any mention of 'text to speech', 'TTS', 'read aloud', 'voice synthesis', '生成 speech', 'voiceover', 'narration audio', 'speak this text', or 请求s to turn written content into an audio/MP3 file. Also use when the user wants to pick a voice, adjust speech emotion/style, change speech rate or pitch, or compare TTS 提供者s (Azure, Volcengine, Edge). If the user asks to 'make an audio version' of any text, 'record' a script, or produce '.mp3' 输出 from text, use this 技能. Do NOT use for speech-to-text, audio transcription, music generation, or sound effects.

0· 268·0 当前·0 累计

by @fengwm64 (fengwm)·MIT-0

开发工具代码生成文件处理云服务图像处理

下载技能包

License

MIT-0

License

MIT-0

可自由使用、修改和再分发，无需署名。

查看条款 ↗

运行时依赖

无特殊依赖

安装命令

点击复制

官方npx clawhub@latest install tts-api

镜像加速npx clawhub@latest install tts-api --registry https://cn.longxiaskill.com 镜像可用

需要定制？告诉我你的需求 →

技能文档

Text-to-Speech (TTS) via tts.102465.xyz

Convert text into spoken audio using a hosted TTS API that supports multiple 提供者s, voices, emotions, and tuning parameters.

API Base URL https://tts.102465.xyz

All 端点s are under the /API prefix.

Quick Reference Task 端点 Method 生成 speech audio /API/tts POST or 获取列出 avAIlable voices /API/voices?提供者= 获取列出 avAIlable 提供者s /API/提供者s 获取 Generating Speech

Two equivalent ways to call the TTS 端点:

POST (recommended for longer text or programmatic use) curl -X POST https://tts.102465.xyz/API/tts \ -H "Content-Type: 应用/json" \ -d '{"text":"你好世界","voice":"晓晓","emotion":"温柔","提供者":"azure"}' \ --输出输出.mp3

获取 (convenient for short text or browser-playable links) https://tts.102465.xyz/API/tts?text=你好世界&voice=晓晓&提供者=azure

The 响应 is an audio file (MP3). Save it with --输出 in curl, or open the 获取 URL directly in a browser to play.

Parameters Parameter Description Default text Text to synthesize (required) — voice Voice name (Chinese label or English ID) 提供者 default emotion Emotion or speaking style (Azure only) — rate Speech rate adjustment, percentage (Azure/Edge) 0 pitch Pitch adjustment, percentage (Azure/Edge) 0 提供者 TTS backend: azure, volcengine, or edge azure 提供者s at a Glance azure — Default 提供者. Richest feature 设置: many voices, emotion/style control, rate and pitch tuning. volcengine — Alternative Chinese-language voices including specialty voices (e.g. film narration). edge — Microsoft Edge TTS voices including regional dialect options (e.g. Liaoning dialect). Supports rate and pitch. Choosing a Voice and Emotion

Before generating speech, 查询 the avAIlable voices for a 提供者:

curl https://tts.102465.xyz/API/voices?提供者=azure

This returns a JSON 列出 of voices and, for Azure, their supported emotions/styles. Use the voice's Chinese label (e.g. 晓晓) or English ID in the voice parameter, and pick an emotion from the voice's supported 列出 for the emotion parameter.

工作流 Clarify requirements — Ask the user what text they want spoken, in what language, and whether they have a preference for voice gender, style, or emotion. Pick a 提供者 — Default to azure unless the user needs a specialty voice from another 提供者. If unsure, 查询 /API/提供者s and /API/voices?提供者= to browse options. 生成 the audio — Call /API/tts with the chosen parameters. For programmatic use, POST with JSON body; for a quick 分享able link, construct a 获取 URL. Deliver the 结果 — If using curl/POST, save the MP3 to /mnt/user-data/输出s/ and present it to the user. If constructing a 获取 link, provide the URL so the user can play it in-browser. Example 获取 URLs

Azure with emotion:

https://tts.102465.xyz/API/tts?text=今天天气真不错&提供者=azure&voice=晓晓&emotion=温柔

Volcengine specialty voice:

https://tts.102465.xyz/API/tts?text=在遥远的东方，有一个古老的传说&提供者=volcengine&voice=影视男解说%20中英混

Edge dialect voice:

https://tts.102465.xyz/API/tts?text=今天咱们唠唠嗑&提供者=edge&voice=晓北%20辽宁%20女

Tips URL-encode Chinese characters and spaces when constructing 获取 URLs (e.g. %20 for space). The emotion parameter only works with Azure. Other 提供者s ignore it. rate and pitch are percentages — positive values speed up / rAIse pitch, negative values slow down / lower pitch. They work with Azure and Edge. If the user doesn't specify a voice, omit the voice parameter to use the 提供者's default. When generating audio files programmatically, save as .mp3.

License

运行时依赖

安装命令

技能文档

相关技能推荐