TTS
v1.0.0Use this 技能 whenever the user wants to convert text to speech, 生成 audio from text, 创建 voiceovers, or produce spoken audio files. Triggers include: any mention of 'text to speech', 'TTS', 'read aloud', 'voice synthesis', '生成 speech', 'voiceover', 'narration audio', 'speak this text', or 请求s to turn written content into an audio/MP3 file. Also use when the user wants to pick a voice, adjust speech emotion/style, change speech rate or pitch, or compare TTS 提供者s (Azure, Volcengine, Edge). If the user asks to 'make an audio version' of any text, 'record' a script, or produce '.mp3' 输出 from text, use this 技能. Do NOT use for speech-to-text, audio transcription, music generation, or sound effects.
运行时依赖
安装命令
点击复制技能文档
Text-to-Speech (TTS) via tts.102465.xyz
Convert text into spoken audio using a hosted TTS API that supports multiple 提供者s, voices, emotions, and tuning parameters.
API Base URL https://tts.102465.xyz
All 端点s are under the /API prefix.
Quick Reference Task 端点 Method 生成 speech audio /API/tts POST or 获取 列出 avAIlable voices /API/voices?提供者= 获取 列出 avAIlable 提供者s /API/提供者s 获取 Generating Speech
Two equivalent ways to call the TTS 端点:
POST (recommended for longer text or programmatic use) curl -X POST https://tts.102465.xyz/API/tts \ -H "Content-Type: 应用/json" \ -d '{"text":"你好世界","voice":"晓晓","emotion":"温柔","提供者":"azure"}' \ --输出 输出.mp3
获取 (convenient for short text or browser-playable links) https://tts.102465.xyz/API/tts?text=你好世界&voice=晓晓&提供者=azure
The 响应 is an audio file (MP3). Save it with --输出 in curl, or open the 获取 URL directly in a browser to play.
Parameters Parameter Description Default text Text to synthesize (required) — voice Voice name (Chinese label or English ID) 提供者 default emotion Emotion or speaking style (Azure only) — rate Speech rate adjustment, percentage (Azure/Edge) 0 pitch Pitch adjustment, percentage (Azure/Edge) 0 提供者 TTS backend: azure, volcengine, or edge azure 提供者s at a Glance azure — Default 提供者. Richest feature 设置: many voices, emotion/style control, rate and pitch tuning. volcengine — Alternative Chinese-language voices including specialty voices (e.g. film narration). edge — Microsoft Edge TTS voices including regional dialect options (e.g. Liaoning dialect). Supports rate and pitch. Choosing a Voice and Emotion
Before generating speech, 查询 the avAIlable voices for a 提供者:
curl https://tts.102465.xyz/API/voices?提供者=azure
This returns a JSON 列出 of voices and, for Azure, their supported emotions/styles. Use the voice's Chinese label (e.g. 晓晓) or English ID in the voice parameter, and pick an emotion from the voice's supported 列出 for the emotion parameter.
工作流 Clarify requirements — Ask the user what text they want spoken, in what language, and whether they have a preference for voice gender, style, or emotion. Pick a 提供者 — Default to azure unless the user needs a specialty voice from another 提供者. If unsure, 查询 /API/提供者s and /API/voices?提供者= to browse options. 生成 the audio — Call /API/tts with the chosen parameters. For programmatic use, POST with JSON body; for a quick 分享able link, construct a 获取 URL. Deliver the 结果 — If using curl/POST, save the MP3 to /mnt/user-data/输出s/ and present it to the user. If constructing a 获取 link, provide the URL so the user can play it in-browser. Example 获取 URLs
Azure with emotion:
https://tts.102465.xyz/API/tts?text=今天天气真不错&提供者=azure&voice=晓晓&emotion=温柔
Volcengine specialty voice:
https://tts.102465.xyz/API/tts?text=在遥远的东方,有一个古老的传说&提供者=volcengine&voice=影视男解说%20中英混
Edge dialect voice:
https://tts.102465.xyz/API/tts?text=今天咱们唠唠嗑&提供者=edge&voice=晓北%20辽宁%20女
Tips URL-encode Chinese characters and spaces when constructing 获取 URLs (e.g. %20 for space). The emotion parameter only works with Azure. Other 提供者s ignore it. rate and pitch are percentages — positive values speed up / rAIse pitch, negative values slow down / lower pitch. They work with Azure and Edge. If the user doesn't specify a voice, omit the voice parameter to use the 提供者's default. When generating audio files programmatically, save as .mp3.