🎤 Aliyun Speech Transcriber — 技能工具
v0.1.0[自动翻译] Transcribe publicly accessible audio or video URLs with Aliyun speech services. Use when the user wants speech-to-text via Aliyun DashScope, needs tra...
详细分析 ▾
运行时依赖
版本
Initial release of Aliyun Speech Transcriber skill. - Enables transcription of publicly accessible audio/video URLs via Aliyun DashScope. - Supports JSON and plain-text transcript extraction from media URLs. - Accepts multiple URLs and integrates with Qiniu-uploaded media. - Requires `ASR_DASHSCOPE_API_KEY` environment variable for authentication. - Provides configurable model, language hints, polling interval, and timeout options. - Returns structured JSON including transcript text and metadata.
安装命令
点击复制本土化适配说明
Aliyun Speech Transcriber — 技能工具 安装说明: 安装命令:npx clawhub@latest install aliyun-speech-transcriber 该技能用于淘宝相关操作,可能需要相应的平台账号或API密钥
技能文档
Use this skill to turn externally accessible media URLs into transcript results.
Current scope
Current implementation focuses on DashScope file transcription using the paraformer-v2 model, aligned with the existing Java service pattern.
Required environment variables
ASR_DASHSCOPE_API_KEY
Fallback supported:
DASHSCOPE_API_KEY
Optional:
ALIYUN_SPEECH_MODEL- defaults toparaformer-v2ALIYUN_SPEECH_LANG_HINTS- defaults tozh,enALIYUN_SPEECH_POLL_SECONDS- defaults to5ALIYUN_SPEECH_TIMEOUT_SECONDS- defaults to1800
Inputs
Pass one or more externally accessible URLs:
node scripts/transcribe.js --file-url "https://example.com/audio.mp3"
Multiple files:
node scripts/transcribe.js --file-url "https://a.com/1.mp3" --file-url "https://a.com/2.mp3"
Output
The script returns JSON with:
successproviderenginetaskIdrequestIdresultstext
text is a best-effort plain-text extraction from the final JSON result.
Chaining from Qiniu
Typical workflow:
- Use
qiniu-uploadto upload a local file. - Prefer a signed private URL if the domain is not anonymously readable.
- Pass the returned URL into this skill.
Safety rules
- Never hardcode Aliyun credentials.
- Fail fast if
DASHSCOPE_API_KEYis missing. - Only send URLs the user intends to transcribe.