OpenAI Whisper API (curl)
Transcribe an audio file via OpenAI's /v1/audio/transcriptions endpoint.
Quick start
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a
Defaults:
Model: whisper-1
Output: <input>.txt
Whisper.cpp example
/AI/whisper.cpp/build/bin/whisper-server -m /AI/models/whisper/ggml-large-v3-turbo-q8_0.bin --host 192.168.0.55 --port 5005 -sow --vad --vad-model /AI/models/whisper/ggml-silero-v6.2.0.bin --inference-path /v1/audio/transcriptions
Useful flags
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --model whisper-1 --out /tmp/transcript.txt
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --language en
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --prompt "Speaker names: Peter, Daniel"
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --json --out /tmp/transcript.json
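For orientation, here is a minimal sketch of the kind of curl request a wrapper like transcribe.sh could send. It is not the script's actual source. The function name transcribe_sketch, the CURL override, and the treatment of WHISPER_API_HOST as a base URL (scheme, host, and port) are all assumptions made for illustration. The form fields file, model, and response_format are part of the OpenAI transcription API.

```shell
# Hypothetical helper; the real transcribe.sh may be implemented differently.
transcribe_sketch() {
  local file="$1"
  local model="${2:-whisper-1}"   # same default model the skill documents
  local format="${3:-text}"       # pass "json" for a JSON response
  # Assumption: WHISPER_API_HOST holds a base URL; fall back to OpenAI's API.
  local host="${WHISPER_API_HOST:-https://api.openai.com}"

  # ${CURL:-curl} allows a dry run, e.g. CURL=echo prints the arguments
  # instead of sending the request.
  ${CURL:-curl} -sS "${host%/}/v1/audio/transcriptions" \
    -H "Authorization: Bearer ${WHISPER_API_KEY:-}" \
    -F "file=@${file}" \
    -F "model=${model}" \
    -F "response_format=${format}"
}
```

Calling transcribe_sketch /path/to/audio.m4a > transcript.txt would write plain text to a file. With WHISPER_API_HOST=http://192.168.0.55:5005, the same request should reach the whisper.cpp server started above, since it was launched with --inference-path /v1/audio/transcriptions. The API also accepts language and prompt form fields, which could be added as further -F arguments.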
API key
Set WHISPER_API_KEY and WHISPER_API_HOST, or configure them in ~/.clawdbot/clawdbot.json:
{
  skills: {
    "openai-whisper-api": {
      apiKey: "WHISPER_API_KEY_HERE",
      apiHost: "WHISPER_API_HOST_HERE"
    }
  }
}
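If the environment-variable route is used, a small pre-flight check can catch a missing value before any audio is uploaded. The sketch below is illustrative only: check_whisper_env is a hypothetical helper, not part of the skill, and it only looks at the two variables named above.

```shell
# Hypothetical helper: verify both variables are set before transcribing.
check_whisper_env() {
  local status=0
  if [ -z "${WHISPER_API_KEY:-}" ]; then
    echo "WHISPER_API_KEY is not set" >&2
    status=1
  fi
  if [ -z "${WHISPER_API_HOST:-}" ]; then
    echo "WHISPER_API_HOST is not set" >&2
    status=1
  fi
  return "$status"
}
```

A typical use would be to export both variables in the shell profile and then run check_whisper_env && {baseDir}/scripts/transcribe.sh /path/to/audio.m4a.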