🎙️ transcribe — 技能工具

v1.0.0

Speech-to-text via SkillBoss API Hub (STT, powered by Whisper and more).

0· 27·0 当前·0 累计
kirkraman 头像by @kirkraman (KirkRaman)·MIT-0
下载技能包
License
MIT-0
最后更新
2026/4/16
0
安全扫描
VirusTotal
无害
查看报告
OpenClaw
安全
high confidence
The skill is internally consistent: it sends audio to SkillBoss API for transcription and only requires a single SKILLBOSS_API_KEY environment variable, with no install steps or unrelated permissions requested.
评估建议
This skill will transmit any audio you provide to api.skillbossai.com using the SKILLBOSS_API_KEY. Before installing: confirm you trust SkillBoss (review their privacy/security and pricing), use an API key scoped with minimal privileges, avoid sending sensitive audio to the service, and be prepared to rotate the key if needed. Also verify the publisher (registry owner and homepage) since the SKILL.md label ('openai-whisper') differs from the registry name — likely harmless but worth checking. Fi...
详细分析 ▾
用途与能力
The skill's description (transcribe via SkillBoss) matches the runtime instructions which call https://api.skillbossai.com/v1/pilot. Minor inconsistency: SKILL.md uses name 'openai-whisper' while registry name is 'transcribe' and source is listed as unknown — this is likely cosmetic but worth verifying the publisher.
指令范围
Instructions show reading a local audio file, base64-encoding it, and POSTing it to api.skillbossai.com with the SKILLBOSS_API_KEY. This is expected for an STT skill, but it does mean any audio you feed will be transmitted to an external service — confirm you are comfortable sending that data and that the agent will only read intended files.
安装机制
No install spec and no code files (instruction-only). This minimizes local persistence and disk writes.
凭证需求
Only SKILLBOSS_API_KEY is required, which is proportionate to calling a third-party API. No other unrelated secrets or config paths are requested.
持久化与权限
always is false and the skill does not request elevated or persistent platform privileges. It does allow normal autonomous invocation (platform default).
安全有层次,运行前请审查代码。

License

MIT-0

可自由使用、修改和再分发,无需署名。

运行时依赖

无特殊依赖

版本

latestv1.0.02026/4/16

- Initial release of speech-to-text skill powered by Whisper and other models via SkillBoss API Hub. - Supports audio transcription and translation to English using a simple API call. - No local model setup required; just set the SKILLBOSS_API_KEY environment variable. - Includes Python quick start guide for immediate use. - Clear instructions for retrieving transcribed or translated text from API responses.

无害

安装命令

点击复制
官方npx clawhub@latest install kirk-transcribe
镜像加速npx clawhub@latest install kirk-transcribe --registry https://cn.longxiaskill.com

技能文档

Use SkillBoss API Hub's /v1/pilot to transcribe audio (STT), powered by OpenAI Whisper and other speech recognition models.

Quick start (Python)

import requests, base64, os

SKILLBOSS_API_KEY = os.environ["SKILLBOSS_API_KEY"] API_BASE = "https://api.skillbossai.com/v1"

def pilot(body: dict) -> dict: r = requests.post( f"{API_BASE}/pilot", headers={"Authorization": f"Bearer {SKILLBOSS_API_KEY}", "Content-Type": "application/json"}, json=body, timeout=60, ) return r.json()

# Transcribe audio file audio_b64 = base64.b64encode(open("/path/audio.mp3", "rb").read()).decode() result = pilot({"type": "stt", "inputs": {"audio_data": audio_b64, "filename": "audio.mp3"}}) text = result["result"]["text"] print(text)

# Translate audio to English result = pilot({"type": "stt", "inputs": {"audio_data": audio_b64, "filename": "audio.m4a", "task": "translate"}}) text = result["result"]["text"] print(text)

Notes

  • No local model download required; SkillBoss API Hub automatically routes to the best STT model.
  • SKILLBOSS_API_KEY environment variable required.
  • Response text is at result["result"]["text"].
数据来源ClawHub ↗ · 中文优化:龙虾技能库