Audio Transcribe: Audio Transcription

v1.0.0

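A speech-to-text skill. It uses a local Whisper install (openai-whisper) to transcribe audio files into plain text, subtitles (SRT), or JSON. Suited to meeting notes, podcast transcription, voice memos, and similar uses. Trigger phrases: 转写音频 (transcribe audio), 转录语音 (transcribe speech), 音频转文字 (audio to text), 语音转文本 (speech to text), whisper, 生成字幕 (generate subtitles).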

by @ai-tesing (SunnyTang)·MIT-0

License

MIT-0

Free to use, modify, and redistribute; no attribution required.

Runtime dependencies

No special dependencies

Install command

Official: npx clawhub@latest install wav-audio-transcribe

Mirror: npx clawhub@latest install wav-audio-transcribe --registry https://cn.longxiaskill.com

Skill documentation

Audio Transcribe Skill: speech-to-text using a local Whisper model. It runs fully offline, so audio never leaves the machine.

Prerequisites

Install Whisper (one-time setup):

    # macOS
    brew install whisper

    # Or the Python package (recommended; downloads the model automatically)
    pip3 install openai-whisper
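Before the first run it can help to confirm that the install is visible to Python. The following is a minimal sketch and not part of the skill: `check_prerequisites` is a hypothetical helper that reports whether a module named `whisper` can be located and whether an `ffmpeg` executable is on the PATH (openai-whisper relies on ffmpeg to decode audio).

```python
import importlib.util
import shutil


def check_prerequisites():
    """Report which pieces of the toolchain this Python can see.

    Hypothetical helper for illustration; the skill does not ship it.
    """
    return {
        # True if a module named `whisper` can be located (pip3 install openai-whisper)
        "whisper_module": importlib.util.find_spec("whisper") is not None,
        # True if an ffmpeg executable is on PATH
        "ffmpeg_binary": shutil.which("ffmpeg") is not None,
    }


for name, ok in check_prerequisites().items():
    print(f"{name}: {'found' if ok else 'MISSING'}")
```

If `whisper_module` is reported missing after a Homebrew install, the Python package route is the likely fix, since the script imports `whisper` from Python.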

Usage

Basic transcription (Chinese audio)

When the user says "转录这个音频" ("transcribe this audio"), run:

    python3 ~/.openclaw/workspace/skills/audio-transcribe/scripts/transcribe.py "/path/to/audio.wav"

Specifying a format

    # Output SRT subtitles
    python3 ~/.openclaw/workspace/skills/audio-transcribe/scripts/transcribe.py "/path/to/audio.wav" srt

    # Output JSON (with timestamps)
    python3 ~/.openclaw/workspace/skills/audio-transcribe/scripts/transcribe.py "/path/to/audio.wav" json

    # Specify the language
    python3 ~/.openclaw/workspace/skills/audio-transcribe/scripts/transcribe.py "/path/to/audio.wav" txt zh

    # English audio
    python3 ~/.openclaw/workspace/skills/audio-transcribe/scripts/transcribe.py "/path/to/audio.wav" txt en
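The commands above can also be assembled programmatically, for example by an agent that calls the script on the user's behalf. This is a sketch under assumptions: `build_command` and `run_transcription` are hypothetical names, and `run_transcription` assumes the script prints its result to stdout, which the listing does not confirm.

```python
import os
import subprocess

# Script location as given in the usage examples
SCRIPT = "~/.openclaw/workspace/skills/audio-transcribe/scripts/transcribe.py"


def build_command(audio_path, output_format="txt", language=None):
    """Build the argv list for transcribe.py.

    Arguments are positional, so the format must be present
    whenever a language is given.
    """
    cmd = ["python3", os.path.expanduser(SCRIPT), audio_path]
    if output_format != "txt" or language is not None:
        cmd.append(output_format)
    if language is not None:
        cmd.append(language)
    return cmd


def run_transcription(audio_path, output_format="txt", language=None):
    """Run the script and return its stdout (assumption: result goes to stdout)."""
    cmd = build_command(audio_path, output_format, language)
    completed = subprocess.run(cmd, capture_output=True, text=True, check=True)
    return completed.stdout


print(build_command("/path/to/audio.wav", "txt", "zh"))
```

Passing the arguments as a list rather than a shell string avoids quoting problems with paths that contain spaces.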

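Supported formats

| Format | Description | Use cases |
|--------|-------------|-----------|
| txt | Plain text (default) | Quick reading, archiving |
| srt | Subtitle file | Video encoding, foreign-language study |
| json | Structured result | Post-processing, timestamp extraction |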
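To make the srt format concrete, here is a sketch of how timed segments map to SRT text. It assumes segments shaped like the `segments` list in Whisper's transcription result (dicts with `start`, `end`, and `text`); whether transcribe.py builds its subtitles this way is not stated in the listing, and the `demo` data is invented for illustration.

```python
def format_timestamp(seconds):
    """Convert seconds (float) to the SRT timestamp form HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, millis = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def segments_to_srt(segments):
    """Render segments with 'start', 'end', 'text' keys as SRT text."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{format_timestamp(seg['start'])} --> {format_timestamp(seg['end'])}\n"
            f"{seg['text'].strip()}\n"
        )
    # A blank line separates consecutive subtitle entries
    return "\n".join(blocks)


# Invented sample data, not real transcription output
demo = [
    {"start": 0.0, "end": 2.5, "text": " Hello."},
    {"start": 2.5, "end": 3661.5, "text": " Goodbye."},
]
print(segments_to_srt(demo))
```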

Supported audio formats

.wav, .mp3, .m4a, .flac, .ogg, .opus, .mp4, .mov, and any other format supported by ffmpeg.

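Script parameters

    python3 transcribe.py <audio_file> [output_format] [language]

Parameters:

    audio_file      Path to the audio file (required)
    output_format   Output format: txt, srt, json (default: txt)
    language        Language code: zh, en, ja, ko, etc. (default: auto-detect)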
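The positional parameters and their defaults could be handled along the following lines. This is an illustrative sketch, not the actual contents of transcribe.py: `parse_args` and `VALID_FORMATS` are hypothetical names, and the real script may validate its input differently.

```python
VALID_FORMATS = ("txt", "srt", "json")


def parse_args(argv):
    """Return (audio_file, output_format, language) from positional args."""
    if not argv:
        raise SystemExit(
            "usage: transcribe.py <audio_file> [output_format] [language]"
        )
    audio_file = argv[0]
    output_format = argv[1] if len(argv) > 1 else "txt"  # default: txt
    language = argv[2] if len(argv) > 2 else None  # None means auto-detect
    if output_format not in VALID_FORMATS:
        raise SystemExit(f"unsupported output format: {output_format}")
    return audio_file, output_format, language


print(parse_args(["meeting.wav"]))
print(parse_args(["meeting.wav", "srt", "zh"]))
```

Because the arguments are positional, a language can only be given together with an explicit format, which is why the usage examples write `txt zh` rather than `zh` alone.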

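Notes

- The first run downloads the model (~500MB); please be patient.
- The higher the audio quality, the more accurate the transcription.
- Available Whisper models: tiny, base, small, medium, large. The default is base.
- To switch models, change whisper.load_model('base') in the script to one of the other options.
- Long audio is split into segments and processed automatically.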
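Rather than editing the `whisper.load_model('base')` line by hand each time, the model name could be passed in as a parameter. The sketch below assumes openai-whisper is installed when `transcribe` is called; `resolve_model_name` and `transcribe` are hypothetical helpers, not functions from the skill's script.

```python
MODEL_NAMES = ("tiny", "base", "small", "medium", "large")


def resolve_model_name(name=None):
    """Return a validated model name, defaulting to 'base'."""
    if name is None:
        return "base"
    if name not in MODEL_NAMES:
        raise ValueError(f"unknown model {name!r}; choose from {MODEL_NAMES}")
    return name


def transcribe(audio_path, model_name=None, language=None):
    """Load the chosen model and transcribe one file.

    Requires openai-whisper; the import is deferred so that
    resolve_model_name can be used without it installed.
    """
    import whisper

    model = whisper.load_model(resolve_model_name(model_name))
    # language=None lets Whisper detect the spoken language itself
    result = model.transcribe(audio_path, language=language)
    return result["text"]


print(resolve_model_name())         # default model
print(resolve_model_name("small"))  # explicit choice
```

Larger models are generally more accurate but slower and need a bigger download, so the right choice depends on the hardware and how much accuracy matters.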
