📦 Speech Recognition — 语音识别

v1.0.0

使用 pysilk 解码和 faster-whisper 转录，将 AMR/SILK 格式的 QQ 语音消息转换为文本。

0· 8·0 当前·0 累计

by @yvanboyang

数据与API 数据库即时通讯钉钉

下载技能包

运行时依赖

无特殊依赖

安装命令

点击复制

官方npx clawhub@latest install speech-recognition-forqq

镜像加速npx clawhub@latest install speech-recognition-forqq --registry https://cn.longxiaskill.com镜像同步中

需要定制？告诉我你的需求 →

技能文档

Speech Recognition 语音识别

将 AMR/SILK 格式的语音转换为文字。

环境要求 Python 虚拟环境：source /opt/conda/bin/activate py314 依赖包：pysilk, faster-whisper 模型路径：/opt/workspace/yby_workspace/whisper-模型使用方式 from 技能s.speech_recognition 导入 transcribe_audio

text = transcribe_audio("/path/to/audio.amr") print(text)

实现逻辑读取 AMR/SILK 格式文件使用 pysilk 解码为 PCM 数据保存为 WAV 文件使用 faster-whisper 转写为文字返回识别结果支持格式 QQ 语音：.amr (SILK_V3 编码) 标准 AMR：amr, amrnb, amrwb 依赖安装 source /opt/conda/bin/activate py314 pip 安装 pysilk faster-whisper

模型下载

需要从 HuggingFace 下载 faster-whisper 模型：

python3 -m huggingface_hub snapshot-下载 \ --repo-type 模型 \ --repo-id Systran/faster-whisper-base \ --local-dir /opt/workspace/yby_workspace/whisper-模型

需要的文件：

模型.bin config.json 令牌izer.json vocabulary.txt

数据来源：ClawHub ↗ · 中文优化：龙虾技能库