Burmese Audio Understanding
v1.2.2
High-accuracy Burmese audio transcription using Gemini 3.1 Flash Preview.
Burmese Audio Understanding Skill
This skill transcribes Burmese audio (voice notes, speech) directly into Burmese text using your own Google Gemini API key. It uses the official Google GenAI SDK for secure and reliable file handling.
Required Environment Variables
GEMINI_API_KEY: Required. Set this to your Google Gemini API key so the skill can access the transcription service.
Usage
Ensure GEMINI_API_KEY is set in your environment, then run:
node scripts/transcribe-direct.js /path/to/my-audio.ogg
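The contents of scripts/transcribe-direct.js are not shown here, so the following is only a sketch of the input checks such a script might run before contacting Gemini. The helper names readApiKey and readAudioPath are hypothetical and are not taken from the skill's source.

```javascript
// Hypothetical helper (not from the skill's source): read the API key from an
// environment object such as process.env and fail early if it is missing.
function readApiKey(env) {
  const key = (env.GEMINI_API_KEY || "").trim();
  if (!key) {
    throw new Error("GEMINI_API_KEY is not set; export it before running the script.");
  }
  return key;
}

// Hypothetical helper: take the audio path from the first CLI argument,
// i.e. argv[2] when invoked as `node script.js <path>`.
function readAudioPath(argv) {
  const audioPath = argv[2];
  if (!audioPath) {
    throw new Error("Usage: node scripts/transcribe-direct.js /path/to/my-audio.ogg");
  }
  return audioPath;
}
```

In a real script these would be called as readApiKey(process.env) and readAudioPath(process.argv), so a missing key or path produces a clear message instead of a failed API request.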
Features
Official SDK: Uses the official @google/genai SDK.
Improved Security: No shell commands (ffmpeg/child_process) are used; files are uploaded directly to Gemini through the SDK.
Model: Uses gemini-3.1-flash-preview for high-quality audio transcription.
Security Notes
This skill sends audio data to the Google Gemini API for transcription.
No data is stored locally after processing.
Requires a valid GEMINI_API_KEY with minimal permissions.
Prerequisites
Dependencies must be installed with: npm install @google/genai
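The SDK-based upload described under Features might look roughly like the sketch below. It assumes the @google/genai package and its files.upload and models.generateContent calls. The function names, the extension-to-MIME-type table, and the prompt text are illustrative assumptions, not the skill's actual code.

```javascript
// Assumed mapping from file extension to MIME type; extend as needed.
const AUDIO_MIME_TYPES = {
  ogg: "audio/ogg",
  mp3: "audio/mp3",
  wav: "audio/wav",
  flac: "audio/flac",
  aac: "audio/aac",
};

// Hypothetical helper: derive the MIME type from the file extension.
function audioMimeType(filePath) {
  const ext = filePath.slice(filePath.lastIndexOf(".") + 1).toLowerCase();
  const mimeType = AUDIO_MIME_TYPES[ext];
  if (!mimeType) {
    throw new Error(`Unsupported audio extension: ${ext}`);
  }
  return mimeType;
}

// Hypothetical function: upload the audio file through the SDK, then ask the
// model for a Burmese transcript. No ffmpeg or child_process is involved.
async function transcribeBurmese(filePath, apiKey) {
  // Loaded lazily so the helper above works even without the SDK installed.
  const { GoogleGenAI, createUserContent, createPartFromUri } =
    await import("@google/genai");
  const ai = new GoogleGenAI({ apiKey });

  // Upload the local file directly to Gemini.
  const uploaded = await ai.files.upload({
    file: filePath,
    config: { mimeType: audioMimeType(filePath) },
  });

  // Reference the uploaded file by URI alongside an (assumed) text prompt.
  const response = await ai.models.generateContent({
    model: "gemini-3.1-flash-preview",
    contents: createUserContent([
      createPartFromUri(uploaded.uri, uploaded.mimeType),
      "Transcribe this audio into Burmese text. Return only the transcript.",
    ]),
  });
  return response.text;
}
```

Calling transcribeBurmese("/path/to/my-audio.ogg", process.env.GEMINI_API_KEY) would return the transcript as a string, provided the key is valid and the model name is available to the account.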