🎬 Gemini Video Analyzer — Gemini工具
v1.0.0[AI辅助] Native video analysis using Google Gemini API. Upload and analyze video files — describe scenes, extract text/UI, answer questions about content, transcribe...
详细分析 ▾
运行时依赖
版本
Initial release of Gemini Video Analyzer. - Analyze video files natively using Google Gemini API with 1 FPS multimodal processing. - Supports video scene description, content Q&A, screen text/UI extraction, speech transcription, and object/action identification. - Accepts multiple popular video formats (up to 2GB each). - CLI tools provided for video analysis, content-specific prompts, and file management. - Requires only Python 3, curl, and a Google AI API key for setup.
安装命令 点击复制
技能文档
Analyze videos natively using Google Gemini's multimodal API. No frame extraction needed — Gemini processes video at 1 FPS with full motion, audio, and visual understanding.
Quick 开始
# Analyze a video with default prompt (full description)
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/analyze.py /path/to/video.mp4# Ask a specific question
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/analyze.py /path/to/video.mp4 "What text is visible on screen?"
# Manage uploaded files
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/manage_files.py list
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/manage_files.py cleanup
Supported Formats
MP4, AVI, MOV, MKV, WebM, FLV, MPEG, MPG, WMV, 3GP — up to 2GB per file.
如何 Works
- Video uploads 到 Google's Files API (temporary, auto-deletes 之后 48h)
- Gemini processes 在 1 frame/sec — understands motion, transitions, audio context
- 模型 generates 响应 based 在...上 prompt
- Way better 比 frame extraction 对于 understanding temporal content
使用 Cases
| Task | Example Prompt |
|---|---|
| General description | (default — no prompt needed) |
| UI/text extraction | "What text and UI elements are visible?" |
| Tutorial summary | "Summarize the steps shown in this tutorial" |
| Bug report from video | "Describe what went wrong in this screen recording" |
| Meeting notes | "Summarize the key points discussed" |
| Content comparison | Upload 2 videos, ask for differences |
Configuration
Set GOOGLE_AI_API_KEY in your environment or .env file. Get a free key at aistudio.google.com.
Default model: gemini-2.5-flash (fast, cheap, excellent vision). Override with --model gemini-2.5-pro for complex analysis.
API Reference
See references/gemini-files-api.md for file upload limits, processing details, and advanced options.
免费技能或插件可能存在安全风险,如需更匹配、更安全的方案,建议联系付费定制