首页龙虾技能列表 › Gemini Video Analyzer — Gemini工具

🎬 Gemini Video Analyzer — Gemini工具

v1.0.0

[AI辅助] Native video analysis using Google Gemini API. Upload and analyze video files — describe scenes, extract text/UI, answer questions about content, transcribe...

0· 645·1 当前·1 累计
by @aiwithabidi·MIT-0
下载技能包
License
MIT-0
最后更新
2026/4/12
安全扫描
VirusTotal
可疑
查看报告
OpenClaw
安全
high confidence
The skill is internally coherent: it asks only for a Google AI API key and runs Python scripts that upload videos to Google's Generative Language / Files API and request Gemini analysis as described.
评估建议
This skill appears to do what it says: it uploads videos to Google's Generative Language/Files API and asks Gemini to analyze them. Before installing or running: (1) Be aware that videos will be uploaded off your machine to Google — avoid uploading sensitive footage unless you accept that. (2) Use a restricted API key (limit to the specific project/APIs, set quotas, and rotate or revoke when done) to reduce blast radius if the key is leaked. (3) The declared requirement lists curl though the shi...
详细分析 ▾
用途与能力
Name, description, and included scripts consistently implement video upload + Gemini model analysis against generativelanguage.googleapis.com. The single required credential (GOOGLE_AI_API_KEY) is the expected credential for this purpose. Minor mismatch: the declared required binaries include curl although the provided scripts use only python3/urllib; this is a small inconsistency but not evidence of malicious intent.
指令范围
Runtime instructions and scripts explicitly upload user video files to Google Files API and then call the Gemini model — this is consistent with the stated purpose. Important privacy note: videos (and any text/UI/audio they contain) are transmitted to Google and may be processed server-side and retained per the API (SKILL.md claims ~48h retention). The instructions do not read unrelated files or other environment variables.
安装机制
This is instruction-only plus two Python scripts with no install spec. Nothing is downloaded from third-party URLs during install; risk from installation is low. The scripts perform network calls at runtime (to Google endpoints) which is expected for this skill.
凭证需求
Only GOOGLE_AI_API_KEY is requested and used, which is proportionate to contacting Google's Files/Generative Language APIs. Users should ensure the API key is scoped/restricted (project, API quotas, billing) because it could be used to bill requests or access other Google APIs depending on key permissions. The skill does not request unrelated secrets or config paths.
持久化与权限
The skill is not force-included (always: false) and does not request persistent system-wide privileges or modify other skills. It runs as-invoked and uses only its own scripts and the provided API key.
安全有层次,运行前请审查代码。

License

MIT-0

可自由使用、修改和再分发,无需署名。

运行时依赖

无特殊依赖

版本

latestv1.0.02026/2/16

Initial release of Gemini Video Analyzer. - Analyze video files natively using Google Gemini API with 1 FPS multimodal processing. - Supports video scene description, content Q&A, screen text/UI extraction, speech transcription, and object/action identification. - Accepts multiple popular video formats (up to 2GB each). - CLI tools provided for video analysis, content-specific prompts, and file management. - Requires only Python 3, curl, and a Google AI API key for setup.

● 可疑

安装命令 点击复制

官方npx clawhub@latest install a6-gemini-video-analyzer
镜像加速npx clawhub@latest install a6-gemini-video-analyzer --registry https://cn.clawhub-mirror.com

技能文档

Analyze videos natively using Google Gemini's multimodal API. No frame extraction needed — Gemini processes video at 1 FPS with full motion, audio, and visual understanding.

Quick 开始

# Analyze a video with default prompt (full description)
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/analyze.py /path/to/video.mp4

# Ask a specific question GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/analyze.py /path/to/video.mp4 "What text is visible on screen?"

# Manage uploaded files GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/manage_files.py list GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/manage_files.py cleanup

Supported Formats

MP4, AVI, MOV, MKV, WebM, FLV, MPEG, MPG, WMV, 3GP — up to 2GB per file.

如何 Works

  • Video uploads 到 Google's Files API (temporary, auto-deletes 之后 48h)
  • Gemini processes 在 1 frame/sec — understands motion, transitions, audio context
  • 模型 generates 响应 based 在...上 prompt
  • Way better 比 frame extraction 对于 understanding temporal content

使用 Cases

TaskExample Prompt
General description(default — no prompt needed)
UI/text extraction"What text and UI elements are visible?"
Tutorial summary"Summarize the steps shown in this tutorial"
Bug report from video"Describe what went wrong in this screen recording"
Meeting notes"Summarize the key points discussed"
Content comparisonUpload 2 videos, ask for differences

Configuration

Set GOOGLE_AI_API_KEY in your environment or .env file. Get a free key at aistudio.google.com.

Default model: gemini-2.5-flash (fast, cheap, excellent vision). Override with --model gemini-2.5-pro for complex analysis.

API Reference

See references/gemini-files-api.md for file upload limits, processing details, and advanced options.

数据来源:ClawHub ↗ · 中文优化:龙虾技能库
OpenClaw 技能定制 / 插件定制 / 私有工作流定制

免费技能或插件可能存在安全风险,如需更匹配、更安全的方案,建议联系付费定制

了解定制服务