首页龙虾技能列表 › Gemini Video Analyzer — Gemini工具

🎬 Gemini Video Analyzer — Gemini工具

v1.0.0

[AI辅助] Native video analysis using Google Gemini API. Upload and analyze video files — describe scenes, extract text/UI, answer questions about content, transcribe...

0· 304·0 当前·0 累计
by @aiwithabidi·MIT-0
下载技能包
License
MIT-0
最后更新
2026/4/12
安全扫描
VirusTotal
可疑
查看报告
OpenClaw
安全
high confidence
The skill's code, instructions, and required credential (GOOGLE_AI_API_KEY) align with its stated purpose of uploading videos to Google Gemini for analysis; issues are implementation quality and a small unnecessary binary requirement rather than malicious behavior.
评估建议
This skill appears to do what it says: it uploads videos to Google's generativelanguage Files API and asks Gemini to analyze them. Before installing or using it, consider the following: (1) Privacy: videos are uploaded to Google and may be retained up to ~48 hours — do not upload sensitive or regulated content unless your policy allows it. (2) API key scope: use a minimally privileged API key, monitor/rotate it, and be aware requests may incur costs; test with small files first. (3) Implementati...
详细分析 ▾
用途与能力
Name/description say: upload video and analyze via Google Gemini. The included scripts call generativelanguage.googleapis.com, use the GOOGLE_AI_API_KEY, and perform upload/analysis/cleanup — these are coherent. One minor mismatch: the metadata and requires list python3 and curl, but the shipped scripts only call python (urllib). curl is not used anywhere in SKILL.md or the code, so declaring it as required is unnecessary.
指令范围
SKILL.md and the scripts instruct only to read the user-supplied video file and the declared GOOGLE_AI_API_KEY, upload to Google's Files API, poll for processing, and request analysis. There are no instructions to read unrelated host files, secrets, or to send data to third-party endpoints outside the stated Google API domain. The skill will transmit whole video files to Google's servers (expected for this purpose) and may leave them for up to 48 hours per the docs.
安装机制
This is instruction-only with bundled Python scripts and no install spec — nothing is downloaded from arbitrary URLs and no packages are installed automatically. Risk from install mechanisms is low.
凭证需求
Only the GOOGLE_AI_API_KEY is required (declared as the primary credential), which is appropriate for accessing Google Generative Language Files API. No unrelated credentials or secrets are requested.
持久化与权限
The skill does not request always:true, does not modify other skills or system-wide configs, and is user-invocable. It runs only when invoked and uses the provided API key for network calls — typical and proportionate.
安全有层次,运行前请审查代码。

License

MIT-0

可自由使用、修改和再分发,无需署名。

运行时依赖

无特殊依赖

版本

latestv1.0.02026/3/4

Initial release of gemini-video-analyzer. - Native video analysis using Google Gemini API with support for full scene description, text/UI extraction, object/action identification, and question answering. - Supports multiple video formats (MP4, AVI, MOV, etc.) up to 2GB per file. - Processes videos at 1 FPS with motion, audio, and visual understanding—no manual frame extraction needed. - Includes command-line scripts for analysis, file management, and prompt-based queries. - Requires a Google AI API key; configurable via environment variable. - Suitable for summarizing, extracting information, comparing videos, and analyzing tutorials or walkthroughs.

● 可疑

安装命令 点击复制

官方npx clawhub@latest install gemini-video-analyzer
镜像加速npx clawhub@latest install gemini-video-analyzer --registry https://cn.clawhub-mirror.com

技能文档

Analyze videos natively using Google Gemini's multimodal API. No frame extraction needed — Gemini processes video at 1 FPS with full motion, audio, and visual understanding.

Quick 开始

# Analyze a video with default prompt (full description)
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/analyze.py /path/to/video.mp4

# Ask a specific question GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/analyze.py /path/to/video.mp4 "What text is visible on screen?"

# Manage uploaded files GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/manage_files.py list GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/manage_files.py cleanup

Supported Formats

MP4, AVI, MOV, MKV, WebM, FLV, MPEG, MPG, WMV, 3GP — up to 2GB per file.

如何 Works

  • Video uploads 到 Google's Files API (temporary, auto-deletes 之后 48h)
  • Gemini processes 在 1 frame/sec — understands motion, transitions, audio context
  • 模型 generates 响应 based 在...上 prompt
  • Way better 比 frame extraction 对于 understanding temporal content

使用 Cases

TaskExample Prompt
General description(default — no prompt needed)
UI/text extraction"What text and UI elements are visible?"
Tutorial summary"Summarize the steps shown in this tutorial"
Bug report from video"Describe what went wrong in this screen recording"
Meeting notes"Summarize the key points discussed"
Content comparisonUpload 2 videos, ask for differences

Configuration

Set GOOGLE_AI_API_KEY in your environment or .env file. Get a free key at aistudio.google.com.

Default model: gemini-2.5-flash (fast, cheap, excellent vision). Override with --model gemini-2.5-pro for complex analysis.

API Reference

See references/gemini-files-api.md for file upload limits, processing details, and advanced options.

Credits

Built by M. Abidi · LinkedIn · YouTube · GitHub · Book a Call

数据来源:ClawHub ↗ · 中文优化:龙虾技能库
OpenClaw 技能定制 / 插件定制 / 私有工作流定制

免费技能或插件可能存在安全风险,如需更匹配、更安全的方案,建议联系付费定制

了解定制服务