Gemini Video Analyzer — Gemini工具

Name: Gemini Video Analyzer — Gemini工具
Author: aiwithabidi

aiwithabidi

🎬 Gemini Video Analyzer — Gemini工具

v1.0.0

[AI辅助] Native video analysis using Google Gemini API. Upload and analyze video files — describe scenes, extract text/UI, answer questions about content, transcribe...

0· 304·0 当前·0 累计

by @aiwithabidi·MIT-0

API工具数据分析网络工具文件处理开发工具

下载技能包

License

MIT-0

最后更新

2026/4/12

安全扫描

VirusTotal

可疑

查看报告

OpenClaw

安全

high confidence

The skill's code, instructions, and required credential (GOOGLE_AI_API_KEY) align with its stated purpose of uploading videos to Google Gemini for analysis; issues are implementation quality and a small unnecessary binary requirement rather than malicious behavior.

评估建议

This skill appears to do what it says: it uploads videos to Google's generativelanguage Files API and asks Gemini to analyze them. Before installing or using it, consider the following: (1) Privacy: videos are uploaded to Google and may be retained up to ~48 hours — do not upload sensitive or regulated content unless your policy allows it. (2) API key scope: use a minimally privileged API key, monitor/rotate it, and be aware requests may incur costs; test with small files first. (3) Implementati...

详细分析 ▾

✓ 用途与能力

Name/description say: upload video and analyze via Google Gemini. The included scripts call generativelanguage.googleapis.com, use the GOOGLE_AI_API_KEY, and perform upload/analysis/cleanup — these are coherent. One minor mismatch: the metadata and requires list python3 and curl, but the shipped scripts only call python (urllib). curl is not used anywhere in SKILL.md or the code, so declaring it as required is unnecessary.

✓ 指令范围

SKILL.md and the scripts instruct only to read the user-supplied video file and the declared GOOGLE_AI_API_KEY, upload to Google's Files API, poll for processing, and request analysis. There are no instructions to read unrelated host files, secrets, or to send data to third-party endpoints outside the stated Google API domain. The skill will transmit whole video files to Google's servers (expected for this purpose) and may leave them for up to 48 hours per the docs.

✓ 安装机制

This is instruction-only with bundled Python scripts and no install spec — nothing is downloaded from arbitrary URLs and no packages are installed automatically. Risk from install mechanisms is low.

✓ 凭证需求

Only the GOOGLE_AI_API_KEY is required (declared as the primary credential), which is appropriate for accessing Google Generative Language Files API. No unrelated credentials or secrets are requested.

✓ 持久化与权限

The skill does not request always:true, does not modify other skills or system-wide configs, and is user-invocable. It runs only when invoked and uses the provided API key for network calls — typical and proportionate.

安全有层次，运行前请审查代码。

License

MIT-0

可自由使用、修改和再分发，无需署名。

查看条款 ↗

运行时依赖

无特殊依赖

版本

latestv1.0.02026/3/4

Initial release of gemini-video-analyzer. - Native video analysis using Google Gemini API with support for full scene description, text/UI extraction, object/action identification, and question answering. - Supports multiple video formats (MP4, AVI, MOV, etc.) up to 2GB per file. - Processes videos at 1 FPS with motion, audio, and visual understanding—no manual frame extraction needed. - Includes command-line scripts for analysis, file management, and prompt-based queries. - Requires a Google AI API key; configurable via environment variable. - Suitable for summarizing, extracting information, comparing videos, and analyzing tutorials or walkthroughs.

● 可疑

安装命令点击复制

官方npx clawhub@latest install gemini-video-analyzer

镜像加速npx clawhub@latest install gemini-video-analyzer --registry https://cn.clawhub-mirror.com

技能文档

Analyze videos natively using Google Gemini's multimodal API. No frame extraction needed — Gemini processes video at 1 FPS with full motion, audio, and visual understanding.

Quick 开始

# Analyze a video with default prompt (full description)
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/analyze.py /path/to/video.mp4
# Ask a specific question
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/analyze.py /path/to/video.mp4 "What text is visible on screen?"# Manage uploaded files
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/manage_files.py list
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/manage_files.py cleanup

Supported Formats

MP4, AVI, MOV, MKV, WebM, FLV, MPEG, MPG, WMV, 3GP — up to 2GB per file.

如何 Works

Video uploads 到 Google's Files API (temporary, auto-deletes 之后 48h)
Gemini processes 在 1 frame/sec — understands motion, transitions, audio context
模型 generates 响应 based 在...上 prompt
Way better 比 frame extraction 对于 understanding temporal content

使用 Cases

Task	Example Prompt
General description	(default — no prompt needed)
UI/text extraction	`"What text and UI elements are visible?"`
Tutorial summary	`"Summarize the steps shown in this tutorial"`
Bug report from video	`"Describe what went wrong in this screen recording"`
Meeting notes	`"Summarize the key points discussed"`
Content comparison	Upload 2 videos, ask for differences

Configuration

Set GOOGLE_AI_API_KEY in your environment or .env file. Get a free key at aistudio.google.com.

Default model: gemini-2.5-flash (fast, cheap, excellent vision). Override with --model gemini-2.5-pro for complex analysis.

API Reference

See references/gemini-files-api.md for file upload limits, processing details, and advanced options.

Credits

Built by M. Abidi · LinkedIn · YouTube · GitHub · Book a Call

Analyze videos natively using Google Gemini's multimodal API. No frame extraction needed — Gemini processes video at 1 FPS with full motion, audio, and visual understanding.

Quick Start

# Analyze a video with default prompt (full description)
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/analyze.py /path/to/video.mp4
# Ask a specific question
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/analyze.py /path/to/video.mp4 "What text is visible on screen?"# Manage uploaded files
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/manage_files.py list
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/manage_files.py cleanup

Supported Formats

MP4, AVI, MOV, MKV, WebM, FLV, MPEG, MPG, WMV, 3GP — up to 2GB per file.

How It Works

Video uploads to Google's Files API (temporary, auto-deletes after 48h)
Gemini processes at 1 frame/sec — understands motion, transitions, audio context
Model generates response based on your prompt
Way better than frame extraction for understanding temporal content

Use Cases

Task	Example Prompt
General description	(default — no prompt needed)
UI/text extraction	`"What text and UI elements are visible?"`
Tutorial summary	`"Summarize the steps shown in this tutorial"`
Bug report from video	`"Describe what went wrong in this screen recording"`
Meeting notes	`"Summarize the key points discussed"`
Content comparison	Upload 2 videos, ask for differences

Configuration

Set GOOGLE_AI_API_KEY in your environment or .env file. Get a free key at aistudio.google.com.

Default model: gemini-2.5-flash (fast, cheap, excellent vision). Override with --model gemini-2.5-pro for complex analysis.

API Reference

See references/gemini-files-api.md for file upload limits, processing details, and advanced options.

Credits

Built by M. Abidi · LinkedIn · YouTube · GitHub · Book a Call

数据来源：ClawHub ↗ · 中文优化：龙虾技能库

OpenClaw 技能定制 / 插件定制 / 私有工作流定制

免费技能或插件可能存在安全风险，如需更匹配、更安全的方案，建议联系付费定制

了解定制服务

License

运行时依赖

版本

安装命令 点击复制

技能文档

Quick 开始

Supported Formats

如何 Works

使用 Cases

Configuration

API Reference

Credits

Quick Start

Supported Formats

How It Works

Use Cases

Configuration

API Reference

Credits

安装命令点击复制