📦 Aliyun Qwen Omni — 技能工具

v1.0.0

[自动翻译] Use when tasks require all-in-one multimodal understanding or generation with Alibaba Cloud Model Studio Qwen Omni models, including image-plus-audio ...

0· 75·0 当前·0 累计
by @cinience·MIT-0
下载技能包
License
MIT-0
最后更新
2026/4/1
0
安全扫描
VirusTotal
无害
查看报告
OpenClaw
安全
high confidence
This skill is a small, instruction-only helper that only prepares a local JSON request payload for Alibaba's Qwen Omni models; it does not contact external services or request credentials.
评估建议
This skill is essentially a small template that writes an example request.json for Qwen Omni — it does not call Alibaba Cloud or handle credentials. If you expected a full integration that sends requests to Model Studio, note you'll need additional code and valid Alibaba Cloud credentials (and should verify any such code comes from a trusted source). It's safe to install from a security perspective, but its utility is limited unless you or the maintainer add an actual API client and credential h...
详细分析 ▾
用途与能力
The name/description advertises full multimodal integration with Alibaba Cloud Qwen Omni, but the shipped files only produce a local example request payload and an agent prompt. No network calls, SDKs, or credential handling are included — so the skill is a lightweight payload/template helper rather than a full provider integration.
指令范围
Runtime instructions only compile the included Python script and run it to write a JSON payload to disk. The instructions do not read other system files, environment variables, or transmit data externally.
安装机制
There is no install specification; the skill is instruction-only plus a tiny script. Nothing is downloaded or written beyond the simple generated JSON file.
凭证需求
The skill requests no environment variables, credentials, or config paths. Note: agents/openai.yaml contains a prompt string referencing "$alicloud-ai-multimodal-qwen-omni" but this is a prompt template, not an actual required env var.
持久化与权限
The skill does not request always:true and contains no code that modifies agent/system configuration. It is user-invocable and can be run by the agent, which is normal.
安全有层次,运行前请审查代码。

License

MIT-0

可自由使用、修改和再分发,无需署名。

运行时依赖

无特殊依赖

版本

latestv1.0.02026/4/1

Initial release of aliyun-qwen-omni skill. - Provides access to Alibaba Cloud Model Studio Qwen Omni models for multimodal understanding and generation. - Supports image, audio, and text input, with options for realtime agent and voice assistant use cases. - Validates with a Python script compilation check. - Includes a standardized interface (`omni.chat`) for consistent requests and responses. - Allows selection from supported Qwen Omni model variants.

无害

安装命令

点击复制
官方npx clawhub@latest install aliyun-qwen-omni
镜像加速npx clawhub@latest install aliyun-qwen-omni --registry https://cn.longxiaskill.com

技能文档

Category: provider

# Model Studio Qwen Omni

Validation

mkdir -p output/aliyun-qwen-omni
python -m py_compile skills/ai/multimodal/aliyun-qwen-omni/scripts/prepare_omni_request.py && echo "py_compile_ok" > output/aliyun-qwen-omni/validate.txt

Pass criteria: command exits 0 and output/aliyun-qwen-omni/validate.txt is generated.

Critical model names

Use one of these exact model strings:

  • qwen3-omni-flash
  • qwen3-omni-flash-realtime
  • qwen-omni-turbo
  • qwen-omni-turbo-realtime

Typical use

  • Image + audio + text assistant
  • Realtime multimodal agents
  • Spoken responses grounded in visual input

Normalized interface (omni.chat)

Request

  • model (string, optional): default qwen3-omni-flash
  • text (string, optional)
  • image (string, optional)
  • audio (string, optional)
  • response_modalities (array, optional): e.g. ["text"], ["text","audio"]

Response

  • text (string, optional)
  • audio_url or audio_chunk (optional)
  • usage (object, optional)

Quick start

python skills/ai/multimodal/aliyun-qwen-omni/scripts/prepare_omni_request.py \
  --output output/aliyun-qwen-omni/request.json

References

  • references/sources.md
数据来源ClawHub ↗ · 中文优化:龙虾技能库