龙虾技能库

技能插件模型教程下载加速定制

技能插件模型教程下载加速定制

Token供应商

ToAPIs · 量大好价

新手入门下载 OpenClaw 技能/插件定制服务常见问题加速服务

邮箱：longxiaskill@qq.com ｜ AI 智能体可直接发送定制需求到邮箱

免责声明 | 隐私政策 | 鄂ICP备19007528号

龙虾技能库 — OpenClaw 中文 AI 资源库 | 免费资源 + 付费定制

© 2026 龙虾技能库

首页 › 技能列表

OpenClaw AI 技能

发现优质 AI 技能，一键安装提升效率。

热门搜索

今日头条 a-stock-data 微信抖音网络搜索小红书

标签

全部开发工具 AI模型访问代码生成系统工具自动化网络工具数据分析生产力工具 API工具浏览器自动化文件处理安全文档工具数据库 API开发 CI/CD 数据与API 微信数据可视化智能体 DevOps 云服务设计工具测试工具工作流存储部署办公协作即时通讯图像处理钉钉加密视频处理金融工具加密货币通信工具区块链教育学习监控告警数据处理容器与虚拟化邮件服务 Web3 营销工具金融科技 MCP工具操作系统命令行工具飞书项目管理音频处理

排序：最多下载最近更新最高星标

tao-train-oneformerv1.0.8

OneFormer for universal image segmentation. Unifies panoptic, instance, and semantic segmentation with a single architecture using task-conditioned qu

0 1 0by @nvidia

tao-train-ocrnetv?

OCRNet for scene text recognition. Recognizes text content from cropped text-region images and supports CTC and attention-based decoders. Use when tra

0 1 0by @nvidia

tao-train-ocdnetv?

OCDNet for scene text detection. Detects arbitrary-oriented text regions in natural images using a differentiable binarization approach. Use when trai

0 1 0by @nvidia

tao-train-nvpanoptix3dv?

NVPanoptix3D for panoptic 3D scene reconstruction from posed RGB images. Produces 3D panoptic segmentation (semantic, instance, and panoptic masks) wi

0 1 0by @nvidia

tao-train-nvdinov2v?

NVDINOv2 for self-supervised visual representation learning. Trains vision transformers via self-distillation (teacher-student) without labels and pro

0 1 0by @nvidia

tao-train-metric-learning-recognitionv?

Metric-learning recognition (ml-recog) for fine-grained visual recognition. Learns embeddings for retrieval-based matching (e.g., retail product recog

0 1 0by @nvidia

tao-train-mask2formerv1.0.8

Mask2Former for universal image segmentation (panoptic, instance, and semantic). Transformer-based with masked attention for high-quality segmentation

0 1 0by @nvidia

tao-train-mask-grounding-dinov?

Mask Grounding DINO for grounded instance segmentation. Extends Grounding DINO with a mask-prediction head for open-set segmentation guided by text pr

0 1 0by @nvidia

tao-train-mask-auto-labelv?

MAL (Mask Auto-Label) for weakly-supervised segmentation. Produces segmentation masks from minimal annotations (point or box annotations) using a ViT-

0 1 0by @nvidia

tao-train-mask-auto-encoderv?

Masked Auto-Encoder (MAE) for self-supervised pretraining and fine-tuning. Masks random patches and reconstructs them to learn visual representations;

0 1 0by @nvidia

tao-train-image-classificationv?

PyTorch-based TAO image classification. Supports a wide range of backbones (FAN, EfficientNet, ResNet, etc.) with distillation and quantization for de

0 1 0by @nvidia

tao-train-grounding-dinov?

Grounding DINO for open-set object detection. Combines DINO-style detection with a BERT text encoder for language-guided detection — detects objects d

0 1 0by @nvidia

tao-train-foundation-stereov2

Stereo depth estimation using FoundationStereo. Predicts disparity maps from stereo image pairs for 3D reconstruction. Use when training, evaluating,

0 1 0by @nvidia

tao-train-fast-foundation-stereov?

Real-time stereo depth estimation using FastFoundationStereo (FFS), the distilled bp2 commercial variant of FoundationStereo. Predicts disparity maps

0 1 0by @nvidia

tao-train-dinov?

DINO (DETR with Improved DeNoising Anchor Boxes) for 2D object detection. Transformer-based detector with denoising training, multi-scale features, an

0 1 0by @nvidia

tao-train-depth-anything-v2v2

Monocular depth estimation using Metric Depth Anything v2 or Relative Depth Anything architectures. Predicts per-pixel depth from single RGB images. U

0 1 0by @nvidia

tao-train-deformable-detrv?

Deformable DETR for 2D object detection. Uses deformable attention for efficient multi-scale feature processing, lighter than DINO with competitive ac

0 1 0by @nvidia

tao-train-centerposev?

CenterPose for keypoint / pose estimation. Detects object centers and regresses keypoint locations for 6-DoF object pose estimation. Use when training

0 1 0by @nvidia

tao-train-bevfusionv?

BEVFusion for multi-sensor 3D object detection. Fuses LiDAR point clouds and camera images in bird's-eye-view (BEV) space, used in autonomous driving

0 1 0by @nvidia

tao-train-action-recognitionv?

Action recognition from video sequences. Supports RGB, optical flow, and joint (multi-stream) input types for classifying temporal actions in video cl

0 1 0by @nvidia

←473 474 475 476 477 478 479 480 481 482 →

OpenClaw 技能定制 / 插件定制 / 私有工作流定制

免费技能或插件可能存在安全风险，如需更匹配、更安全的方案，建议联系付费定制

了解定制服务