龙虾技能库 (Lobster Skill Library)
龙虾技能库: a Chinese-language AI resource library for OpenClaw | free resources + paid customization
© 2026 龙虾技能库

OpenClaw AI Skills

Discover high-quality AI skills and install them with one click to boost productivity.

tao-train-oneformer v1.0.8
OneFormer for universal image segmentation. Unifies panoptic, instance, and semantic segmentation with a single architecture using task-conditioned qu
0 1 0 by @nvidia
tao-train-ocrnet v?
OCRNet for scene text recognition. Recognizes text content from cropped text-region images and supports CTC and attention-based decoders. Use when tra
0 1 0 by @nvidia
tao-train-ocdnet v?
OCDNet for scene text detection. Detects arbitrary-oriented text regions in natural images using a differentiable binarization approach. Use when trai
0 1 0 by @nvidia
tao-train-nvpanoptix3d v?
NVPanoptix3D for panoptic 3D scene reconstruction from posed RGB images. Produces 3D panoptic segmentation (semantic, instance, and panoptic masks) wi
0 1 0 by @nvidia
tao-train-nvdinov2 v?
NVDINOv2 for self-supervised visual representation learning. Trains vision transformers via self-distillation (teacher-student) without labels and pro
0 1 0 by @nvidia
tao-train-metric-learning-recognition v?
Metric-learning recognition (ml-recog) for fine-grained visual recognition. Learns embeddings for retrieval-based matching (e.g., retail product recog
0 1 0 by @nvidia
tao-train-mask2former v1.0.8
Mask2Former for universal image segmentation (panoptic, instance, and semantic). Transformer-based with masked attention for high-quality segmentation
0 1 0 by @nvidia
tao-train-mask-grounding-dino v?
Mask Grounding DINO for grounded instance segmentation. Extends Grounding DINO with a mask-prediction head for open-set segmentation guided by text pr
0 1 0 by @nvidia
tao-train-mask-auto-label v?
MAL (Mask Auto-Label) for weakly-supervised segmentation. Produces segmentation masks from minimal annotations (point or box annotations) using a ViT-
0 1 0 by @nvidia
tao-train-mask-auto-encoder v?
Masked Auto-Encoder (MAE) for self-supervised pretraining and fine-tuning. Masks random patches and reconstructs them to learn visual representations;
0 1 0 by @nvidia
tao-train-image-classification v?
PyTorch-based TAO image classification. Supports a wide range of backbones (FAN, EfficientNet, ResNet, etc.) with distillation and quantization for de
0 1 0 by @nvidia
tao-train-grounding-dino v?
Grounding DINO for open-set object detection. Combines DINO-style detection with a BERT text encoder for language-guided detection: detects objects d
0 1 0 by @nvidia
tao-train-foundation-stereo v2
Stereo depth estimation using FoundationStereo. Predicts disparity maps from stereo image pairs for 3D reconstruction. Use when training, evaluating,
0 1 0 by @nvidia
tao-train-fast-foundation-stereo v?
Real-time stereo depth estimation using FastFoundationStereo (FFS), the distilled bp2 commercial variant of FoundationStereo. Predicts disparity maps
0 1 0 by @nvidia
tao-train-dino v?
DINO (DETR with Improved DeNoising Anchor Boxes) for 2D object detection. Transformer-based detector with denoising training, multi-scale features, an
0 1 0 by @nvidia
tao-train-depth-anything-v2 v2
Monocular depth estimation using Metric Depth Anything v2 or Relative Depth Anything architectures. Predicts per-pixel depth from single RGB images. U
0 1 0 by @nvidia
tao-train-deformable-detr v?
Deformable DETR for 2D object detection. Uses deformable attention for efficient multi-scale feature processing, lighter than DINO with competitive ac
0 1 0 by @nvidia
tao-train-centerpose v?
CenterPose for keypoint / pose estimation. Detects object centers and regresses keypoint locations for 6-DoF object pose estimation. Use when training
0 1 0 by @nvidia
tao-train-bevfusion v?
BEVFusion for multi-sensor 3D object detection. Fuses LiDAR point clouds and camera images in bird's-eye-view (BEV) space, used in autonomous driving
0 1 0 by @nvidia
tao-train-action-recognition v?
Action recognition from video sequences. Supports RGB, optical flow, and joint (multi-stream) input types for classifying temporal actions in video cl
0 1 0 by @nvidia