龙虾技能库
技能
插件
模型
教程
下载
加速
定制
登录
技能
插件
模型
教程
下载
加速
定制
首页
›
技能列表
OpenClaw AI 技能
发现优质 AI 技能,一键安装提升效率。
搜索
热门搜索
今日头条
a-stock-data
抖音
网络搜索
标签
全部
开发工具
AI模型访问
代码生成
系统工具
自动化
网络工具
数据分析
生产力工具
API工具
浏览器自动化
文件处理
安全
文档工具
数据库
API开发
CI/CD
数据与API
微信
数据可视化
智能体
DevOps
云服务
设计工具
测试工具
工作流
存储部署
办公协作
即时通讯
图像处理
钉钉
加密
视频处理
金融工具
加密货币
通信工具
区块链
教育学习
监控告警
数据处理
容器与虚拟化
邮件服务
Web3
营销工具
金融科技
MCP工具
操作系统
命令行工具
飞书
项目管理
音频处理
排序:
最多下载
最近更新
最高星标
nemo-mbridge-perf-moe-dispatcher-selection
v?
Choose the right MoE token dispatcher (`alltoall`, DeepEP, or HybridEP) for the hardware, EP degree, and optimization stage. Summarizes patterns from
0
0
0
by @nvidia
nemo-mbridge-perf-moe-comm-overlap
v1
MoE expert-parallel communication overlap in Megatron Bridge. Covers dispatch/combine overlap, flex dispatcher backends, and expert wgrad scheduling.
0
0
0
by @nvidia
nemo-mbridge-perf-memory-tuning
v?
Techniques for reducing peak GPU memory in Megatron Bridge — expandable segments, parallelism resizing, activation recompute, CPU offloading constrain
0
0
0
by @nvidia
nemo-mbridge-perf-megatron-fsdp
v?
Operational guide for enabling Megatron FSDP in Megatron-Bridge, including config knobs, code anchors, pitfalls, and verification.
0
0
0
by @nvidia
nemo-mbridge-perf-hierarchical-context-parallel
v?
Operational guide for enabling hierarchical context parallelism in Megatron-Bridge, including config knobs, code anchors, pitfalls, and verification.
0
0
0
by @nvidia
nemo-mbridge-perf-expert-parallel-overlap
v1
Validate and use MoE expert-parallel communication overlap in Megatron-Bridge, including overlap_moe_expert_parallel_comm, delay_wgrad_compute, and fl
0
0
0
by @nvidia
nemo-mbridge-perf-cuda-graphs
v?
Validate and use CUDA graph capture in Megatron Bridge, including local full-iteration graphs and Transformer Engine scoped graphs for attention, MLP,
0
0
0
by @nvidia
nemo-mbridge-perf-cpu-offloading
v?
Validate and use CPU offloading in Megatron Bridge, including layer-level activation offloading and fractional optimizer state offloading with HybridD
0
0
0
by @nvidia
nemo-mbridge-perf-activation-recompute
v?
Validate and use selective and full activation recompute in Megatron Bridge to reduce GPU memory usage at the cost of extra compute.
0
0
0
by @nvidia
nemo-mbridge-multi-node-slurm
v?
Convert single-node scripts to multi-node Slurm sbatch jobs and debug common multi-node failures. Covers srun-native vs uv run torch.distributed appro
0
0
0
by @nvidia
nemo-mbridge-mlm-bridge-training
v?
Run Megatron-LM (MLM) and Megatron Bridge training with mock or real data. Covers correlation testing, available recipes, and multi-GPU examples.
0
0
0
by @nvidia
nemo-evaluator-plugin
v?
Use when working on the Evaluator plugin CLI, jobs, SDK-backed specs, metric types, or plugin-owned Evaluator skills.
0
0
0
by @nvidia
nemo-data-designer-plugin
v?
Use when the user wants to create a dataset, generate synthetic data, or build a data generation pipeline.
0
0
0
by @nvidia
nemo-automodel-recipe-development
v?
Create and modify NeMo AutoModel training and evaluation recipes, including YAML structure, builders, and execution flow.
0
0
0
by @nvidia
nemo-automodel-model-onboarding
v?
Guide for onboarding new model architectures into NeMo AutoModel, including architecture discovery, implementation patterns, registration, and validat
0
0
0
by @nvidia
nemo-automodel-launcher-config
v?
Configure NeMo AutoModel job launches for interactive runs, Slurm clusters, and SkyPilot cloud execution.
0
0
0
by @nvidia
nemo-automodel-distributed-training
v?
Guide for selecting and configuring distributed training strategies in NeMo AutoModel, including FSDP2, Megatron FSDP, DDP, and parallelism settings.
0
0
0
by @nvidia
mcore-testing
v?
Test system for Megatron-LM. Covers test layout, recipe YAML structure, adding and running unit and functional tests, golden values, marker filters, a
0
0
0
by @nvidia
mcore-split-pr
v?
Split a PR into multiple PRs to reduce the number of required CODEOWNERS reviewer groups.
0
0
0
by @nvidia
mcore-run-on-slurm
v?
How to launch distributed Megatron-LM training jobs on a SLURM cluster. Covers a minimal sbatch skeleton, environment-variable setup for torch.distrib
0
0
0
by @nvidia
←
525
526
527
528
529
530
531
532
533
534
→
OpenClaw 技能定制 / 插件定制 / 私有工作流定制
免费技能或插件可能存在安全风险,如需更匹配、更安全的方案,建议联系付费定制
了解定制服务