deepspeed
Expert guidance for distributed training with DeepSpeed: ZeRO optimization stages, pipeline parallelism, FP16/BF16/FP8, 1-bit Adam, sparse attention
SKILL.md
Comprehensive assistance with DeepSpeed development, generated from the official documentation.
Pattern 1: DeepNVMe. Contents: Requirements; Creating DeepNVMe Handles; Using DeepNVMe Handles; Blocking File Write; Non-Blocking File Write; Parallel File Write; Pinned Tensors; Putting it together; Acknowledgements; Appendix; Advanced Handle Creation; Performance Tuning; DeepNVMe APIs; General I/O APIs; GDS-specific APIs; Handle Settings APIs. This tutorial will show how to use DeepNVMe for data transfers between persistent stora...
Pattern 2: Mixture of Experts for NLG models. Contents: 1. Installation; 2. Training NLG+MoE models; 2.1. Changes to the model; 2.2. Pre-training the Standard MoE model; 2.3. Pre-training the PR-MoE model; 2.4. Training MoS with reduced model size. In this tutorial, we introduce how to apply DeepSpeed Mixture of Experts (MoE) to NLG models, which reduces the training cost by 5 times and reduces the MoE model size by 3 time...
Citable information
Stable fields and commands prepared for search and AI citation.
- Install command
npx skills add https://github.com/ovachiever/droid-tings --skill deepspeed
- Category
- Developer tools
- Verified
- ✓
- Date listed
- 2026-02-01
- Last updated
- 2026-02-18
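Quick answers
What is deepspeed?
Expert guidance for distributed training with DeepSpeed: ZeRO optimization stages, pipeline parallelism, FP16/BF16/FP8, 1-bit Adam, sparse attention. Source: ovachiever/droid-tings.
How do I install deepspeed?
Open your terminal or command-line tool (such as Terminal, iTerm, or Windows Terminal), then copy and run the following command: npx skills add https://github.com/ovachiever/droid-tings --skill deepspeed. Once installation completes, the skill is configured automatically in your AI coding environment and can be used in Claude Code or Cursor.
Where is the source code for this skill?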
https://github.com/ovachiever/droid-tings
Details
- Category
- Developer tools
- Source
- skills.sh
- Date listed
- 2026-02-01