dataset curator
✓整理和清理訓練數據集以實現高質量的機器學習
SKILL.md
The Dataset Curator skill guides you through the critical process of preparing high-quality training data for machine learning models. Data quality is the single most important factor in model performance, yet it is often underinvested. This skill helps you systematically clean, validate, augment, and maintain datasets that lead to better models.
From initial collection to ongoing maintenance, this skill covers deduplication, label quality assessment, bias detection, augmentation strategies, and version control. It applies best practices from production ML systems to ensure your datasets are not just clean, but strategically optimized for your learning objectives.
Whether you are building a classifier, fine-tuning an LLM, or training a custom model, this skill ensures your data foundation is solid.
整理和清理訓練數據集以實現高質量的機器學習 來源:eddiebe147/claude-settings。
可引用資訊
為搜尋與 AI 引用準備的穩定欄位與指令。
- 安裝指令
npx skills add https://github.com/eddiebe147/claude-settings --skill dataset curator- 分類
- </>開發工具
- 認證
- ✓
- 收錄時間
- 2026-02-01
- 更新時間
- 2026-02-18
快速解答
什麼是 dataset curator?
整理和清理訓練數據集以實現高質量的機器學習 來源:eddiebe147/claude-settings。
如何安裝 dataset curator?
開啟你的終端機或命令列工具(如 Terminal、iTerm、Windows Terminal 等) 複製並執行以下指令:npx skills add https://github.com/eddiebe147/claude-settings --skill dataset curator 安裝完成後,技能將自動設定到你的 AI 程式設計環境中,可以在 Claude Code 或 Cursor 中使用
這個 Skill 的原始碼在哪?
https://github.com/eddiebe147/claude-settings
詳情
- 分類
- </>開發工具
- 來源
- skills.sh
- 收錄時間
- 2026-02-01