
dataset curator

eddiebe147/claude-settings

Curate and clean training datasets for high-quality machine learning

45 installs · 0 popularity · @eddiebe147

Install

$ npx skills add https://github.com/eddiebe147/claude-settings --skill dataset curator

SKILL.md

The Dataset Curator skill guides you through preparing high-quality training data for machine learning models. Data quality is among the most important factors in model performance, yet it often receives too little investment. This skill helps you systematically clean, validate, augment, and maintain datasets that lead to better models.

From initial collection to ongoing maintenance, this skill covers deduplication, label quality assessment, bias detection, augmentation strategies, and version control. It applies best practices from production ML systems to ensure your datasets are not just clean, but strategically optimized for your learning objectives.
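
As a rough illustration of what deduplication, label quality assessment, and a basic class-balance check can look like, here is a minimal Python sketch. It is not taken from the skill itself: the `(text, label)` record schema and the `normalize` and `curate` helpers are hypothetical names chosen for this example, and only exact duplicates (after lowercasing and whitespace collapsing) are detected.

```python
import hashlib
from collections import Counter, defaultdict


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivially different copies match."""
    return " ".join(text.lower().split())


def curate(records):
    """Drop exact duplicates and report label conflicts and class counts.

    `records` is an iterable of (text, label) pairs. This schema is an
    assumption made for illustration, not something the skill prescribes.
    """
    labels_by_hash = defaultdict(set)  # content hash -> every label seen for it
    first_seen = {}                    # content hash -> first (text, label) record

    for text, label in records:
        key = hashlib.sha256(normalize(text).encode("utf-8")).hexdigest()
        labels_by_hash[key].add(label)
        first_seen.setdefault(key, (text, label))

    # The same normalized text with more than one label is a label conflict.
    conflicts = [
        first_seen[key][0]
        for key, labels in labels_by_hash.items()
        if len(labels) > 1
    ]

    # Keep one copy of each consistently labelled example.
    kept = [
        record
        for key, record in first_seen.items()
        if len(labels_by_hash[key]) == 1
    ]

    # Class counts give a first, coarse view of label imbalance.
    class_counts = Counter(label for _, label in kept)
    return kept, conflicts, class_counts
```

Calling `curate` on a list of pairs returns the deduplicated records, the texts whose copies disagree on the label (candidates for manual review), and a per-class count. Near-duplicate detection, augmentation, and dataset versioning are outside the scope of this sketch.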

Whether you are building a classifier, fine-tuning an LLM, or training a custom model, this skill ensures your data foundation is solid.


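Citable information

Stable fields and commands prepared for search and AI citation.

Install command
npx skills add https://github.com/eddiebe147/claude-settings --skill dataset curator
Category
Developer tools
Certification
Date indexed
2026-02-01
Last updated
2026-02-18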

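Quick answers

What is dataset curator?

Curate and clean training datasets for high-quality machine learning. Source: eddiebe147/claude-settings.

How do I install dataset curator?

Open your terminal or command-line tool (such as Terminal, iTerm, or Windows Terminal). Copy and run the following command: npx skills add https://github.com/eddiebe147/claude-settings --skill dataset curator. Once installation completes, the skill is configured automatically in your AI coding environment and can be used in Claude Code or Cursor.

Where is the source code for this skill?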

https://github.com/eddiebe147/claude-settings