
dataset curator

Curate and clean training datasets for high-quality machine learning

9 installs · 0 popularity · @jmsktm

Install

$ npx skills add https://github.com/jmsktm/claude-settings --skill dataset curator

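How to install dataset curator

Install the dataset curator AI skill into your development environment from the command line.

  1. Open a terminal: Open your terminal or command-line tool (such as Terminal, iTerm, or Windows Terminal).
  2. Run the install command: Copy and run the following command: npx skills add https://github.com/jmsktm/claude-settings --skill dataset curator
  3. Verify the installation: Once installation completes, the skill is configured automatically in your AI coding environment and can be used in Claude Code, Cursor, or OpenClaw.

Source: jmsktm/claude-settings.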

SKILL.md


The Dataset Curator skill guides you through the critical process of preparing high-quality training data for machine learning models. Data quality is often the single most important factor in model performance, yet teams frequently underinvest in it. This skill helps you systematically clean, validate, augment, and maintain datasets that lead to better models.

From initial collection to ongoing maintenance, this skill covers deduplication, label quality assessment, bias detection, augmentation strategies, and version control. It applies best practices from production ML systems to ensure your datasets are not just clean, but strategically optimized for your learning objectives.

Whether you are building a classifier, fine-tuning an LLM, or training a custom model, this skill ensures your data foundation is solid.
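
Two of the steps named above, deduplication and label quality assessment, can be illustrated with a minimal sketch. This is not taken from the skill itself: the function names `normalize` and `curate`, the `(text, label)` record layout, and the choice of exact matching after whitespace and case normalization are all assumptions made for illustration. Near-duplicate detection, bias checks, augmentation, and versioning are out of scope here.

```python
import hashlib
import re
from collections import defaultdict


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivially different copies match."""
    return re.sub(r"\s+", " ", text.strip().lower())


def curate(records):
    """Drop duplicates (after normalization) and flag conflicting labels.

    `records` is an iterable of (text, label) pairs; this layout is an
    assumption for the sketch, not something the skill prescribes.
    Returns (unique, conflicts): `unique` keeps the first record seen for
    each normalized text, and `conflicts` maps that first-seen text to the
    set of labels it appeared with when there is more than one.
    """
    labels_by_key = defaultdict(set)
    first_seen = {}
    for text, label in records:
        # Hash the normalized text so keys stay small for long documents.
        key = hashlib.sha256(normalize(text).encode("utf-8")).hexdigest()
        labels_by_key[key].add(label)
        first_seen.setdefault(key, (text, label))

    unique = list(first_seen.values())
    conflicts = {
        first_seen[key][0]: labels
        for key, labels in labels_by_key.items()
        if len(labels) > 1
    }
    return unique, conflicts


if __name__ == "__main__":
    rows = [
        ("Great product", "pos"),
        ("great   product ", "pos"),
        ("Slow shipping", "neg"),
        ("slow shipping", "pos"),
    ]
    unique, conflicts = curate(rows)
    print(unique)
    print(conflicts)
```

Records that appear in `conflicts` are candidates for manual relabeling rather than silent removal, since the sketch cannot tell which label is correct.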


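Citable information

Stable fields and commands prepared for search and AI citation.

Install command
npx skills add https://github.com/jmsktm/claude-settings --skill dataset curator
Category
Developer tools
Certification
Date added
2026-02-25
Last updated
2026-03-11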


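Quick answers

What is dataset curator?

A skill for curating and cleaning training datasets for high-quality machine learning. Source: jmsktm/claude-settings.

How do I install dataset curator?

Open your terminal or command-line tool (such as Terminal, iTerm, or Windows Terminal). Copy and run the following command: npx skills add https://github.com/jmsktm/claude-settings --skill dataset curator. Once installation completes, the skill is configured automatically in your AI coding environment and can be used in Claude Code, Cursor, or OpenClaw.

Where is the source code for this skill?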

https://github.com/jmsktm/claude-settings