·eval-harness
</>

eval-harness

mhylle/claude-skills-collection

Comprehensive evaluation framework for systematic testing, measurement, and quality assurance of AI-assisted implementations. Supports capability evals, regression testing, multiple grader types, and standardized metrics.

2Installs·0Trend·@mhylle

Installation

$npx skills add https://github.com/mhylle/claude-skills-collection --skill eval-harness

SKILL.md

Evaluation-driven development (EDD) is a methodology where evaluations are defined before or alongside implementation, ensuring that success criteria are explicit, measurable, and testable from the start.

Purpose: Verify that a new capability works correctly. Capability evals test whether the system can do something it couldn't do before, or does something better than before.

Purpose: Verify that existing functionality still works after changes. Regression evals protect against unintended breakage.

Comprehensive evaluation framework for systematic testing, measurement, and quality assurance of AI-assisted implementations. Supports capability evals, regression testing, multiple grader types, and standardized metrics. Source: mhylle/claude-skills-collection.

View raw

Facts (cite-ready)

Stable fields and commands for AI/search citations.

Install command
npx skills add https://github.com/mhylle/claude-skills-collection --skill eval-harness
Category
</>Dev Tools
Verified
First Seen
2026-02-01
Updated
2026-02-18

Quick answers

What is eval-harness?

Comprehensive evaluation framework for systematic testing, measurement, and quality assurance of AI-assisted implementations. Supports capability evals, regression testing, multiple grader types, and standardized metrics. Source: mhylle/claude-skills-collection.

How do I install eval-harness?

Open your terminal or command line tool (Terminal, iTerm, Windows Terminal, etc.) Copy and run this command: npx skills add https://github.com/mhylle/claude-skills-collection --skill eval-harness Once installed, the skill will be automatically configured in your AI coding environment and ready to use in Claude Code or Cursor

Where is the source repository?

https://github.com/mhylle/claude-skills-collection

Details

Category
</>Dev Tools
Source
skills.sh
First Seen
2026-02-01