·evaluating-skills-with-models
</>

evaluating-skills-with-models

taisukeoe/agentic-ai-skills-creator

Evaluate skills by executing them across sonnet, opus, and haiku models using sub-agents. Use when testing if a skill works correctly, comparing model performance, or finding the cheapest compatible model. Returns numeric scores (0-100) to differentiate model capabilities.

9Installs·0Trend·@taisukeoe

Installation

$npx skills add https://github.com/taisukeoe/agentic-ai-skills-creator --skill evaluating-skills-with-models

SKILL.md

Evaluate skills across multiple Claude models using sub-agents with quality-based scoring.

Binary pass/fail ("did it do X?") fails to differentiate models - all models can "do the steps." The difference is how well they do them. This skill uses weighted scoring to reveal capability differences.

Default to difficult scenarios: When multiple scenarios exist, prioritize Hard or Medium difficulty scenarios for evaluation. Easy scenarios often don't show meaningful differences between models and aren't realistic for production use.

Evaluate skills by executing them across sonnet, opus, and haiku models using sub-agents. Use when testing if a skill works correctly, comparing model performance, or finding the cheapest compatible model. Returns numeric scores (0-100) to differentiate model capabilities. Source: taisukeoe/agentic-ai-skills-creator.

View raw

Facts (cite-ready)

Stable fields and commands for AI/search citations.

Install command
npx skills add https://github.com/taisukeoe/agentic-ai-skills-creator --skill evaluating-skills-with-models
Category
</>Dev Tools
Verified
First Seen
2026-02-01
Updated
2026-02-18

Quick answers

What is evaluating-skills-with-models?

Evaluate skills by executing them across sonnet, opus, and haiku models using sub-agents. Use when testing if a skill works correctly, comparing model performance, or finding the cheapest compatible model. Returns numeric scores (0-100) to differentiate model capabilities. Source: taisukeoe/agentic-ai-skills-creator.

How do I install evaluating-skills-with-models?

Open your terminal or command line tool (Terminal, iTerm, Windows Terminal, etc.) Copy and run this command: npx skills add https://github.com/taisukeoe/agentic-ai-skills-creator --skill evaluating-skills-with-models Once installed, the skill will be automatically configured in your AI coding environment and ready to use in Claude Code or Cursor

Where is the source repository?

https://github.com/taisukeoe/agentic-ai-skills-creator