What is pydantic-evals?
Test and evaluate AI agents and LLM outputs using a code-first, strongly typed evaluation framework. Source: fuenfgeld/pydantic-ai-skills.

Use when the user wants to:

1. Create evaluation datasets with test cases for AI agents
2. Define evaluators (deterministic, LLM-as-Judge, custom, or span-based)
3. Run evaluations and generate reports
4. Compare model performance across experiments
5. Integrate evaluations with Pydantic AI agents
6. Set up observability with Logfire
7. Generate test datasets using LLMs
8. Implement regression testing for AI systems

Sketches of the core workflow (items 1–3) and the agent/observability integration (items 5–6) follow this list.
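A minimal sketch of the core workflow, based on the documented pydantic-evals API: a `Case` pairs inputs with an expected output, a `Dataset` bundles cases with evaluators (here a custom deterministic evaluator plus the built-in `LLMJudge`), and `evaluate_sync` runs the task against every case. The task `answer_question` is a hypothetical stand-in for the system under test.

```python
from pydantic_evals import Case, Dataset
from pydantic_evals.evaluators import Evaluator, EvaluatorContext, LLMJudge

# A test case pairs inputs with an expected output and optional metadata.
case = Case(
    name='capital_question',
    inputs='What is the capital of France?',
    expected_output='Paris',
    metadata={'difficulty': 'easy'},
)


# A deterministic custom evaluator: subclass Evaluator and implement evaluate().
class ExactMatch(Evaluator):
    def evaluate(self, ctx: EvaluatorContext[str, str]) -> float:
        return 1.0 if ctx.output == ctx.expected_output else 0.0


dataset = Dataset(
    cases=[case],
    evaluators=[
        ExactMatch(),
        # LLM-as-Judge evaluator scored against a natural-language rubric.
        LLMJudge(rubric='The answer names the correct city.'),
    ],
)


# Hypothetical task under test; in practice this calls your agent or model.
async def answer_question(question: str) -> str:
    return 'Paris'


report = dataset.evaluate_sync(answer_question)
report.print(include_input=True, include_output=True)
```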
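For the Pydantic AI and Logfire integration, the evaluated task can simply wrap an agent, and calling `logfire.configure()` exports evaluation runs as OpenTelemetry traces. This is a hedged sketch: it assumes Logfire credentials are already set up, and the model string and prompts are placeholders.

```python
import logfire
from pydantic_ai import Agent
from pydantic_evals import Case, Dataset

# With Logfire configured, evaluation runs are recorded as OpenTelemetry traces.
logfire.configure()

agent = Agent('openai:gpt-4o', system_prompt='Answer geography questions with one word.')


# The evaluated task delegates to the Pydantic AI agent.
async def answer_question(question: str) -> str:
    result = await agent.run(question)
    return result.output  # recent pydantic-ai exposes `.output`; older releases used `.data`


dataset = Dataset(
    cases=[Case(inputs='What is the capital of France?', expected_output='Paris')],
)
report = dataset.evaluate_sync(answer_question)
report.print(include_input=True, include_output=True)
```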