Scaffolded project? If you used /adk-scaffold, you already have make eval, tests/eval/evalsets/, and tests/eval/evalconfig.json. Start with make eval and iterate from there.
Non-scaffolded? Use adk eval directly — see Running Evaluations below.
| references/criteria-guide.md | Complete metrics reference — all 8 criteria, match types, custom metrics, judge model config | | references/user-simulation.md | Dynamic conversation testing — ConversationScenario, user simulator config, compatible metrics |
MUST READ before running any ADK evaluation. ADK evaluation methodology — eval metrics, evalset schema, LLM-as-judge, tool trajectory scoring, and common failure causes. Use when evaluating agent quality, running adk eval, or debugging eval results. Do NOT use for API code patterns (use adk-cheatsheet), deployment (use adk-deploy-guide), or project scaffolding (use adk-scaffold). Source: google/adk-docs.