·benchmark-datasets

Standard datasets and benchmarks for evaluating AI security, robustness, and safety

0Installs·0Trend·@pluginagentmarketplace

Installation

$npx skills add https://github.com/pluginagentmarketplace/custom-plugin-ai-red-teaming --skill benchmark-datasets

SKILL.md

Use standardized benchmarks to evaluate and compare AI system security, robustness, and safety.

| Agent 04 | Benchmark execution | | /analyze | Result interpretation | | CI/CD | Automated evaluation | | Grafana | Trend visualization |

Standard datasets and benchmarks for evaluating AI security, robustness, and safety Source: pluginagentmarketplace/custom-plugin-ai-red-teaming.

Open your terminal or command line tool (Terminal, iTerm, Windows Terminal, etc.) Copy and run this command: npx skills add https://github.com/pluginagentmarketplace/custom-plugin-ai-red-teaming --skill benchmark-datasets Once installed, the skill will be automatically configured in your AI coding environment and ready to use in Claude Code or Cursor

View raw

Facts (cite-ready)

Stable fields and commands for AI/search citations.

Install command
npx skills add https://github.com/pluginagentmarketplace/custom-plugin-ai-red-teaming --skill benchmark-datasets
Category
!Security
Verified
First Seen
2026-02-01
Updated
2026-02-18

Quick answers

What is benchmark-datasets?

Standard datasets and benchmarks for evaluating AI security, robustness, and safety Source: pluginagentmarketplace/custom-plugin-ai-red-teaming.

How do I install benchmark-datasets?

Open your terminal or command line tool (Terminal, iTerm, Windows Terminal, etc.) Copy and run this command: npx skills add https://github.com/pluginagentmarketplace/custom-plugin-ai-red-teaming --skill benchmark-datasets Once installed, the skill will be automatically configured in your AI coding environment and ready to use in Claude Code or Cursor

Where is the source repository?

https://github.com/pluginagentmarketplace/custom-plugin-ai-red-teaming