safety-filter-bypass
✓Techniques to test and bypass AI safety filters, content moderation systems, and guardrails for security assessment
SKILL.md
Test AI system safety filters and content moderation to identify weaknesses in protective mechanisms.
| Agent 02 | Executes bypass tests | | llm-jailbreaking skill | Jailbreak integration | | /test prompt-injection | Command interface |
Techniques to test and bypass AI safety filters, content moderation systems, and guardrails for security assessment Source: pluginagentmarketplace/custom-plugin-ai-red-teaming.
Open your terminal or command line tool (Terminal, iTerm, Windows Terminal, etc.) Copy and run this command: npx skills add https://github.com/pluginagentmarketplace/custom-plugin-ai-red-teaming --skill safety-filter-bypass Once installed, the skill will be automatically configured in your AI coding environment and ready to use in Claude Code or Cursor
Facts (cite-ready)
Stable fields and commands for AI/search citations.
- Install command
npx skills add https://github.com/pluginagentmarketplace/custom-plugin-ai-red-teaming --skill safety-filter-bypass- Category
- !Security
- Verified
- ✓
- First Seen
- 2026-02-01
- Updated
- 2026-02-18
Quick answers
What is safety-filter-bypass?
Techniques to test and bypass AI safety filters, content moderation systems, and guardrails for security assessment Source: pluginagentmarketplace/custom-plugin-ai-red-teaming.
How do I install safety-filter-bypass?
Open your terminal or command line tool (Terminal, iTerm, Windows Terminal, etc.) Copy and run this command: npx skills add https://github.com/pluginagentmarketplace/custom-plugin-ai-red-teaming --skill safety-filter-bypass Once installed, the skill will be automatically configured in your AI coding environment and ready to use in Claude Code or Cursor
Where is the source repository?
https://github.com/pluginagentmarketplace/custom-plugin-ai-red-teaming
Details
- Category
- !Security
- Source
- skills.sh
- First Seen
- 2026-02-01