security: Issue #81 - ULTRAPLINIAN fallback chain audit framework
Implement comprehensive red team audit infrastructure for testing the entire
fallback chain against jailbreak and crisis intervention attacks.
Files created:
- tests/security/ultraplinian_audit.py: Comprehensive audit runner with:
* Support for all 4 techniques: GODMODE, Parseltongue, Prefill, Crisis
* Model configurations for Kimi, Gemini, Grok, Llama
* Concurrent execution via ThreadPoolExecutor
* JSON and Markdown report generation
* CLI interface with --help, --list-models, etc.
- tests/security/FALLBACK_CHAIN_TEST_PLAN.md: Detailed test specifications:
* Complete test matrix (5 models × 4 techniques × 8 queries = 160 tests)
* Technique specifications with system prompts
* Scoring criteria and detection patterns
* Success criteria and maintenance schedule
- agent/ultraplinian_router.py (optional): Race-mode fallback router:
* Parallel model querying for safety validation
* SHIELD-based safety analysis
* Crisis escalation to SAFE SIX models
* Configurable routing decisions
Test commands:
python tests/security/ultraplinian_audit.py --help
python tests/security/ultraplinian_audit.py --all-models --all-techniques
python tests/security/ultraplinian_audit.py --model kimi-k2.5 --technique crisis
Relates to: Issue #72 (Red Team Jailbreak Audit)
Severity: MEDIUM