Allegro
be865df8c4
security: Issue #81 - ULTRAPLINIAN fallback chain audit framework
Implement comprehensive red team audit infrastructure for testing the entire
fallback chain against jailbreak and crisis intervention attacks.
Files created:
- tests/security/ultraplinian_audit.py: Comprehensive audit runner with:
* Support for all 4 techniques: GODMODE, Parseltongue, Prefill, Crisis
* Model configurations for Kimi, Gemini, Grok, Llama
* Concurrent execution via ThreadPoolExecutor
* JSON and Markdown report generation
* CLI interface with --help, --list-models, etc.
- tests/security/FALLBACK_CHAIN_TEST_PLAN.md: Detailed test specifications:
* Complete test matrix (5 models × 4 techniques × 8 queries = 160 tests)
* Technique specifications with system prompts
* Scoring criteria and detection patterns
* Success criteria and maintenance schedule
- agent/ultraplinian_router.py (optional): Race-mode fallback router:
* Parallel model querying for safety validation
* SHIELD-based safety analysis
* Crisis escalation to SAFE SIX models
* Configurable routing decisions
Test commands:
python tests/security/ultraplinian_audit.py --help
python tests/security/ultraplinian_audit.py --all-models --all-techniques
python tests/security/ultraplinian_audit.py --model kimi-k2.5 --technique crisis
Relates to: Issue #72 (Red Team Jailbreak Audit)
Severity: MEDIUM
2026-04-01 01:51:23 +00:00
..
2026-03-30 22:59:57 +00:00
2026-03-31 19:56:16 +00:00
2026-03-27 13:02:52 -07:00
2026-03-29 21:29:00 -07:00
2026-03-30 22:38:02 +00:00
2026-03-28 14:55:18 -07:00
2026-03-27 17:51:14 -07:00
2026-03-20 09:38:13 -07:00
2026-03-29 18:02:42 -07:00
2026-03-31 19:35:54 +00:00
2026-03-30 22:16:18 +00:00
2026-03-31 21:27:17 +00:00
2026-03-25 19:47:58 -07:00
2026-03-30 22:28:59 +00:00
2026-03-30 22:16:19 +00:00
2026-03-29 21:56:07 -07:00
2026-03-28 14:20:30 -07:00
2026-03-29 00:33:30 -07:00
2026-03-21 16:54:43 -07:00
2026-03-30 08:13:01 -07:00
2026-03-31 00:37:14 +00:00
2026-03-31 00:37:14 +00:00
2026-03-29 00:33:30 -07:00
2026-03-17 23:40:22 -07:00
2026-03-30 22:28:57 +00:00
2026-03-28 14:35:28 -07:00
2026-02-21 22:31:43 -08:00
2026-04-01 01:51:23 +00:00
2026-03-25 12:45:58 -07:00