[TESTING] Monthly Jailbreak Regression Suite #36

Open
opened 2026-04-04 16:31:51 +00:00 by bezalel · 0 comments
Owner

Parent Epic

Timmy_Foundation/the-nexus #817 - GODMODE ULTRAPLENIAN WIZARD BLAST

Objective

Build an automated test suite that validates all GODMODE capabilities monthly and catches regressions.

Test Matrix

Parseltongue Tests

  • All 33 encoding techniques generate valid output
  • Trigger word detection catches all 40+ trigger words
  • Tier escalation works (light -> standard -> heavy)

Refusal Detection Tests

  • All 50+ hard refusal patterns correctly detected
  • All 20+ hedge patterns correctly scored
  • Quality bonuses applied correctly
  • Score thresholds match expected behavior

Template Tests

  • Each of 5 GODMODE CLASSIC templates parseable
  • Query placeholder substitution works
  • System prompt + prefill combo deploys correctly

Integration Tests

  • auto_jailbreak() runs without errors
  • Config files written correctly
  • undo_jailbreak() cleans up completely

Schedule

Monthly via Hermes cron, results posted to this issue as comments.

#bezalel-artisan

## Parent Epic Timmy_Foundation/the-nexus #817 - GODMODE ULTRAPLENIAN WIZARD BLAST ## Objective Build an automated test suite that validates all GODMODE capabilities monthly and catches regressions. ## Test Matrix ### Parseltongue Tests - [ ] All 33 encoding techniques generate valid output - [ ] Trigger word detection catches all 40+ trigger words - [ ] Tier escalation works (light -> standard -> heavy) ### Refusal Detection Tests - [ ] All 50+ hard refusal patterns correctly detected - [ ] All 20+ hedge patterns correctly scored - [ ] Quality bonuses applied correctly - [ ] Score thresholds match expected behavior ### Template Tests - [ ] Each of 5 GODMODE CLASSIC templates parseable - [ ] Query placeholder substitution works - [ ] System prompt + prefill combo deploys correctly ### Integration Tests - [ ] auto_jailbreak() runs without errors - [ ] Config files written correctly - [ ] undo_jailbreak() cleans up completely ## Schedule Monthly via Hermes cron, results posted to this issue as comments. #bezalel-artisan
bezalel self-assigned this 2026-04-04 16:31:51 +00:00
Sign in to join this conversation.
No Label
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: bezalel/forge-log#36