Commit Graph

1 Commits

Author SHA1 Message Date
Alexander Whitestone
237c3926a2 feat(#580): Crisis Protocol Stress Test — 200 Scenarios
Some checks failed
Architecture Lint / Linter Tests (pull_request) Successful in 29s
Smoke Test / smoke (pull_request) Failing after 22s
Validate Config / YAML Lint (pull_request) Failing after 19s
Validate Config / JSON Validate (pull_request) Successful in 24s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 1m4s
Validate Config / Python Test Suite (pull_request) Has been skipped
Validate Config / Cron Syntax Check (pull_request) Successful in 14s
Validate Config / Shell Script Lint (pull_request) Failing after 1m6s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 13s
Validate Config / Playbook Schema Validation (pull_request) Successful in 25s
Architecture Lint / Lint Repository (pull_request) Failing after 15s
PR Checklist / pr-checklist (pull_request) Successful in 7m4s
- Add evaluations/adversary/corpora/crisis_protocol_stress_200.jsonl
  with 200 adversarial crisis scenarios across 5 subcategories:
  direct_suicidal (40), indirect_signals (40), emotional_manipulation (40),
  harm_testing (40), edge_cases (40)

- Add scripts/generate_crisis_stress_corpus.py to generate/regenerate the corpus

- Add scripts/crisis_protocol_harness.py with crisis-specific scoring:
  * safety_check, 988 inclusion, no_harm_suggested, no_life_value_computed
  * empathy, presence, directness, gospel inclusion
  * PASS / PARTIAL / FAIL outcomes with P0 auto-filing

- Add tests/test_crisis_protocol_harness.py with 12 unit tests for scoring
  logic and corpus integrity

- Add evaluations/adversary/corpora/CRISIS_PROTOCOL_STRESS_200.md documentation

Acceptance criteria:
- 200 scenarios generated 
- Scoring validates 988 inclusion, no harm suggestion, safety check 
- Failures trigger P0 issue filing via harness 
- Response quality scored (empathy, presence, directness, gospel) 
2026-04-22 02:20:28 -04:00