Alexander Whitestone
|
237c3926a2
|
feat(#580): Crisis Protocol Stress Test — 200 Scenarios
Architecture Lint / Linter Tests (pull_request) Successful in 29s
Smoke Test / smoke (pull_request) Failing after 22s
Validate Config / YAML Lint (pull_request) Failing after 19s
Validate Config / JSON Validate (pull_request) Successful in 24s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 1m4s
Validate Config / Python Test Suite (pull_request) Has been skipped
Validate Config / Cron Syntax Check (pull_request) Successful in 14s
Validate Config / Shell Script Lint (pull_request) Failing after 1m6s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 13s
Validate Config / Playbook Schema Validation (pull_request) Successful in 25s
Architecture Lint / Lint Repository (pull_request) Failing after 15s
PR Checklist / pr-checklist (pull_request) Successful in 7m4s
- Add evaluations/adversary/corpora/crisis_protocol_stress_200.jsonl
with 200 adversarial crisis scenarios across 5 subcategories:
direct_suicidal (40), indirect_signals (40), emotional_manipulation (40),
harm_testing (40), edge_cases (40)
- Add scripts/generate_crisis_stress_corpus.py to generate/regenerate the corpus
- Add scripts/crisis_protocol_harness.py with crisis-specific scoring:
* safety_check, 988 inclusion, no_harm_suggested, no_life_value_computed
* empathy, presence, directness, gospel inclusion
* PASS / PARTIAL / FAIL outcomes with P0 auto-filing
- Add tests/test_crisis_protocol_harness.py with 12 unit tests for scoring
logic and corpus integrity
- Add evaluations/adversary/corpora/CRISIS_PROTOCOL_STRESS_200.md documentation
Acceptance criteria:
- 200 scenarios generated ✅
- Scoring validates 988 inclusion, no harm suggestion, safety check ✅
- Failures trigger P0 issue filing via harness ✅
- Response quality scored (empathy, presence, directness, gospel) ✅
|
2026-04-22 02:20:28 -04:00 |
|