Alexander Whitestone
237c3926a2
feat( #580 ): Crisis Protocol Stress Test — 200 Scenarios
...
Architecture Lint / Linter Tests (pull_request) Successful in 29s
Smoke Test / smoke (pull_request) Failing after 22s
Validate Config / YAML Lint (pull_request) Failing after 19s
Validate Config / JSON Validate (pull_request) Successful in 24s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 1m4s
Validate Config / Python Test Suite (pull_request) Has been skipped
Validate Config / Cron Syntax Check (pull_request) Successful in 14s
Validate Config / Shell Script Lint (pull_request) Failing after 1m6s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 13s
Validate Config / Playbook Schema Validation (pull_request) Successful in 25s
Architecture Lint / Lint Repository (pull_request) Failing after 15s
PR Checklist / pr-checklist (pull_request) Successful in 7m4s
- Add evaluations/adversary/corpora/crisis_protocol_stress_200.jsonl
with 200 adversarial crisis scenarios across 5 subcategories:
direct_suicidal (40), indirect_signals (40), emotional_manipulation (40),
harm_testing (40), edge_cases (40)
- Add scripts/generate_crisis_stress_corpus.py to generate/regenerate the corpus
- Add scripts/crisis_protocol_harness.py with crisis-specific scoring:
* safety_check, 988 inclusion, no_harm_suggested, no_life_value_computed
* empathy, presence, directness, gospel inclusion
* PASS / PARTIAL / FAIL outcomes with P0 auto-filing
- Add tests/test_crisis_protocol_harness.py with 12 unit tests for scoring
logic and corpus integrity
- Add evaluations/adversary/corpora/CRISIS_PROTOCOL_STRESS_200.md documentation
Acceptance criteria:
- 200 scenarios generated ✅
- Scoring validates 988 inclusion, no harm suggestion, safety check ✅
- Failures trigger P0 issue filing via harness ✅
- Response quality scored (empathy, presence, directness, gospel) ✅
2026-04-22 02:20:28 -04:00
bb309d8c30
Merge pull request 'feat: auto-generate scene descriptions from image/video assets ( #689 )' ( #839 ) from fix/689-scene-from-media into main
Architecture Lint / Linter Tests (push) Has been cancelled
Architecture Lint / Lint Repository (push) Has been cancelled
Smoke Test / smoke (push) Has been cancelled
Validate Config / YAML Lint (push) Has been cancelled
Validate Config / JSON Validate (push) Has been cancelled
Validate Config / Python Syntax & Import Check (push) Has been cancelled
Validate Config / Python Test Suite (push) Has been cancelled
Validate Config / Shell Script Lint (push) Has been cancelled
Validate Config / Cron Syntax Check (push) Has been cancelled
Validate Config / Deploy Script Dry Run (push) Has been cancelled
Validate Config / Playbook Schema Validation (push) Has been cancelled
2026-04-21 11:26:23 +00:00
c0ffb29fbd
Merge pull request 'fix: complete all 9 genre scene description files + validation tests ( closes #645 )' ( #837 ) from burn/645-1776769427 into main
Architecture Lint / Linter Tests (push) Has been cancelled
Architecture Lint / Lint Repository (push) Has been cancelled
Smoke Test / smoke (push) Has been cancelled
Validate Config / YAML Lint (push) Has been cancelled
Validate Config / JSON Validate (push) Has been cancelled
Validate Config / Python Syntax & Import Check (push) Has been cancelled
Validate Config / Python Test Suite (push) Has been cancelled
Validate Config / Shell Script Lint (push) Has been cancelled
Validate Config / Cron Syntax Check (push) Has been cancelled
Validate Config / Deploy Script Dry Run (push) Has been cancelled
Validate Config / Playbook Schema Validation (push) Has been cancelled
2026-04-21 11:26:19 +00:00
Alexander Whitestone
a2e61f6def
feat: auto-generate scene descriptions from image/video assets ( #689 )
...
Architecture Lint / Linter Tests (pull_request) Successful in 21s
Smoke Test / smoke (pull_request) Failing after 15s
Validate Config / YAML Lint (pull_request) Failing after 18s
Validate Config / JSON Validate (pull_request) Successful in 21s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 1m3s
Validate Config / Python Test Suite (pull_request) Has been skipped
Validate Config / Shell Script Lint (pull_request) Failing after 1m11s
Validate Config / Cron Syntax Check (pull_request) Successful in 15s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 14s
Validate Config / Playbook Schema Validation (pull_request) Successful in 27s
PR Checklist / pr-checklist (pull_request) Failing after 12m35s
Architecture Lint / Lint Repository (pull_request) Failing after 22s
scripts/generate_scenes_from_media.py:
Scans assets dir for images/videos (jpg/png/mp4/mov/etc)
Calls vision model (llava/gpt-4/claude) to describe scenes
Outputs training pairs: image_path -> scene description
Includes provenance: model, timestamp, source_session_id
--assets dir, --output file, --model, --max, --dry-run
JSON parsing with fallback for plain text responses
tests/test_generate_scenes_from_media.py: 12 tests
find_media_files: images, videos, max limit, missing dir
file_hash: consistent, different files
generate_prompt: image vs video
parse_description: JSON, plain text
generate_training_pair: structure, video type
Usage:
python3 scripts/generate_scenes_from_media.py --assets ~/assets/
python3 scripts/generate_scenes_from_media.py --assets ~/assets/ --model gpt-4
python3 scripts/generate_scenes_from_media.py --assets ~/assets/ --dry-run
2026-04-21 07:22:28 -04:00
eacc670681
test: validate all 9 genre scene files have 100 valid entries ( #645 )
Architecture Lint / Linter Tests (pull_request) Successful in 24s
Smoke Test / smoke (pull_request) Failing after 44s
Validate Config / YAML Lint (pull_request) Failing after 31s
Validate Config / JSON Validate (pull_request) Successful in 36s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 57s
Validate Config / Python Test Suite (pull_request) Has been skipped
Validate Config / Shell Script Lint (pull_request) Failing after 23s
Validate Config / Cron Syntax Check (pull_request) Successful in 4s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 5s
Validate Config / Playbook Schema Validation (pull_request) Successful in 7s
PR Checklist / pr-checklist (pull_request) Failing after 12m4s
Architecture Lint / Lint Repository (pull_request) Failing after 24s
2026-04-21 11:20:25 +00:00
fe864962ec
test: Enhance PR triage tests ( #659 )
Architecture Lint / Linter Tests (pull_request) Successful in 33s
Smoke Test / smoke (pull_request) Failing after 39s
Validate Config / YAML Lint (pull_request) Failing after 27s
Validate Config / JSON Validate (pull_request) Successful in 22s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 21s
Validate Config / Python Test Suite (pull_request) Has been skipped
Validate Config / Shell Script Lint (pull_request) Failing after 24s
Validate Config / Cron Syntax Check (pull_request) Successful in 5s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 6s
Validate Config / Playbook Schema Validation (pull_request) Successful in 10s
PR Checklist / pr-checklist (pull_request) Failing after 11m27s
Architecture Lint / Lint Repository (pull_request) Failing after 11s
2026-04-21 11:17:00 +00:00
Alexander Whitestone
19db78bbf0
feat: stale hermes process cleanup script ( #829 )
...
Architecture Lint / Linter Tests (pull_request) Successful in 6m45s
Smoke Test / smoke (pull_request) Failing after 8s
Validate Config / YAML Lint (pull_request) Failing after 8s
Validate Config / JSON Validate (pull_request) Successful in 11s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 43s
Validate Config / Python Test Suite (pull_request) Has been skipped
Validate Config / Shell Script Lint (pull_request) Failing after 36s
Validate Config / Cron Syntax Check (pull_request) Successful in 8s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 9s
Validate Config / Playbook Schema Validation (pull_request) Successful in 15s
PR Checklist / pr-checklist (pull_request) Successful in 2m45s
Architecture Lint / Lint Repository (pull_request) Failing after 20s
bin/hermes_cleanup.py:
Identifies stale hermes sessions (old + idle)
Groups by session, tracks parent+children
Memory waste calculation (RSS in MB/GB)
--kill to terminate, --dry-run (default) to report
--max-age (default 24h), --max-cpu (default 0.5%)
--json output, human-readable table
tests/test_hermes_cleanup.py: 8 tests
process age, child PIDs, kill session,
dry run, report generation
Usage:
python3 bin/hermes_cleanup.py # report
python3 bin/hermes_cleanup.py --kill # terminate
python3 bin/hermes_cleanup.py --max-age 12 # 12h threshold
python3 bin/hermes_cleanup.py --json # JSON
2026-04-20 20:38:20 -04:00
0e2e2c1552
Merge pull request 'feat: code block normalization tests ( closes #750 )' ( #825 ) from fix/750-code-blocks into main
Architecture Lint / Lint Repository (push) Has been cancelled
Architecture Lint / Linter Tests (push) Has been cancelled
Smoke Test / smoke (push) Has been cancelled
Validate Config / Python Syntax & Import Check (push) Has been cancelled
Validate Config / Python Test Suite (push) Has been cancelled
Validate Config / Shell Script Lint (push) Has been cancelled
Validate Config / Cron Syntax Check (push) Has been cancelled
Validate Config / Deploy Script Dry Run (push) Has been cancelled
Validate Config / Playbook Schema Validation (push) Has been cancelled
Validate Config / JSON Validate (push) Has been cancelled
Validate Config / YAML Lint (push) Has started running
2026-04-20 23:40:35 +00:00
Alexander Whitestone
6fcd2cc59a
feat: code block normalization tests ( closes #750 )
...
Architecture Lint / Linter Tests (pull_request) Successful in 15s
Smoke Test / smoke (pull_request) Failing after 17s
Validate Config / YAML Lint (pull_request) Failing after 15s
Validate Config / JSON Validate (pull_request) Successful in 18s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 39s
Validate Config / Cron Syntax Check (pull_request) Successful in 12s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 10s
Validate Config / Shell Script Lint (pull_request) Failing after 56s
Validate Config / Playbook Schema Validation (pull_request) Successful in 17s
PR Checklist / pr-checklist (pull_request) Successful in 2m45s
Architecture Lint / Lint Repository (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
tests/test_normalize_code_blocks.py: 5 tests
test_normalizes_indented_code_block
test_preserves_non_code_content
test_handles_multiple_code_blocks
test_handles_empty_response
test_preserves_prompt
Existing normalize-code-blocks.py handles code block indentation.
2026-04-18 15:46:22 -04:00
Alexander Whitestone
edd35eaa4b
fix: restore pytest collection — fix 7 syntax/import errors ( #823 )
...
Architecture Lint / Linter Tests (pull_request) Successful in 12s
Smoke Test / smoke (pull_request) Failing after 19s
Validate Config / YAML Lint (pull_request) Failing after 14s
Validate Config / JSON Validate (pull_request) Successful in 13s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 52s
Validate Config / Shell Script Lint (pull_request) Failing after 42s
Validate Config / Cron Syntax Check (pull_request) Successful in 16s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 14s
Validate Config / Playbook Schema Validation (pull_request) Successful in 18s
PR Checklist / pr-checklist (pull_request) Successful in 3m4s
Architecture Lint / Lint Repository (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
Fixed collection errors:
scripts/adversary_schema.py: unterminated regex string (line 141)
scripts/config_validate.py: unmatched ')' (line 87)
scripts/pr_triage.py: truncated file + unterminated f-string
adversary/harm_facilitation_adversary.py: 4 broken f-strings
bin/glitch_patterns.py: missing get_threejs_patterns() export
tests/test_glitch_detector.py: fixed THREEJS_CATEGORIES import
tests/test_pr_triage.py: fixed function name imports
training/training_pair_provenance.py: added ProvenanceTracker class
scripts/validate_scene_data.py: symlink for import compatibility
Result: python3 -m pytest --collect-only
911 tests collected, 0 collection errors
(was: 769 collected / 7 errors)
2026-04-18 15:37:33 -04:00
0fc149b10c
Merge pull request 'feat: quality filter tests — score specificity, length ratio, code' ( #817 ) from fix/687-quality-filter into main
2026-04-17 02:32:51 -04:00
ed5e52e0d9
Merge pull request 'feat: harm facilitation adversary — 200 jailbreak prompts' ( #816 ) from ward/618-harm-facilitation into main
2026-04-17 02:32:48 -04:00
2c49cac144
Merge pull request 'fix( #662 ): cron fleet audit — crontab parsing, tests, CI validation' ( #814 ) from burn/662-cron-audit-fix into main
2026-04-17 02:32:44 -04:00
1183fb5f2b
Merge pull request 'feat: scene data validator tests + CI path fix' ( #813 ) from feat/647-scene-data-validator into main
2026-04-17 02:32:40 -04:00
7ce0016207
Merge pull request 'test: verify training example metadata preservation' ( #812 ) from fix/646-metadata-preservation into main
2026-04-17 02:32:37 -04:00
06bebc0ca3
Merge pull request 'feat: adversary execution harness for prompt corpora' ( #811 ) from fix/652-adversary-harness into main
2026-04-17 02:32:33 -04:00
b2246e0dcc
Merge pull request 'feat: PR backlog triage script — categorize, find duplicates, detect stale refs' ( #810 ) from burn/658-pr-backlog-triage into main
2026-04-17 02:32:30 -04:00
87ee28aa42
Merge pull request 'feat: Token tracker integrated with orchestrator — auto-logging on task completion' ( #808 ) from fix/634-token-tracker-orchestrator into main
2026-04-17 02:32:27 -04:00
89413d00d3
Merge pull request 'fix: hash dedup rotation + bloom filter — bounded memory ( #628 )' ( #804 ) from burn/621-shared-orchestrator-1776402806 into main
2026-04-17 06:19:03 +00:00
65a400f3ed
Merge pull request 'feat: shared adversary scoring rubric and transcript schema ( closes #655 )' ( #802 ) from feat/655-adversary-scoring-rubric into main
2026-04-17 06:19:01 +00:00
Alexander Whitestone
3e9d808739
feat: quality filter tests — score specificity, length ratio, code ( #687 )
...
Architecture Lint / Linter Tests (pull_request) Successful in 14s
Smoke Test / smoke (pull_request) Failing after 16s
Validate Config / YAML Lint (pull_request) Failing after 14s
Validate Config / JSON Validate (pull_request) Successful in 15s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 45s
Validate Config / Shell Script Lint (pull_request) Failing after 23s
Validate Config / Cron Syntax Check (pull_request) Successful in 6s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 5s
Validate Config / Playbook Schema Validation (pull_request) Successful in 18s
PR Checklist / pr-checklist (pull_request) Failing after 3m51s
Architecture Lint / Lint Repository (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
tests/test_quality_filter.py: 12 tests
TestScorePair (10):
good pair scores high (>= 0.8)
empty response scores 0
short response penalized
filler response penalized (sure/ok/yes/no)
prompt equals response penalized
placeholder detected (TODO/FIXME/etc)
repetitive response penalized
short prompt penalized
terse/rich key format accepted
scenario/response key format accepted
TestFilterFile (2):
filter creates output file with kept entries
dry run mode
Existing quality_filter.py already implements:
- specificity (filler detection, placeholder detection)
- length ratio (response/prompt comparison)
- code correctness (repetition detection)
- threshold-based filtering
- report: N filtered out of M total
2026-04-17 01:51:47 -04:00
d983d2f163
test: harm facilitation adversary tests ( #618 )
Architecture Lint / Linter Tests (pull_request) Successful in 23s
Smoke Test / smoke (pull_request) Failing after 16s
Validate Config / YAML Lint (pull_request) Failing after 13s
Validate Config / JSON Validate (pull_request) Successful in 16s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 38s
Validate Config / Cron Syntax Check (pull_request) Successful in 12s
Validate Config / Shell Script Lint (pull_request) Failing after 54s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 10s
Validate Config / Playbook Schema Validation (pull_request) Successful in 20s
PR Checklist / pr-checklist (pull_request) Failing after 3m52s
Architecture Lint / Lint Repository (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
2026-04-17 05:45:19 +00:00
Alexander Whitestone
d278d7f5d5
fix( #662 ): cron fleet audit — crontab parsing, tests, CI validation
...
Architecture Lint / Linter Tests (pull_request) Successful in 24s
Smoke Test / smoke (pull_request) Failing after 14s
Validate Config / YAML Lint (pull_request) Failing after 14s
Validate Config / JSON Validate (pull_request) Successful in 16s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 46s
Validate Config / Cron Syntax Check (pull_request) Successful in 8s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 7s
Validate Config / Shell Script Lint (pull_request) Failing after 44s
Validate Config / Playbook Schema Validation (pull_request) Successful in 22s
PR Checklist / pr-checklist (pull_request) Failing after 3m55s
Architecture Lint / Lint Repository (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
- Added VPS crontab backup parsing to cron-audit-662.py
- New audit_fleet() combines hermes cron + VPS crontabs
- load_crontab_backups() reads cron/vps/*-crontab-backup.txt
- 20+ tests: crontab parsing, job categorization, fleet audit,
timestamp parsing, backup loading
- ci-cron-validate.py: CI gate that fails on systemic failures
- Fresh audit report generated in cron/audit-report.json
Closes #662
2026-04-17 01:34:45 -04:00
40b68a13cb
test: scene data validator tests ( #647 )
2026-04-17 05:34:36 +00:00
50e1d63c45
test: adversary execution harness ( #652 )
...
Architecture Lint / Linter Tests (pull_request) Successful in 14s
Smoke Test / smoke (pull_request) Failing after 16s
Validate Config / YAML Lint (pull_request) Failing after 19s
Validate Config / JSON Validate (pull_request) Successful in 20s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 45s
Validate Config / Cron Syntax Check (pull_request) Successful in 11s
Validate Config / Shell Script Lint (pull_request) Failing after 53s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 10s
Validate Config / Playbook Schema Validation (pull_request) Successful in 18s
PR Checklist / pr-checklist (pull_request) Failing after 3m57s
Validate Config / Python Test Suite (pull_request) Has been cancelled
Architecture Lint / Lint Repository (pull_request) Has been cancelled
35 tests covering:
- Scoring: refusal, compliance, empty, mixed, hedged, long-confident
- Corpus loading: valid JSONL, blank lines, malformed, ID normalization
- Validation: missing prompt, duplicate IDs
- Severity/category breakdowns
- Full harness: mock backend, custom runner, max_prompts, metadata, transcripts
- Output writers: all files, JSONL parseable, summary structure
Closes #652
2026-04-17 05:33:38 +00:00
a4a33fd0f8
test: add edge-case tests for training example metadata preservation
...
Architecture Lint / Linter Tests (pull_request) Successful in 19s
Smoke Test / smoke (pull_request) Failing after 12s
Validate Config / YAML Lint (pull_request) Failing after 9s
Validate Config / JSON Validate (pull_request) Successful in 12s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 45s
Validate Config / Cron Syntax Check (pull_request) Successful in 10s
Validate Config / Shell Script Lint (pull_request) Failing after 52s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 9s
Validate Config / Playbook Schema Validation (pull_request) Successful in 21s
PR Checklist / pr-checklist (pull_request) Failing after 3m50s
Architecture Lint / Lint Repository (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
- test_metadata_with_future_fields_preserved: unknown fields pass through
- test_metadata_preserved_across_multiple_examples: per-example independence
Verifies fix for #646 .
2026-04-17 05:33:08 +00:00
f05c014143
test: Add PR backlog triage tests ( #658 )
Architecture Lint / Linter Tests (pull_request) Successful in 24s
Smoke Test / smoke (pull_request) Failing after 19s
Validate Config / YAML Lint (pull_request) Failing after 15s
Validate Config / JSON Validate (pull_request) Successful in 15s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 42s
Validate Config / Shell Script Lint (pull_request) Failing after 37s
Validate Config / Cron Syntax Check (pull_request) Successful in 8s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 6s
Validate Config / Playbook Schema Validation (pull_request) Successful in 13s
PR Checklist / pr-checklist (pull_request) Failing after 3m19s
Architecture Lint / Lint Repository (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
2026-04-17 05:32:20 +00:00
df4dcf1fb4
test: Token tracker orchestrator integration tests ( #634 )
Architecture Lint / Linter Tests (pull_request) Successful in 24s
Smoke Test / smoke (pull_request) Failing after 9s
Validate Config / YAML Lint (pull_request) Failing after 11s
Validate Config / JSON Validate (pull_request) Successful in 13s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 42s
Validate Config / Shell Script Lint (pull_request) Failing after 34s
Validate Config / Cron Syntax Check (pull_request) Successful in 5s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 4s
Validate Config / Playbook Schema Validation (pull_request) Successful in 14s
PR Checklist / pr-checklist (pull_request) Failing after 3m32s
Architecture Lint / Lint Repository (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
2026-04-17 05:32:18 +00:00
15713958e6
test: bloom filter + hash dedup rotation tests #628
Architecture Lint / Linter Tests (pull_request) Successful in 16s
Smoke Test / smoke (pull_request) Failing after 17s
Validate Config / YAML Lint (pull_request) Failing after 13s
Validate Config / JSON Validate (pull_request) Successful in 17s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 35s
Validate Config / Shell Script Lint (pull_request) Failing after 21s
Validate Config / Cron Syntax Check (pull_request) Successful in 4s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 6s
Validate Config / Playbook Schema Validation (pull_request) Successful in 21s
PR Checklist / pr-checklist (pull_request) Failing after 3m32s
Architecture Lint / Lint Repository (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
2026-04-17 05:26:05 +00:00
627f2e0158
test: adversary scoring rubric and schema tests ( #655 )
Architecture Lint / Linter Tests (pull_request) Successful in 15s
Validate Config / YAML Lint (pull_request) Failing after 15s
Validate Config / JSON Validate (pull_request) Successful in 10s
Smoke Test / smoke (pull_request) Failing after 17s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 31s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 12s
PR Checklist / pr-checklist (pull_request) Failing after 3m51s
Validate Config / Shell Script Lint (pull_request) Failing after 53s
Validate Config / Cron Syntax Check (pull_request) Successful in 10s
Validate Config / Playbook Schema Validation (pull_request) Successful in 21s
Architecture Lint / Lint Repository (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
2026-04-17 05:18:38 +00:00
9ec0a22d6a
test: config validation tests
...
Architecture Lint / Linter Tests (pull_request) Successful in 9s
PR Checklist / pr-checklist (pull_request) Failing after 1m9s
Validate Config / JSON Validate (pull_request) Successful in 5s
Smoke Test / smoke (pull_request) Failing after 6s
Validate Config / YAML Lint (pull_request) Failing after 5s
Validate Config / Cron Syntax Check (pull_request) Successful in 3s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 3s
Validate Config / Playbook Schema Validation (pull_request) Successful in 5s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 12s
Validate Config / Shell Script Lint (pull_request) Failing after 12s
Architecture Lint / Lint Repository (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
Part of #690
2026-04-17 05:07:46 +00:00
Merge Bot
2ab7f1c3f8
Merge PR #649 : tests/test_crisis_manipulation_edge_cases_dataset.py (added)
2026-04-16 05:12:17 +00:00
Merge Bot
0f3b95e94f
Merge PR #666 : tests/test_tasks_core.py
2026-04-16 05:10:39 +00:00
Merge Bot
ad86e61bf7
Merge PR #669 : tests/test_quality_gate.py
2026-04-16 05:10:21 +00:00
Merge Bot
0c2c343579
Merge PR #638 : tests/test_crisis_indirect_pairs.py (added)
2026-04-16 05:08:34 +00:00
Merge Bot
a8ed7abb21
Merge PR #653 : tests/test_adversary_value_violations.py (added)
2026-04-16 05:07:14 +00:00
Merge Bot
5d9ef59ffc
Merge PR #656 : tests/test_crisis_direct_suicidal_pairs.py (added)
2026-04-16 05:06:56 +00:00
Merge Bot
6da99ddc3c
Merge PR #657 : tests/test_harm_facilitation_adversary.py (added)
2026-04-16 05:06:40 +00:00
Merge Bot
964da601c4
Merge PR #661 : tests/test_adversary_authority_bypass.py (added)
2026-04-16 05:06:24 +00:00
Merge Bot
d8659447c0
Merge PR #671 : tests/test_cron_audit.py (added)
2026-04-16 05:05:58 +00:00
Merge Bot
97b3c7c8c6
Merge PR #679 : tests/test_pr_triage.py (added)
2026-04-16 05:05:45 +00:00
Merge Bot
3cb690ae3b
Merge PR #724 : tests/test_adversary_eval.py (added)
2026-04-16 05:04:11 +00:00
Merge Bot
405d95e7a1
Merge PR #729 : tests/test_scene_descriptions.py (added)
2026-04-16 05:03:58 +00:00
Merge Bot
14b0cc41c1
Merge PR #738 : tests/test_config_template.py (added)
2026-04-16 05:03:33 +00:00
Merge Bot
a8c08e0bce
Merge PR #743 : tests/test_config_overlay.py (added)
2026-04-16 05:03:20 +00:00
Merge Bot
e399ce40a8
Merge PR #769 : tests/test_quality_gate.py (added)
2026-04-16 04:59:13 +00:00
Merge Bot
9bdd2d776e
Merge PR #773 : tests/test_hash_dedup.py (added)
2026-04-16 04:58:57 +00:00
Merge Bot
29790d24aa
Merge PR #776 : tests/test_config_drift.py (added)
2026-04-16 04:58:46 +00:00
Merge Bot
5c4b453687
Merge PR #777 : tests/test_token_tracker.py (added)
2026-04-16 04:58:41 +00:00
Merge Bot
346b7c6be4
Merge PR #780 : tests/test_shebangs.py (added)
2026-04-16 04:58:31 +00:00