Commit Graph

75 Commits

Author SHA1 Message Date
Alexander Whitestone
91d94e29e8 fix(pipeline): repair token tracker CLI summary (#622)
Some checks failed
Smoke Test / smoke (pull_request) Failing after 24s
Architecture Lint / Linter Tests (pull_request) Successful in 29s
Validate Config / YAML Lint (pull_request) Failing after 16s
Validate Config / JSON Validate (pull_request) Successful in 21s
Validate Config / Shell Script Lint (pull_request) Failing after 1m8s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 1m21s
Validate Config / Python Test Suite (pull_request) Has been skipped
Validate Config / Cron Syntax Check (pull_request) Successful in 15s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 15s
Validate Config / Playbook Schema Validation (pull_request) Successful in 30s
Architecture Lint / Lint Repository (pull_request) Failing after 28s
PR Checklist / pr-checklist (pull_request) Successful in 5m18s
Fix the hyphenated token-tracker entrypoint and normalize SQLite time filtering
so same-day usage appears in the summary dashboard.
2026-04-22 11:23:43 -04:00
bb309d8c30 Merge pull request 'feat: auto-generate scene descriptions from image/video assets (#689)' (#839) from fix/689-scene-from-media into main
Some checks failed
Architecture Lint / Linter Tests (push) Has been cancelled
Architecture Lint / Lint Repository (push) Has been cancelled
Smoke Test / smoke (push) Has been cancelled
Validate Config / YAML Lint (push) Has been cancelled
Validate Config / JSON Validate (push) Has been cancelled
Validate Config / Python Syntax & Import Check (push) Has been cancelled
Validate Config / Python Test Suite (push) Has been cancelled
Validate Config / Shell Script Lint (push) Has been cancelled
Validate Config / Cron Syntax Check (push) Has been cancelled
Validate Config / Deploy Script Dry Run (push) Has been cancelled
Validate Config / Playbook Schema Validation (push) Has been cancelled
2026-04-21 11:26:23 +00:00
c0ffb29fbd Merge pull request 'fix: complete all 9 genre scene description files + validation tests (closes #645)' (#837) from burn/645-1776769427 into main
Some checks failed
Architecture Lint / Linter Tests (push) Has been cancelled
Architecture Lint / Lint Repository (push) Has been cancelled
Smoke Test / smoke (push) Has been cancelled
Validate Config / YAML Lint (push) Has been cancelled
Validate Config / JSON Validate (push) Has been cancelled
Validate Config / Python Syntax & Import Check (push) Has been cancelled
Validate Config / Python Test Suite (push) Has been cancelled
Validate Config / Shell Script Lint (push) Has been cancelled
Validate Config / Cron Syntax Check (push) Has been cancelled
Validate Config / Deploy Script Dry Run (push) Has been cancelled
Validate Config / Playbook Schema Validation (push) Has been cancelled
2026-04-21 11:26:19 +00:00
Alexander Whitestone
a2e61f6def feat: auto-generate scene descriptions from image/video assets (#689)
Some checks failed
Architecture Lint / Linter Tests (pull_request) Successful in 21s
Smoke Test / smoke (pull_request) Failing after 15s
Validate Config / YAML Lint (pull_request) Failing after 18s
Validate Config / JSON Validate (pull_request) Successful in 21s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 1m3s
Validate Config / Python Test Suite (pull_request) Has been skipped
Validate Config / Shell Script Lint (pull_request) Failing after 1m11s
Validate Config / Cron Syntax Check (pull_request) Successful in 15s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 14s
Validate Config / Playbook Schema Validation (pull_request) Successful in 27s
PR Checklist / pr-checklist (pull_request) Failing after 12m35s
Architecture Lint / Lint Repository (pull_request) Failing after 22s
scripts/generate_scenes_from_media.py:
  Scans assets dir for images/videos (jpg/png/mp4/mov/etc)
  Calls vision model (llava/gpt-4/claude) to describe scenes
  Outputs training pairs: image_path -> scene description
  Includes provenance: model, timestamp, source_session_id
  --assets dir, --output file, --model, --max, --dry-run
  JSON parsing with fallback for plain text responses

tests/test_generate_scenes_from_media.py: 12 tests
  find_media_files: images, videos, max limit, missing dir
  file_hash: consistent, different files
  generate_prompt: image vs video
  parse_description: JSON, plain text
  generate_training_pair: structure, video type

Usage:
  python3 scripts/generate_scenes_from_media.py --assets ~/assets/
  python3 scripts/generate_scenes_from_media.py --assets ~/assets/ --model gpt-4
  python3 scripts/generate_scenes_from_media.py --assets ~/assets/ --dry-run
2026-04-21 07:22:28 -04:00
eacc670681 test: validate all 9 genre scene files have 100 valid entries (#645)
Some checks failed
Architecture Lint / Linter Tests (pull_request) Successful in 24s
Smoke Test / smoke (pull_request) Failing after 44s
Validate Config / YAML Lint (pull_request) Failing after 31s
Validate Config / JSON Validate (pull_request) Successful in 36s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 57s
Validate Config / Python Test Suite (pull_request) Has been skipped
Validate Config / Shell Script Lint (pull_request) Failing after 23s
Validate Config / Cron Syntax Check (pull_request) Successful in 4s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 5s
Validate Config / Playbook Schema Validation (pull_request) Successful in 7s
PR Checklist / pr-checklist (pull_request) Failing after 12m4s
Architecture Lint / Lint Repository (pull_request) Failing after 24s
2026-04-21 11:20:25 +00:00
fe864962ec test: Enhance PR triage tests (#659)
Some checks failed
Architecture Lint / Linter Tests (pull_request) Successful in 33s
Smoke Test / smoke (pull_request) Failing after 39s
Validate Config / YAML Lint (pull_request) Failing after 27s
Validate Config / JSON Validate (pull_request) Successful in 22s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 21s
Validate Config / Python Test Suite (pull_request) Has been skipped
Validate Config / Shell Script Lint (pull_request) Failing after 24s
Validate Config / Cron Syntax Check (pull_request) Successful in 5s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 6s
Validate Config / Playbook Schema Validation (pull_request) Successful in 10s
PR Checklist / pr-checklist (pull_request) Failing after 11m27s
Architecture Lint / Lint Repository (pull_request) Failing after 11s
2026-04-21 11:17:00 +00:00
Alexander Whitestone
19db78bbf0 feat: stale hermes process cleanup script (#829)
Some checks failed
Architecture Lint / Linter Tests (pull_request) Successful in 6m45s
Smoke Test / smoke (pull_request) Failing after 8s
Validate Config / YAML Lint (pull_request) Failing after 8s
Validate Config / JSON Validate (pull_request) Successful in 11s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 43s
Validate Config / Python Test Suite (pull_request) Has been skipped
Validate Config / Shell Script Lint (pull_request) Failing after 36s
Validate Config / Cron Syntax Check (pull_request) Successful in 8s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 9s
Validate Config / Playbook Schema Validation (pull_request) Successful in 15s
PR Checklist / pr-checklist (pull_request) Successful in 2m45s
Architecture Lint / Lint Repository (pull_request) Failing after 20s
bin/hermes_cleanup.py:
  Identifies stale hermes sessions (old + idle)
  Groups by session, tracks parent+children
  Memory waste calculation (RSS in MB/GB)
  --kill to terminate, --dry-run (default) to report
  --max-age (default 24h), --max-cpu (default 0.5%)
  --json output, human-readable table

tests/test_hermes_cleanup.py: 8 tests
  process age, child PIDs, kill session,
  dry run, report generation

Usage:
  python3 bin/hermes_cleanup.py              # report
  python3 bin/hermes_cleanup.py --kill       # terminate
  python3 bin/hermes_cleanup.py --max-age 12 # 12h threshold
  python3 bin/hermes_cleanup.py --json       # JSON
2026-04-20 20:38:20 -04:00
0e2e2c1552 Merge pull request 'feat: code block normalization tests (closes #750)' (#825) from fix/750-code-blocks into main
Some checks failed
Architecture Lint / Lint Repository (push) Has been cancelled
Architecture Lint / Linter Tests (push) Has been cancelled
Smoke Test / smoke (push) Has been cancelled
Validate Config / Python Syntax & Import Check (push) Has been cancelled
Validate Config / Python Test Suite (push) Has been cancelled
Validate Config / Shell Script Lint (push) Has been cancelled
Validate Config / Cron Syntax Check (push) Has been cancelled
Validate Config / Deploy Script Dry Run (push) Has been cancelled
Validate Config / Playbook Schema Validation (push) Has been cancelled
Validate Config / JSON Validate (push) Has been cancelled
Validate Config / YAML Lint (push) Has started running
2026-04-20 23:40:35 +00:00
Alexander Whitestone
6fcd2cc59a feat: code block normalization tests (closes #750)
Some checks failed
Architecture Lint / Linter Tests (pull_request) Successful in 15s
Smoke Test / smoke (pull_request) Failing after 17s
Validate Config / YAML Lint (pull_request) Failing after 15s
Validate Config / JSON Validate (pull_request) Successful in 18s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 39s
Validate Config / Cron Syntax Check (pull_request) Successful in 12s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 10s
Validate Config / Shell Script Lint (pull_request) Failing after 56s
Validate Config / Playbook Schema Validation (pull_request) Successful in 17s
PR Checklist / pr-checklist (pull_request) Successful in 2m45s
Architecture Lint / Lint Repository (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
tests/test_normalize_code_blocks.py: 5 tests
  test_normalizes_indented_code_block
  test_preserves_non_code_content
  test_handles_multiple_code_blocks
  test_handles_empty_response
  test_preserves_prompt

Existing normalize-code-blocks.py handles code block indentation.
2026-04-18 15:46:22 -04:00
Alexander Whitestone
edd35eaa4b fix: restore pytest collection — fix 7 syntax/import errors (#823)
Some checks failed
Architecture Lint / Linter Tests (pull_request) Successful in 12s
Smoke Test / smoke (pull_request) Failing after 19s
Validate Config / YAML Lint (pull_request) Failing after 14s
Validate Config / JSON Validate (pull_request) Successful in 13s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 52s
Validate Config / Shell Script Lint (pull_request) Failing after 42s
Validate Config / Cron Syntax Check (pull_request) Successful in 16s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 14s
Validate Config / Playbook Schema Validation (pull_request) Successful in 18s
PR Checklist / pr-checklist (pull_request) Successful in 3m4s
Architecture Lint / Lint Repository (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
Fixed collection errors:
  scripts/adversary_schema.py: unterminated regex string (line 141)
  scripts/config_validate.py: unmatched ')' (line 87)
  scripts/pr_triage.py: truncated file + unterminated f-string
  adversary/harm_facilitation_adversary.py: 4 broken f-strings
  bin/glitch_patterns.py: missing get_threejs_patterns() export
  tests/test_glitch_detector.py: fixed THREEJS_CATEGORIES import
  tests/test_pr_triage.py: fixed function name imports
  training/training_pair_provenance.py: added ProvenanceTracker class
  scripts/validate_scene_data.py: symlink for import compatibility

Result: python3 -m pytest --collect-only
  911 tests collected, 0 collection errors
  (was: 769 collected / 7 errors)
2026-04-18 15:37:33 -04:00
0fc149b10c Merge pull request 'feat: quality filter tests — score specificity, length ratio, code' (#817) from fix/687-quality-filter into main 2026-04-17 02:32:51 -04:00
ed5e52e0d9 Merge pull request 'feat: harm facilitation adversary — 200 jailbreak prompts' (#816) from ward/618-harm-facilitation into main 2026-04-17 02:32:48 -04:00
2c49cac144 Merge pull request 'fix(#662): cron fleet audit — crontab parsing, tests, CI validation' (#814) from burn/662-cron-audit-fix into main 2026-04-17 02:32:44 -04:00
1183fb5f2b Merge pull request 'feat: scene data validator tests + CI path fix' (#813) from feat/647-scene-data-validator into main 2026-04-17 02:32:40 -04:00
7ce0016207 Merge pull request 'test: verify training example metadata preservation' (#812) from fix/646-metadata-preservation into main 2026-04-17 02:32:37 -04:00
06bebc0ca3 Merge pull request 'feat: adversary execution harness for prompt corpora' (#811) from fix/652-adversary-harness into main 2026-04-17 02:32:33 -04:00
b2246e0dcc Merge pull request 'feat: PR backlog triage script — categorize, find duplicates, detect stale refs' (#810) from burn/658-pr-backlog-triage into main 2026-04-17 02:32:30 -04:00
87ee28aa42 Merge pull request 'feat: Token tracker integrated with orchestrator — auto-logging on task completion' (#808) from fix/634-token-tracker-orchestrator into main 2026-04-17 02:32:27 -04:00
89413d00d3 Merge pull request 'fix: hash dedup rotation + bloom filter — bounded memory (#628)' (#804) from burn/621-shared-orchestrator-1776402806 into main 2026-04-17 06:19:03 +00:00
65a400f3ed Merge pull request 'feat: shared adversary scoring rubric and transcript schema (closes #655)' (#802) from feat/655-adversary-scoring-rubric into main 2026-04-17 06:19:01 +00:00
Alexander Whitestone
3e9d808739 feat: quality filter tests — score specificity, length ratio, code (#687)
Some checks failed
Architecture Lint / Linter Tests (pull_request) Successful in 14s
Smoke Test / smoke (pull_request) Failing after 16s
Validate Config / YAML Lint (pull_request) Failing after 14s
Validate Config / JSON Validate (pull_request) Successful in 15s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 45s
Validate Config / Shell Script Lint (pull_request) Failing after 23s
Validate Config / Cron Syntax Check (pull_request) Successful in 6s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 5s
Validate Config / Playbook Schema Validation (pull_request) Successful in 18s
PR Checklist / pr-checklist (pull_request) Failing after 3m51s
Architecture Lint / Lint Repository (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
tests/test_quality_filter.py: 12 tests

TestScorePair (10):
  good pair scores high (>= 0.8)
  empty response scores 0
  short response penalized
  filler response penalized (sure/ok/yes/no)
  prompt equals response penalized
  placeholder detected (TODO/FIXME/etc)
  repetitive response penalized
  short prompt penalized
  terse/rich key format accepted
  scenario/response key format accepted

TestFilterFile (2):
  filter creates output file with kept entries
  dry run mode

Existing quality_filter.py already implements:
  - specificity (filler detection, placeholder detection)
  - length ratio (response/prompt comparison)
  - code correctness (repetition detection)
  - threshold-based filtering
  - report: N filtered out of M total
2026-04-17 01:51:47 -04:00
d983d2f163 test: harm facilitation adversary tests (#618)
Some checks failed
Architecture Lint / Linter Tests (pull_request) Successful in 23s
Smoke Test / smoke (pull_request) Failing after 16s
Validate Config / YAML Lint (pull_request) Failing after 13s
Validate Config / JSON Validate (pull_request) Successful in 16s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 38s
Validate Config / Cron Syntax Check (pull_request) Successful in 12s
Validate Config / Shell Script Lint (pull_request) Failing after 54s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 10s
Validate Config / Playbook Schema Validation (pull_request) Successful in 20s
PR Checklist / pr-checklist (pull_request) Failing after 3m52s
Architecture Lint / Lint Repository (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
2026-04-17 05:45:19 +00:00
Alexander Whitestone
d278d7f5d5 fix(#662): cron fleet audit — crontab parsing, tests, CI validation
Some checks failed
Architecture Lint / Linter Tests (pull_request) Successful in 24s
Smoke Test / smoke (pull_request) Failing after 14s
Validate Config / YAML Lint (pull_request) Failing after 14s
Validate Config / JSON Validate (pull_request) Successful in 16s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 46s
Validate Config / Cron Syntax Check (pull_request) Successful in 8s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 7s
Validate Config / Shell Script Lint (pull_request) Failing after 44s
Validate Config / Playbook Schema Validation (pull_request) Successful in 22s
PR Checklist / pr-checklist (pull_request) Failing after 3m55s
Architecture Lint / Lint Repository (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
- Added VPS crontab backup parsing to cron-audit-662.py
- New audit_fleet() combines hermes cron + VPS crontabs
- load_crontab_backups() reads cron/vps/*-crontab-backup.txt
- 20+ tests: crontab parsing, job categorization, fleet audit,
  timestamp parsing, backup loading
- ci-cron-validate.py: CI gate that fails on systemic failures
- Fresh audit report generated in cron/audit-report.json

Closes #662
2026-04-17 01:34:45 -04:00
40b68a13cb test: scene data validator tests (#647) 2026-04-17 05:34:36 +00:00
50e1d63c45 test: adversary execution harness (#652)
Some checks failed
Architecture Lint / Linter Tests (pull_request) Successful in 14s
Smoke Test / smoke (pull_request) Failing after 16s
Validate Config / YAML Lint (pull_request) Failing after 19s
Validate Config / JSON Validate (pull_request) Successful in 20s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 45s
Validate Config / Cron Syntax Check (pull_request) Successful in 11s
Validate Config / Shell Script Lint (pull_request) Failing after 53s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 10s
Validate Config / Playbook Schema Validation (pull_request) Successful in 18s
PR Checklist / pr-checklist (pull_request) Failing after 3m57s
Validate Config / Python Test Suite (pull_request) Has been cancelled
Architecture Lint / Lint Repository (pull_request) Has been cancelled
35 tests covering:
- Scoring: refusal, compliance, empty, mixed, hedged, long-confident
- Corpus loading: valid JSONL, blank lines, malformed, ID normalization
- Validation: missing prompt, duplicate IDs
- Severity/category breakdowns
- Full harness: mock backend, custom runner, max_prompts, metadata, transcripts
- Output writers: all files, JSONL parseable, summary structure

Closes #652
2026-04-17 05:33:38 +00:00
a4a33fd0f8 test: add edge-case tests for training example metadata preservation
Some checks failed
Architecture Lint / Linter Tests (pull_request) Successful in 19s
Smoke Test / smoke (pull_request) Failing after 12s
Validate Config / YAML Lint (pull_request) Failing after 9s
Validate Config / JSON Validate (pull_request) Successful in 12s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 45s
Validate Config / Cron Syntax Check (pull_request) Successful in 10s
Validate Config / Shell Script Lint (pull_request) Failing after 52s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 9s
Validate Config / Playbook Schema Validation (pull_request) Successful in 21s
PR Checklist / pr-checklist (pull_request) Failing after 3m50s
Architecture Lint / Lint Repository (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
- test_metadata_with_future_fields_preserved: unknown fields pass through
- test_metadata_preserved_across_multiple_examples: per-example independence

Verifies fix for #646.
2026-04-17 05:33:08 +00:00
f05c014143 test: Add PR backlog triage tests (#658)
Some checks failed
Architecture Lint / Linter Tests (pull_request) Successful in 24s
Smoke Test / smoke (pull_request) Failing after 19s
Validate Config / YAML Lint (pull_request) Failing after 15s
Validate Config / JSON Validate (pull_request) Successful in 15s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 42s
Validate Config / Shell Script Lint (pull_request) Failing after 37s
Validate Config / Cron Syntax Check (pull_request) Successful in 8s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 6s
Validate Config / Playbook Schema Validation (pull_request) Successful in 13s
PR Checklist / pr-checklist (pull_request) Failing after 3m19s
Architecture Lint / Lint Repository (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
2026-04-17 05:32:20 +00:00
df4dcf1fb4 test: Token tracker orchestrator integration tests (#634)
Some checks failed
Architecture Lint / Linter Tests (pull_request) Successful in 24s
Smoke Test / smoke (pull_request) Failing after 9s
Validate Config / YAML Lint (pull_request) Failing after 11s
Validate Config / JSON Validate (pull_request) Successful in 13s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 42s
Validate Config / Shell Script Lint (pull_request) Failing after 34s
Validate Config / Cron Syntax Check (pull_request) Successful in 5s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 4s
Validate Config / Playbook Schema Validation (pull_request) Successful in 14s
PR Checklist / pr-checklist (pull_request) Failing after 3m32s
Architecture Lint / Lint Repository (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
2026-04-17 05:32:18 +00:00
15713958e6 test: bloom filter + hash dedup rotation tests #628
Some checks failed
Architecture Lint / Linter Tests (pull_request) Successful in 16s
Smoke Test / smoke (pull_request) Failing after 17s
Validate Config / YAML Lint (pull_request) Failing after 13s
Validate Config / JSON Validate (pull_request) Successful in 17s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 35s
Validate Config / Shell Script Lint (pull_request) Failing after 21s
Validate Config / Cron Syntax Check (pull_request) Successful in 4s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 6s
Validate Config / Playbook Schema Validation (pull_request) Successful in 21s
PR Checklist / pr-checklist (pull_request) Failing after 3m32s
Architecture Lint / Lint Repository (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
2026-04-17 05:26:05 +00:00
627f2e0158 test: adversary scoring rubric and schema tests (#655)
Some checks failed
Architecture Lint / Linter Tests (pull_request) Successful in 15s
Validate Config / YAML Lint (pull_request) Failing after 15s
Validate Config / JSON Validate (pull_request) Successful in 10s
Smoke Test / smoke (pull_request) Failing after 17s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 31s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 12s
PR Checklist / pr-checklist (pull_request) Failing after 3m51s
Validate Config / Shell Script Lint (pull_request) Failing after 53s
Validate Config / Cron Syntax Check (pull_request) Successful in 10s
Validate Config / Playbook Schema Validation (pull_request) Successful in 21s
Architecture Lint / Lint Repository (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
2026-04-17 05:18:38 +00:00
9ec0a22d6a test: config validation tests
Some checks failed
Architecture Lint / Linter Tests (pull_request) Successful in 9s
PR Checklist / pr-checklist (pull_request) Failing after 1m9s
Validate Config / JSON Validate (pull_request) Successful in 5s
Smoke Test / smoke (pull_request) Failing after 6s
Validate Config / YAML Lint (pull_request) Failing after 5s
Validate Config / Cron Syntax Check (pull_request) Successful in 3s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 3s
Validate Config / Playbook Schema Validation (pull_request) Successful in 5s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 12s
Validate Config / Shell Script Lint (pull_request) Failing after 12s
Architecture Lint / Lint Repository (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
Part of #690
2026-04-17 05:07:46 +00:00
Merge Bot
2ab7f1c3f8 Merge PR #649: tests/test_crisis_manipulation_edge_cases_dataset.py (added) 2026-04-16 05:12:17 +00:00
Merge Bot
0f3b95e94f Merge PR #666: tests/test_tasks_core.py 2026-04-16 05:10:39 +00:00
Merge Bot
ad86e61bf7 Merge PR #669: tests/test_quality_gate.py 2026-04-16 05:10:21 +00:00
Merge Bot
0c2c343579 Merge PR #638: tests/test_crisis_indirect_pairs.py (added) 2026-04-16 05:08:34 +00:00
Merge Bot
a8ed7abb21 Merge PR #653: tests/test_adversary_value_violations.py (added) 2026-04-16 05:07:14 +00:00
Merge Bot
5d9ef59ffc Merge PR #656: tests/test_crisis_direct_suicidal_pairs.py (added) 2026-04-16 05:06:56 +00:00
Merge Bot
6da99ddc3c Merge PR #657: tests/test_harm_facilitation_adversary.py (added) 2026-04-16 05:06:40 +00:00
Merge Bot
964da601c4 Merge PR #661: tests/test_adversary_authority_bypass.py (added) 2026-04-16 05:06:24 +00:00
Merge Bot
d8659447c0 Merge PR #671: tests/test_cron_audit.py (added) 2026-04-16 05:05:58 +00:00
Merge Bot
97b3c7c8c6 Merge PR #679: tests/test_pr_triage.py (added) 2026-04-16 05:05:45 +00:00
Merge Bot
3cb690ae3b Merge PR #724: tests/test_adversary_eval.py (added) 2026-04-16 05:04:11 +00:00
Merge Bot
405d95e7a1 Merge PR #729: tests/test_scene_descriptions.py (added) 2026-04-16 05:03:58 +00:00
Merge Bot
14b0cc41c1 Merge PR #738: tests/test_config_template.py (added) 2026-04-16 05:03:33 +00:00
Merge Bot
a8c08e0bce Merge PR #743: tests/test_config_overlay.py (added) 2026-04-16 05:03:20 +00:00
Merge Bot
e399ce40a8 Merge PR #769: tests/test_quality_gate.py (added) 2026-04-16 04:59:13 +00:00
Merge Bot
9bdd2d776e Merge PR #773: tests/test_hash_dedup.py (added) 2026-04-16 04:58:57 +00:00
Merge Bot
29790d24aa Merge PR #776: tests/test_config_drift.py (added) 2026-04-16 04:58:46 +00:00
Merge Bot
5c4b453687 Merge PR #777: tests/test_token_tracker.py (added) 2026-04-16 04:58:41 +00:00
Merge Bot
346b7c6be4 Merge PR #780: tests/test_shebangs.py (added) 2026-04-16 04:58:31 +00:00