1183fb5f2b
Merge pull request 'feat: scene data validator tests + CI path fix' ( #813 ) from feat/647-scene-data-validator into main
2026-04-17 02:32:40 -04:00
7ce0016207
Merge pull request 'test: verify training example metadata preservation' ( #812 ) from fix/646-metadata-preservation into main
2026-04-17 02:32:37 -04:00
06bebc0ca3
Merge pull request 'feat: adversary execution harness for prompt corpora' ( #811 ) from fix/652-adversary-harness into main
2026-04-17 02:32:33 -04:00
b2246e0dcc
Merge pull request 'feat: PR backlog triage script — categorize, find duplicates, detect stale refs' ( #810 ) from burn/658-pr-backlog-triage into main
2026-04-17 02:32:30 -04:00
87ee28aa42
Merge pull request 'feat: Token tracker integrated with orchestrator — auto-logging on task completion' ( #808 ) from fix/634-token-tracker-orchestrator into main
2026-04-17 02:32:27 -04:00
89413d00d3
Merge pull request 'fix: hash dedup rotation + bloom filter — bounded memory ( #628 )' ( #804 ) from burn/621-shared-orchestrator-1776402806 into main
2026-04-17 06:19:03 +00:00
65a400f3ed
Merge pull request 'feat: shared adversary scoring rubric and transcript schema ( closes #655 )' ( #802 ) from feat/655-adversary-scoring-rubric into main
2026-04-17 06:19:01 +00:00
40b68a13cb
test: scene data validator tests ( #647 )
2026-04-17 05:34:36 +00:00
50e1d63c45
test: adversary execution harness ( #652 )
...
Architecture Lint / Linter Tests (pull_request) Successful in 14s
Smoke Test / smoke (pull_request) Failing after 16s
Validate Config / YAML Lint (pull_request) Failing after 19s
Validate Config / JSON Validate (pull_request) Successful in 20s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 45s
Validate Config / Cron Syntax Check (pull_request) Successful in 11s
Validate Config / Shell Script Lint (pull_request) Failing after 53s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 10s
Validate Config / Playbook Schema Validation (pull_request) Successful in 18s
PR Checklist / pr-checklist (pull_request) Failing after 3m57s
Validate Config / Python Test Suite (pull_request) Has been cancelled
Architecture Lint / Lint Repository (pull_request) Has been cancelled
35 tests covering:
- Scoring: refusal, compliance, empty, mixed, hedged, long-confident
- Corpus loading: valid JSONL, blank lines, malformed, ID normalization
- Validation: missing prompt, duplicate IDs
- Severity/category breakdowns
- Full harness: mock backend, custom runner, max_prompts, metadata, transcripts
- Output writers: all files, JSONL parseable, summary structure
Closes #652
2026-04-17 05:33:38 +00:00
a4a33fd0f8
test: add edge-case tests for training example metadata preservation
...
Architecture Lint / Linter Tests (pull_request) Successful in 19s
Smoke Test / smoke (pull_request) Failing after 12s
Validate Config / YAML Lint (pull_request) Failing after 9s
Validate Config / JSON Validate (pull_request) Successful in 12s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 45s
Validate Config / Cron Syntax Check (pull_request) Successful in 10s
Validate Config / Shell Script Lint (pull_request) Failing after 52s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 9s
Validate Config / Playbook Schema Validation (pull_request) Successful in 21s
PR Checklist / pr-checklist (pull_request) Failing after 3m50s
Architecture Lint / Lint Repository (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
- test_metadata_with_future_fields_preserved: unknown fields pass through
- test_metadata_preserved_across_multiple_examples: per-example independence
Verifies fix for #646 .
2026-04-17 05:33:08 +00:00
f05c014143
test: Add PR backlog triage tests ( #658 )
Architecture Lint / Linter Tests (pull_request) Successful in 24s
Smoke Test / smoke (pull_request) Failing after 19s
Validate Config / YAML Lint (pull_request) Failing after 15s
Validate Config / JSON Validate (pull_request) Successful in 15s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 42s
Validate Config / Shell Script Lint (pull_request) Failing after 37s
Validate Config / Cron Syntax Check (pull_request) Successful in 8s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 6s
Validate Config / Playbook Schema Validation (pull_request) Successful in 13s
PR Checklist / pr-checklist (pull_request) Failing after 3m19s
Architecture Lint / Lint Repository (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
2026-04-17 05:32:20 +00:00
df4dcf1fb4
test: Token tracker orchestrator integration tests ( #634 )
Architecture Lint / Linter Tests (pull_request) Successful in 24s
Smoke Test / smoke (pull_request) Failing after 9s
Validate Config / YAML Lint (pull_request) Failing after 11s
Validate Config / JSON Validate (pull_request) Successful in 13s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 42s
Validate Config / Shell Script Lint (pull_request) Failing after 34s
Validate Config / Cron Syntax Check (pull_request) Successful in 5s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 4s
Validate Config / Playbook Schema Validation (pull_request) Successful in 14s
PR Checklist / pr-checklist (pull_request) Failing after 3m32s
Architecture Lint / Lint Repository (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
2026-04-17 05:32:18 +00:00
15713958e6
test: bloom filter + hash dedup rotation tests #628
Architecture Lint / Linter Tests (pull_request) Successful in 16s
Smoke Test / smoke (pull_request) Failing after 17s
Validate Config / YAML Lint (pull_request) Failing after 13s
Validate Config / JSON Validate (pull_request) Successful in 17s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 35s
Validate Config / Shell Script Lint (pull_request) Failing after 21s
Validate Config / Cron Syntax Check (pull_request) Successful in 4s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 6s
Validate Config / Playbook Schema Validation (pull_request) Successful in 21s
PR Checklist / pr-checklist (pull_request) Failing after 3m32s
Architecture Lint / Lint Repository (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
2026-04-17 05:26:05 +00:00
627f2e0158
test: adversary scoring rubric and schema tests ( #655 )
Architecture Lint / Linter Tests (pull_request) Successful in 15s
Validate Config / YAML Lint (pull_request) Failing after 15s
Validate Config / JSON Validate (pull_request) Successful in 10s
Smoke Test / smoke (pull_request) Failing after 17s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 31s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 12s
PR Checklist / pr-checklist (pull_request) Failing after 3m51s
Validate Config / Shell Script Lint (pull_request) Failing after 53s
Validate Config / Cron Syntax Check (pull_request) Successful in 10s
Validate Config / Playbook Schema Validation (pull_request) Successful in 21s
Architecture Lint / Lint Repository (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
2026-04-17 05:18:38 +00:00
9ec0a22d6a
test: config validation tests
...
Architecture Lint / Linter Tests (pull_request) Successful in 9s
PR Checklist / pr-checklist (pull_request) Failing after 1m9s
Validate Config / JSON Validate (pull_request) Successful in 5s
Smoke Test / smoke (pull_request) Failing after 6s
Validate Config / YAML Lint (pull_request) Failing after 5s
Validate Config / Cron Syntax Check (pull_request) Successful in 3s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 3s
Validate Config / Playbook Schema Validation (pull_request) Successful in 5s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 12s
Validate Config / Shell Script Lint (pull_request) Failing after 12s
Architecture Lint / Lint Repository (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
Part of #690
2026-04-17 05:07:46 +00:00
Merge Bot
2ab7f1c3f8
Merge PR #649 : tests/test_crisis_manipulation_edge_cases_dataset.py (added)
2026-04-16 05:12:17 +00:00
Merge Bot
0f3b95e94f
Merge PR #666 : tests/test_tasks_core.py
2026-04-16 05:10:39 +00:00
Merge Bot
ad86e61bf7
Merge PR #669 : tests/test_quality_gate.py
2026-04-16 05:10:21 +00:00
Merge Bot
0c2c343579
Merge PR #638 : tests/test_crisis_indirect_pairs.py (added)
2026-04-16 05:08:34 +00:00
Merge Bot
a8ed7abb21
Merge PR #653 : tests/test_adversary_value_violations.py (added)
2026-04-16 05:07:14 +00:00
Merge Bot
5d9ef59ffc
Merge PR #656 : tests/test_crisis_direct_suicidal_pairs.py (added)
2026-04-16 05:06:56 +00:00
Merge Bot
6da99ddc3c
Merge PR #657 : tests/test_harm_facilitation_adversary.py (added)
2026-04-16 05:06:40 +00:00
Merge Bot
964da601c4
Merge PR #661 : tests/test_adversary_authority_bypass.py (added)
2026-04-16 05:06:24 +00:00
Merge Bot
d8659447c0
Merge PR #671 : tests/test_cron_audit.py (added)
2026-04-16 05:05:58 +00:00
Merge Bot
97b3c7c8c6
Merge PR #679 : tests/test_pr_triage.py (added)
2026-04-16 05:05:45 +00:00
Merge Bot
3cb690ae3b
Merge PR #724 : tests/test_adversary_eval.py (added)
2026-04-16 05:04:11 +00:00
Merge Bot
405d95e7a1
Merge PR #729 : tests/test_scene_descriptions.py (added)
2026-04-16 05:03:58 +00:00
Merge Bot
14b0cc41c1
Merge PR #738 : tests/test_config_template.py (added)
2026-04-16 05:03:33 +00:00
Merge Bot
a8c08e0bce
Merge PR #743 : tests/test_config_overlay.py (added)
2026-04-16 05:03:20 +00:00
Merge Bot
e399ce40a8
Merge PR #769 : tests/test_quality_gate.py (added)
2026-04-16 04:59:13 +00:00
Merge Bot
9bdd2d776e
Merge PR #773 : tests/test_hash_dedup.py (added)
2026-04-16 04:58:57 +00:00
Merge Bot
29790d24aa
Merge PR #776 : tests/test_config_drift.py (added)
2026-04-16 04:58:46 +00:00
Merge Bot
5c4b453687
Merge PR #777 : tests/test_token_tracker.py (added)
2026-04-16 04:58:41 +00:00
Merge Bot
346b7c6be4
Merge PR #780 : tests/test_shebangs.py (added)
2026-04-16 04:58:31 +00:00
Merge Bot
291cd9e59c
Merge PR #782 : tests/test_no_placeholders.py (added)
2026-04-16 04:58:26 +00:00
Merge Bot
a0b2b551c9
Merge PR #783 : tests/test_normalize_code_blocks.py (added)
2026-04-16 04:58:24 +00:00
1088bf8983
test: add Three.js pattern tests and update assertions ( #543 )
...
Architecture Lint / Linter Tests (pull_request) Successful in 28s
Smoke Test / smoke (pull_request) Failing after 23s
Validate Config / YAML Lint (pull_request) Failing after 21s
Validate Config / JSON Validate (pull_request) Successful in 21s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 1m23s
Validate Config / Shell Script Lint (pull_request) Failing after 50s
Validate Config / Cron Syntax Check (pull_request) Successful in 11s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 26s
Validate Config / Playbook Schema Validation (pull_request) Successful in 32s
PR Checklist / pr-checklist (pull_request) Failing after 11m13s
Architecture Lint / Lint Repository (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
- Added TestThreeJsPatterns class with 14 tests
- Tests cover: pattern existence, severity inference, vision prompt
- Updated pattern count assertion (14+ patterns now)
- Updated demo test (6 glitches: 4 original + 2 Three.js)
2026-04-15 05:37:17 +00:00
04dbf772b1
feat: Visual Smoke Test for The Nexus #490 ( #558 )
...
Architecture Lint / Linter Tests (push) Successful in 15s
Smoke Test / smoke (push) Failing after 14s
Validate Config / YAML Lint (push) Failing after 13s
Validate Config / JSON Validate (push) Successful in 13s
Validate Config / Shell Script Lint (push) Failing after 40s
Validate Config / Python Syntax & Import Check (push) Failing after 58s
Validate Config / Python Test Suite (push) Has been skipped
Validate Config / Cron Syntax Check (push) Successful in 11s
Validate Config / Deploy Script Dry Run (push) Successful in 11s
Validate Config / Playbook Schema Validation (push) Successful in 18s
Architecture Lint / Lint Repository (push) Failing after 13s
Architecture Lint / Linter Tests (pull_request) Successful in 26s
Smoke Test / smoke (pull_request) Failing after 17s
Validate Config / YAML Lint (pull_request) Failing after 12s
Validate Config / JSON Validate (pull_request) Successful in 12s
PR Checklist / pr-checklist (pull_request) Failing after 3m36s
Validate Config / Shell Script Lint (pull_request) Failing after 40s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 1m4s
Validate Config / Python Test Suite (pull_request) Has been skipped
Validate Config / Cron Syntax Check (pull_request) Successful in 9s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 9s
Validate Config / Playbook Schema Validation (pull_request) Successful in 20s
Architecture Lint / Lint Repository (pull_request) Failing after 16s
Merge PR #558
2026-04-14 22:18:23 +00:00
9651a56308
feat: Glitch Detector HTML Report with annotated screenshots #544 ( #567 )
...
Architecture Lint / Lint Repository (push) Has been cancelled
Architecture Lint / Linter Tests (push) Has been cancelled
Smoke Test / smoke (push) Has been cancelled
Validate Config / JSON Validate (push) Has been cancelled
Validate Config / Python Syntax & Import Check (push) Has been cancelled
Validate Config / Python Test Suite (push) Has been cancelled
Validate Config / Shell Script Lint (push) Has been cancelled
Validate Config / Cron Syntax Check (push) Has been cancelled
Validate Config / Deploy Script Dry Run (push) Has been cancelled
Validate Config / Playbook Schema Validation (push) Has been cancelled
Validate Config / YAML Lint (push) Has been cancelled
Merge PR #567
2026-04-14 22:17:32 +00:00
31313c421e
feat: 3D World Glitch Detection in The Matrix ( #491 ) ( #535 )
...
Architecture Lint / Linter Tests (push) Has been cancelled
Architecture Lint / Lint Repository (push) Has been cancelled
Smoke Test / smoke (push) Has been cancelled
Validate Config / JSON Validate (push) Has been cancelled
Validate Config / Python Syntax & Import Check (push) Has been cancelled
Validate Config / Python Test Suite (push) Has been cancelled
Validate Config / Shell Script Lint (push) Has been cancelled
Validate Config / Cron Syntax Check (push) Has been cancelled
Validate Config / Deploy Script Dry Run (push) Has been cancelled
Validate Config / Playbook Schema Validation (push) Has been cancelled
Validate Config / YAML Lint (push) Has been cancelled
Merge PR #535
2026-04-14 22:17:06 +00:00
063572ed1e
feat: Visual Accessibility Audit of Foundation Web #492 ( #531 )
...
Architecture Lint / Lint Repository (push) Has been cancelled
Architecture Lint / Linter Tests (push) Has been cancelled
Smoke Test / smoke (push) Has been cancelled
Validate Config / Python Syntax & Import Check (push) Has been cancelled
Validate Config / Python Test Suite (push) Has been cancelled
Validate Config / Shell Script Lint (push) Has been cancelled
Validate Config / Cron Syntax Check (push) Has been cancelled
Validate Config / Deploy Script Dry Run (push) Has been cancelled
Validate Config / Playbook Schema Validation (push) Has been cancelled
Validate Config / YAML Lint (push) Has been cancelled
Validate Config / JSON Validate (push) Has been cancelled
Merge PR #531
2026-04-14 22:16:58 +00:00
Alexander Whitestone
c09e54de72
feat: Visual Mapping of Tower Architecture — holographic map #494
...
Architecture Lint / Linter Tests (pull_request) Successful in 23s
Smoke Test / smoke (pull_request) Failing after 19s
Validate Config / YAML Lint (pull_request) Failing after 20s
Validate Config / JSON Validate (pull_request) Successful in 19s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 22s
Validate Config / Python Test Suite (pull_request) Has been skipped
Validate Config / Shell Script Lint (pull_request) Failing after 41s
Validate Config / Cron Syntax Check (pull_request) Successful in 13s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 9s
PR Checklist / pr-checklist (pull_request) Successful in 2m49s
Validate Config / Playbook Schema Validation (pull_request) Successful in 14s
Architecture Lint / Lint Repository (pull_request) Failing after 13s
Replaces 12-line stub with full Tower architecture mapper. Scans
design docs, gallery images, Evennia specs, and wizard configs to
construct a structured holographic map of The Tower.
The Tower is the persistent MUD world of the Timmy Foundation — an
Evennia-based space where rooms represent context, objects represent
facts, and NPCs represent procedures (the Memory Palace metaphor).
Sources scanned:
- grok-imagine-gallery/INDEX.md (24 gallery images → rooms)
- docs/MEMORY_ARCHITECTURE.md (Memory Palace L0-L5 layers)
- docs/*.md (design doc room/floor references)
- wizards/*/ (wizard configs → NPC definitions)
- Optional: Gemma 3 vision analysis of Tower images
Output formats:
- JSON: machine-readable with rooms, floors, NPCs, connections
- ASCII: human-readable holographic map with floor layout
Mapped: 5 floors, 20+ rooms, 6 NPCs (the fellowship).
Tests: 14/14 passing.
Closes #494
2026-04-13 20:21:07 -04:00
24887b615f
feat: add AST-backed Python ingestion to knowledge base
Architecture Lint / Linter Tests (pull_request) Successful in 9s
PR Checklist / pr-checklist (pull_request) Failing after 1m18s
Smoke Test / smoke (pull_request) Failing after 6s
Validate Config / YAML Lint (pull_request) Failing after 6s
Validate Config / JSON Validate (pull_request) Successful in 7s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 6s
Validate Config / Python Test Suite (pull_request) Has been skipped
Validate Config / Shell Script Lint (pull_request) Failing after 14s
Validate Config / Cron Syntax Check (pull_request) Successful in 5s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 5s
Validate Config / Playbook Schema Validation (pull_request) Successful in 7s
Architecture Lint / Lint Repository (pull_request) Failing after 6s
2026-04-13 04:09:57 +00:00
Alexander Whitestone
b49a0abf39
fix: restore self-healing runtime checks
Architecture Lint / Linter Tests (pull_request) Successful in 9s
PR Checklist / pr-checklist (pull_request) Failing after 1m17s
Smoke Test / smoke (pull_request) Failing after 7s
Validate Config / YAML Lint (pull_request) Failing after 6s
Validate Config / JSON Validate (pull_request) Successful in 6s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 8s
Validate Config / Shell Script Lint (pull_request) Successful in 15s
Validate Config / Cron Syntax Check (pull_request) Successful in 5s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 5s
Validate Config / Playbook Schema Validation (pull_request) Successful in 8s
Architecture Lint / Lint Repository (pull_request) Failing after 8s
2026-04-12 10:53:55 -04:00
Alexander Whitestone
19aa0830f4
Harden Gemini scripts with verified SSH trust
Architecture Lint / Linter Tests (pull_request) Successful in 8s
Validate Config / YAML Lint (pull_request) Failing after 5s
Validate Config / JSON Validate (pull_request) Successful in 6s
PR Checklist / pr-checklist (pull_request) Failing after 1m11s
Smoke Test / smoke (pull_request) Failing after 7s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 9s
Validate Config / Shell Script Lint (pull_request) Successful in 15s
Validate Config / Cron Syntax Check (pull_request) Successful in 5s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 5s
Validate Config / Playbook Schema Validation (pull_request) Successful in 7s
Architecture Lint / Lint Repository (pull_request) Failing after 6s
2026-04-11 15:13:15 -04:00
Alexander Whitestone
06031d923f
test(scripts): lock self_healing safe CLI behavior ( #435 )
Architecture Lint / Linter Tests (pull_request) Successful in 8s
PR Checklist / pr-checklist (pull_request) Failing after 1m18s
Architecture Lint / Lint Repository (pull_request) Failing after 8s
2026-04-10 21:11:47 -04:00
timmy-bot
ee025957d9
fix: architecture_linter_v2 — repo-aware, test-backed, CI-enforced ( #437 )
...
Architecture Lint / Linter Tests (pull_request) Successful in 28s
PR Checklist / pr-checklist (pull_request) Successful in 4m25s
Architecture Lint / Lint Repository (pull_request) Failing after 21s
- Fix broken API_KEY_REGEX in linter_v2.py (was invalid regex causing runtime crash)
- Fix syntax error in architecture_linter.py (malformed character class)
- Add --repo flag and --json output to linter_v2
- Add LinterResult class for structured programmatic access
- Port v1 sovereignty rules (cloud API endpoint/provider checks) into v2
- Skip .git, node_modules, __pycache__ dirs; skip .env.example files
- Add tests/test_linter.py (19 tests covering all checks)
- Add .gitea/workflows/architecture-lint.yml for CI enforcement
- All files pass python3 -m py_compile
Refs: #437
2026-04-09 19:29:33 -04:00
Ezra
2e64b160b5
[KAIZEN] Harden retro scheduling, chunking, and tests ( #349 )
...
- Add Kaizen Retro to cron/jobs.json with explicit local model/provider
- Add Telegram message chunking for reports approaching the 4096-char limit
- Fix classify_issue_type false positives on short substrings (ci in cleanup)
- Add 28 unit tests covering classification, max-attempts detection,
suggestion generation, report formatting, and Telegram chunking
2026-04-07 15:58:58 +00:00
57b87c525d
Merge pull request '[soul] The Conscience of the Training Pipeline — SOUL.md eval gate' ( #104 ) from gemini/soul-eval-gate into main
2026-03-31 19:09:11 +00:00
88e2509e18
Merge pull request '[sovereignty] Cut the Cloud Umbilical — closes #94 ' ( #107 ) from gemini/operational-hygiene into main
2026-03-31 19:06:38 +00:00