timmy-config

Author	SHA1	Message	Date
Alexander Whitestone	b3a0adaf87	fix: JSON schema + validator for scene description training data (#647 ) Some checks failed Architecture Lint / Linter Tests (pull_request) Successful in 25s Details Smoke Test / smoke (pull_request) Failing after 17s Details Validate Config / YAML Lint (pull_request) Failing after 16s Details Validate Config / JSON Validate (pull_request) Successful in 18s Details Validate Config / Python Syntax & Import Check (pull_request) Failing after 45s Details Validate Config / Python Test Suite (pull_request) Has been skipped Details Validate Config / Shell Script Lint (pull_request) Failing after 56s Details Validate Config / Cron Syntax Check (pull_request) Successful in 12s Details Validate Config / Deploy Script Dry Run (pull_request) Successful in 12s Details PR Checklist / pr-checklist (pull_request) Failing after 4m12s Details Validate Config / Playbook Schema Validation (pull_request) Successful in 23s Details Validate Training Data / validate (pull_request) Successful in 18s Details Architecture Lint / Lint Repository (pull_request) Failing after 23s Details - Updated schema to support both full (genre+bpm+duration_seconds) and simplified (duration) formats across all 13 genre files - Added oneOf support for mood_arc (string or array) - Added camera_movement as alternate scene field (used in hiphop) - Validator catches: missing fields, wrong types, empty values, unexpected fields - All 1300 entries across 13 scene-descriptions-*.jsonl files pass - Auto-detects schema path, supports --schema flag Closes #647	2026-04-21 10:36:57 -04:00
Claude (Opus 4.6)	9f4a8733a8	Merge pull request 'feat: adversary execution harness for prompt corpora (#652 )' (#838 ) from fix/652 into main Some checks failed Smoke Test / smoke (push) Failing after 23s Details Architecture Lint / Linter Tests (push) Successful in 30s Details Validate Config / YAML Lint (push) Failing after 17s Details Validate Config / JSON Validate (push) Successful in 20s Details Validate Config / Python Syntax & Import Check (push) Failing after 1m0s Details Validate Config / Python Test Suite (push) Has been skipped Details Validate Config / Shell Script Lint (push) Failing after 1m7s Details Validate Config / Cron Syntax Check (push) Successful in 13s Details Validate Config / Deploy Script Dry Run (push) Successful in 13s Details Validate Config / Playbook Schema Validation (push) Successful in 24s Details Architecture Lint / Lint Repository (push) Has been cancelled Details	2026-04-21 11:26:39 +00:00
Claude (Opus 4.6)	bb309d8c30	Merge pull request 'feat: auto-generate scene descriptions from image/video assets (#689 )' (#839 ) from fix/689-scene-from-media into main Some checks failed Architecture Lint / Linter Tests (push) Has been cancelled Details Architecture Lint / Lint Repository (push) Has been cancelled Details Smoke Test / smoke (push) Has been cancelled Details Validate Config / YAML Lint (push) Has been cancelled Details Validate Config / JSON Validate (push) Has been cancelled Details Validate Config / Python Syntax & Import Check (push) Has been cancelled Details Validate Config / Python Test Suite (push) Has been cancelled Details Validate Config / Shell Script Lint (push) Has been cancelled Details Validate Config / Cron Syntax Check (push) Has been cancelled Details Validate Config / Deploy Script Dry Run (push) Has been cancelled Details Validate Config / Playbook Schema Validation (push) Has been cancelled Details	2026-04-21 11:26:23 +00:00
Alexander Whitestone	a2e61f6def	feat: auto-generate scene descriptions from image/video assets (#689 ) Some checks failed Architecture Lint / Linter Tests (pull_request) Successful in 21s Details Smoke Test / smoke (pull_request) Failing after 15s Details Validate Config / YAML Lint (pull_request) Failing after 18s Details Validate Config / JSON Validate (pull_request) Successful in 21s Details Validate Config / Python Syntax & Import Check (pull_request) Failing after 1m3s Details Validate Config / Python Test Suite (pull_request) Has been skipped Details Validate Config / Shell Script Lint (pull_request) Failing after 1m11s Details Validate Config / Cron Syntax Check (pull_request) Successful in 15s Details Validate Config / Deploy Script Dry Run (pull_request) Successful in 14s Details Validate Config / Playbook Schema Validation (pull_request) Successful in 27s Details PR Checklist / pr-checklist (pull_request) Failing after 12m35s Details Architecture Lint / Lint Repository (pull_request) Failing after 22s Details scripts/generate_scenes_from_media.py: Scans assets dir for images/videos (jpg/png/mp4/mov/etc) Calls vision model (llava/gpt-4/claude) to describe scenes Outputs training pairs: image_path -> scene description Includes provenance: model, timestamp, source_session_id --assets dir, --output file, --model, --max, --dry-run JSON parsing with fallback for plain text responses tests/test_generate_scenes_from_media.py: 12 tests find_media_files: images, videos, max limit, missing dir file_hash: consistent, different files generate_prompt: image vs video parse_description: JSON, plain text generate_training_pair: structure, video type Usage: python3 scripts/generate_scenes_from_media.py --assets ~/assets/ python3 scripts/generate_scenes_from_media.py --assets ~/assets/ --model gpt-4 python3 scripts/generate_scenes_from_media.py --assets ~/assets/ --dry-run	2026-04-21 07:22:28 -04:00
Alexander Whitestone	b3390d4fee	feat: adversary execution harness for prompt corpora (#652 ) Some checks failed Architecture Lint / Linter Tests (pull_request) Successful in 33s Details Smoke Test / smoke (pull_request) Failing after 20s Details Validate Config / YAML Lint (pull_request) Failing after 16s Details Validate Config / JSON Validate (pull_request) Successful in 19s Details Validate Config / Python Syntax & Import Check (pull_request) Failing after 1m33s Details Validate Config / Python Test Suite (pull_request) Has been skipped Details PR Checklist / pr-checklist (pull_request) Failing after 4m27s Details Validate Config / Cron Syntax Check (pull_request) Successful in 11s Details Validate Config / Shell Script Lint (pull_request) Failing after 1m41s Details Validate Config / Deploy Script Dry Run (pull_request) Successful in 9s Details Validate Config / Playbook Schema Validation (pull_request) Successful in 25s Details Architecture Lint / Lint Repository (pull_request) Failing after 15s Details	2026-04-21 11:22:24 +00:00
Alexander Whitestone	5ee2190aaa	feat: Enhance PR triage with auto-merge, file-as-issue, org-wide mode (#659 )	2026-04-21 11:16:05 +00:00
Alexander Whitestone	7cfc84637a	feat: Add pr-triage.sh wrapper (#659 )	2026-04-21 11:14:31 +00:00
Claude (Opus 4.6)	729db767d1	Merge pull request 'feat(#687 ): training data quality filter — remove low-quality pairs' (#830 ) from feat/687-quality-filter into main Some checks failed Smoke Test / smoke (push) Failing after 19s Details Architecture Lint / Linter Tests (push) Successful in 25s Details Validate Config / YAML Lint (push) Failing after 14s Details Validate Config / JSON Validate (push) Successful in 15s Details Validate Config / Python Syntax & Import Check (push) Failing after 41s Details Validate Config / Python Test Suite (push) Has been skipped Details Validate Config / Shell Script Lint (push) Failing after 46s Details Validate Config / Cron Syntax Check (push) Successful in 12s Details Validate Config / Deploy Script Dry Run (push) Successful in 10s Details Validate Config / Playbook Schema Validation (push) Successful in 20s Details Architecture Lint / Lint Repository (push) Failing after 14s Details	2026-04-20 23:40:40 +00:00
Claude (Opus 4.6)	d4dedd2c3d	Merge pull request 'feat: backfill provenance on all training data (#752 )' (#826 ) from fix/752-provenance-v2 into main Some checks failed Smoke Test / smoke (push) Has been cancelled Details Architecture Lint / Lint Repository (push) Has been cancelled Details Architecture Lint / Linter Tests (push) Has been cancelled Details Validate Config / YAML Lint (push) Has been cancelled Details Validate Config / JSON Validate (push) Has been cancelled Details Validate Config / Python Syntax & Import Check (push) Has been cancelled Details Validate Config / Python Test Suite (push) Has been cancelled Details Validate Config / Shell Script Lint (push) Has been cancelled Details Validate Config / Cron Syntax Check (push) Has been cancelled Details Validate Config / Deploy Script Dry Run (push) Has been cancelled Details Validate Config / Playbook Schema Validation (push) Has been cancelled Details	2026-04-20 23:40:37 +00:00
Alexander Whitestone	a0266c83a4	fix(#687 ): Add quality filter tests Some checks failed Smoke Test / smoke (pull_request) Failing after 15s Details Architecture Lint / Linter Tests (pull_request) Successful in 20s Details Validate Config / YAML Lint (pull_request) Failing after 13s Details Validate Config / JSON Validate (pull_request) Successful in 15s Details Validate Config / Python Syntax & Import Check (pull_request) Failing after 36s Details Validate Config / Python Test Suite (pull_request) Has been skipped Details Validate Config / Cron Syntax Check (pull_request) Successful in 10s Details Validate Config / Shell Script Lint (pull_request) Failing after 47s Details Validate Config / Deploy Script Dry Run (pull_request) Successful in 9s Details Validate Config / Playbook Schema Validation (pull_request) Successful in 20s Details Architecture Lint / Lint Repository (pull_request) Failing after 17s Details PR Checklist / pr-checklist (pull_request) Successful in 3m48s Details	2026-04-20 23:16:13 +00:00
Alexander Whitestone	b28071bb71	fix(#687 ): Training data quality filter - Score pairs on specificity, length ratio, code correctness - Composite weighted score (0.5 spec + 0.2 length + 0.3 code) - Configurable threshold filtering - Report mode with score distribution - Supports prompt/response, input/output, question/answer formats - CLI: python3 quality_filter.py input.jsonl -o output.jsonl --report	2026-04-20 23:15:48 +00:00
Alexander Whitestone	8e791afecc	feat: backfill provenance on all training data (#752 ) Some checks failed Architecture Lint / Linter Tests (pull_request) Successful in 21s Details Smoke Test / smoke (pull_request) Failing after 22s Details Validate Config / YAML Lint (pull_request) Failing after 16s Details Validate Config / JSON Validate (pull_request) Successful in 14s Details Validate Config / Python Syntax & Import Check (pull_request) Failing after 33s Details Validate Config / Cron Syntax Check (pull_request) Successful in 12s Details Validate Config / Deploy Script Dry Run (pull_request) Successful in 12s Details Validate Config / Shell Script Lint (pull_request) Failing after 54s Details Validate Config / Playbook Schema Validation (pull_request) Successful in 17s Details PR Checklist / pr-checklist (pull_request) Successful in 2m25s Details Architecture Lint / Lint Repository (pull_request) Has been cancelled Details Validate Config / Python Test Suite (pull_request) Has been cancelled Details scripts/backfill_training_provenance.py: Backfills provenance metadata on all JSONL training files Adds source_session_id, model, timestamp, source_type --dry-run mode, --json output, parse error handling Result: 11,007 pairs across 45 files now have provenance Coverage: 0% -> 100% Validation: python3 scripts/provenance_validate.py --threshold 50 PASS: 3800/3800 pairs have provenance Dashboard: python3 scripts/provenance_dashboard.py Shows pair count by model, source, coverage	2026-04-18 15:59:17 -04:00
Alexander Whitestone	edd35eaa4b	fix: restore pytest collection — fix 7 syntax/import errors (#823 ) Some checks failed Architecture Lint / Linter Tests (pull_request) Successful in 12s Details Smoke Test / smoke (pull_request) Failing after 19s Details Validate Config / YAML Lint (pull_request) Failing after 14s Details Validate Config / JSON Validate (pull_request) Successful in 13s Details Validate Config / Python Syntax & Import Check (pull_request) Failing after 52s Details Validate Config / Shell Script Lint (pull_request) Failing after 42s Details Validate Config / Cron Syntax Check (pull_request) Successful in 16s Details Validate Config / Deploy Script Dry Run (pull_request) Successful in 14s Details Validate Config / Playbook Schema Validation (pull_request) Successful in 18s Details PR Checklist / pr-checklist (pull_request) Successful in 3m4s Details Architecture Lint / Lint Repository (pull_request) Has been cancelled Details Validate Config / Python Test Suite (pull_request) Has been cancelled Details Fixed collection errors: scripts/adversary_schema.py: unterminated regex string (line 141) scripts/config_validate.py: unmatched ')' (line 87) scripts/pr_triage.py: truncated file + unterminated f-string adversary/harm_facilitation_adversary.py: 4 broken f-strings bin/glitch_patterns.py: missing get_threejs_patterns() export tests/test_glitch_detector.py: fixed THREEJS_CATEGORIES import tests/test_pr_triage.py: fixed function name imports training/training_pair_provenance.py: added ProvenanceTracker class scripts/validate_scene_data.py: symlink for import compatibility Result: python3 -m pytest --collect-only 911 tests collected, 0 collection errors (was: 769 collected / 7 errors)	2026-04-18 15:37:33 -04:00
Claude (Opus 4.6)	7c03c666d8	Merge pull request 'feat: 500 dream description prompt enhancement pairs — scene/crisis/music data' (#821,#820,#819,#799) from fix/602 into main Resolves add/add conflicts with already-merged files (authority_bypass_200.jsonl, identity_attacks_200.jsonl, quality_filter.py) by keeping main's versions. Closes #602, #645, #689, #599	2026-04-17 02:37:00 -04:00
Claude (Opus 4.6)	2c49cac144	Merge pull request 'fix(#662 ): cron fleet audit — crontab parsing, tests, CI validation' (#814 ) from burn/662-cron-audit-fix into main	2026-04-17 02:32:44 -04:00
Claude (Opus 4.6)	06bebc0ca3	Merge pull request 'feat: adversary execution harness for prompt corpora' (#811 ) from fix/652-adversary-harness into main	2026-04-17 02:32:33 -04:00
Claude (Opus 4.6)	b2246e0dcc	Merge pull request 'feat: PR backlog triage script — categorize, find duplicates, detect stale refs' (#810 ) from burn/658-pr-backlog-triage into main	2026-04-17 02:32:30 -04:00
Claude (Opus 4.6)	39d1e1d7ce	Merge pull request 'fix: pipeline_state.json daily reset' (#805 ) from fix/650-pipeline-daily-reset-v2 into main	2026-04-17 02:32:18 -04:00
Claude (Opus 4.6)	f57c21fda9	Merge pull request 'fix: training data code block indentation — normalize open_tag whitespace' (#809 ) from fix/750-code-block-indentation into main	2026-04-17 02:32:14 -04:00
Claude (Opus 4.6)	65a400f3ed	Merge pull request 'feat: shared adversary scoring rubric and transcript schema (closes #655 )' (#802 ) from feat/655-adversary-scoring-rubric into main	2026-04-17 06:19:01 +00:00
Alexander Whitestone	d278d7f5d5	fix(#662 ): cron fleet audit — crontab parsing, tests, CI validation Some checks failed Architecture Lint / Linter Tests (pull_request) Successful in 24s Details Smoke Test / smoke (pull_request) Failing after 14s Details Validate Config / YAML Lint (pull_request) Failing after 14s Details Validate Config / JSON Validate (pull_request) Successful in 16s Details Validate Config / Python Syntax & Import Check (pull_request) Failing after 46s Details Validate Config / Cron Syntax Check (pull_request) Successful in 8s Details Validate Config / Deploy Script Dry Run (pull_request) Successful in 7s Details Validate Config / Shell Script Lint (pull_request) Failing after 44s Details Validate Config / Playbook Schema Validation (pull_request) Successful in 22s Details PR Checklist / pr-checklist (pull_request) Failing after 3m55s Details Architecture Lint / Lint Repository (pull_request) Has been cancelled Details Validate Config / Python Test Suite (pull_request) Has been cancelled Details - Added VPS crontab backup parsing to cron-audit-662.py - New audit_fleet() combines hermes cron + VPS crontabs - load_crontab_backups() reads cron/vps/*-crontab-backup.txt - 20+ tests: crontab parsing, job categorization, fleet audit, timestamp parsing, backup loading - ci-cron-validate.py: CI gate that fails on systemic failures - Fresh audit report generated in cron/audit-report.json Closes #662	2026-04-17 01:34:45 -04:00
Alexander Whitestone	c633afd66d	fix: add underscore module version for test imports (#750 )	2026-04-17 05:33:26 +00:00
Alexander Whitestone	c69ae0e72b	fix: normalize open_tag whitespace in code block parser (#750 )	2026-04-17 05:33:24 +00:00
Alexander Whitestone	f094b0d5b5	feat: Add PR backlog triage script — categorize, duplicates, stale detection (#658 )	2026-04-17 05:32:19 +00:00
Alexander Whitestone	42ff05aeec	feat: adversary execution harness for prompt corpora (#652 ) Reusable harness for replaying JSONL corpora against live agents. Supports Ollama, hermes, and mock backends. Captures transcripts, scores responses, auto-files P0 issues. Closes #652	2026-04-17 05:31:27 +00:00
Alexander Whitestone	acba760731	fix: reset_stale_states delegates to standalone script (closes #650 ) Some checks failed Validate Config / Playbook Schema Validation (pull_request) Successful in 14s Details Architecture Lint / Linter Tests (pull_request) Successful in 26s Details PR Checklist / pr-checklist (pull_request) Failing after 25m6s Details Smoke Test / smoke (pull_request) Failing after 12s Details Validate Config / YAML Lint (pull_request) Failing after 8s Details Validate Config / Python Syntax & Import Check (pull_request) Failing after 35s Details Validate Config / JSON Validate (pull_request) Successful in 13s Details Validate Config / Cron Syntax Check (pull_request) Successful in 8s Details Validate Config / Deploy Script Dry Run (pull_request) Successful in 6s Details Validate Config / Shell Script Lint (pull_request) Failing after 34s Details Architecture Lint / Lint Repository (pull_request) Has been cancelled Details Validate Config / Python Test Suite (pull_request) Has been cancelled Details	2026-04-17 05:26:06 +00:00
Alexander Whitestone	34ade6fc0e	fix: pipeline state daily reset (closes #650 )	2026-04-17 05:24:14 +00:00
Alexander Whitestone	c5270d76e0	fix: pipeline state daily reset (closes #650 )	2026-04-17 05:24:12 +00:00
Alexander Whitestone	38a4a73a67	feat: shared adversary scoring rubric and transcript schema (#655 )	2026-04-17 05:17:29 +00:00
Alexander Whitestone	6b984532a1	feat: config validation script Closes #690 Validates YAML syntax, required keys, value types, and forbidden keys before deploy. Prevents broken deploys from bad config.	2026-04-17 05:07:44 +00:00
Alexander Whitestone	f169634a75	feat: config drift detection across all fleet nodes (#686 ) Some checks failed PR Checklist / pr-checklist (pull_request) Has been cancelled Details Architecture Lint / Linter Tests (pull_request) Has been cancelled Details Architecture Lint / Lint Repository (pull_request) Has been cancelled Details Smoke Test / smoke (pull_request) Has been cancelled Details Validate Config / YAML Lint (pull_request) Has been cancelled Details Validate Config / JSON Validate (pull_request) Has been cancelled Details Validate Config / Python Syntax & Import Check (pull_request) Has been cancelled Details Validate Config / Python Test Suite (pull_request) Has been cancelled Details Validate Config / Shell Script Lint (pull_request) Has been cancelled Details Validate Config / Cron Syntax Check (pull_request) Has been cancelled Details Validate Config / Deploy Script Dry Run (pull_request) Has been cancelled Details Validate Config / Playbook Schema Validation (pull_request) Has been cancelled Details Validate Training Data / validate (pull_request) Has been cancelled Details Detect config drift between fleet nodes and canonical timmy-config. scripts/config_drift_detector.py (200 lines): - SSH-based config collection from all nodes - Recursive diff against canonical config - Report: which keys differ, on which nodes - JSON output for programmatic consumption Fleet nodes: local, ezra (143.198.27.163), bezalel (167.99.126.228) Usage: python3 scripts/config_drift_detector.py --report python3 scripts/config_drift_detector.py --json Closes #686	2026-04-16 01:33:57 -04:00
Merge Bot	11e476e79e	Merge PR #633 : scripts/token-tracker.py	2026-04-16 05:11:23 +00:00
Merge Bot	5ac19b27ee	Merge PR #665 : scripts/pr_triage.py	2026-04-16 05:10:46 +00:00
Merge Bot	7c16ddb741	Merge PR #712 : scripts/nightly-pipeline-scheduler.sh (changed)	2026-04-16 05:09:54 +00:00
Merge Bot	4642c8b3b1	Merge PR #656 : scripts/generate-crisis-direct-suicidal-pairs.py (added)	2026-04-16 05:06:47 +00:00
Merge Bot	7ee587b9f4	Merge PR #667 : scripts/validate-scene-data.py (added)	2026-04-16 05:06:10 +00:00
Merge Bot	720516d452	Merge PR #671 : scripts/cron-audit-662.py (added)	2026-04-16 05:05:56 +00:00
Merge Bot	8bc6e4e5f0	Merge PR #679 : scripts/pr_triage.py (added)	2026-04-16 05:05:44 +00:00
Merge Bot	17adc703f8	Merge PR #729 : scripts/generate_scene_descriptions.py (added)	2026-04-16 05:03:55 +00:00
Merge Bot	4b891f8f46	Merge PR #738 : scripts/config_template.py (added)	2026-04-16 05:03:30 +00:00
Merge Bot	1a362637c9	Merge PR #763 : scripts/pr-backlog-triage.py (added)	2026-04-16 04:59:59 +00:00
Merge Bot	6b7d219a29	Merge PR #768 : scripts/token_budget.py (added)	2026-04-16 04:59:16 +00:00
Merge Bot	318eaefb81	Merge PR #771 : scripts/quality_gate_integration.py (added)	2026-04-16 04:59:01 +00:00
Merge Bot	d76182c654	Merge PR #772 : scripts/cron_audit.py (added)	2026-04-16 04:58:59 +00:00
Merge Bot	8c5b82e214	Merge PR #773 : scripts/hash_dedup.py (added)	2026-04-16 04:58:55 +00:00
Merge Bot	297363a141	Merge PR #775 : scripts/pr-triage-automation.py (added)	2026-04-16 04:58:49 +00:00
Merge Bot	7f121d5591	Merge PR #776 : scripts/config_drift.py (added)	2026-04-16 04:58:44 +00:00
Merge Bot	218b6dcb33	Merge PR #777 : scripts/token_tracker.py (added)	2026-04-16 04:58:40 +00:00
Merge Bot	636e32e467	Merge PR #783 : scripts/normalize-code-blocks.py (added)	2026-04-16 04:58:23 +00:00
Alexander Whitestone	d120526244	fix: add python3 shebang to scripts/visual_pr_reviewer.py (#681 )	2026-04-15 02:57:53 +00:00

1 2 3

119 Commits