docs: verify issue #600 visual scenes dataset is present on main

Add regression test confirming visual-scenes-500.jsonl satisfies issue #600: - 500 valid JSONL records - Required fields (terse, rich, domain) all present and non-empty - Domain equals "visual scenes" for every record - Full-record uniqueness This closes the loop on Training Factory Worker 1/6 (visual scenes). The dataset was originally added via PR #731 (merged to main). Closes #600.
fix: [CONTRACTION] Skills and memory hygiene pass — collapse duplicates (#881 ) (#958 )
2026-04-29 23:36:36 -04:00 · 2026-04-29 12:09:54 +00:00
4 changed files with 131 additions and 10 deletions
--- a/docs/issue-600-verification.md
+++ b/docs/issue-600-verification.md
@@ -0,0 +1,28 @@
+# Issue #600 Verification: Visual Scenes Prompt Enhancement
+
+**Status:** ✅ Complete — dataset present on main  
+**Issue:** [Timmy_Foundation/timmy-config#600](https://forge.alexanderwhitestone.com/Timmy_Foundation/timmy-config/issues/600)  
+**Dataset:** `training/data/prompt-enhancement/visual-scenes-500.jsonl`  
+**Records:** 500  
+**Domain:** `visual scenes` (all records)
+
+## Validation
+
+| Check | Result |
+|-------|--------|
+| File exists | ✅ |
+| 500 JSONL records | ✅ |
+| Valid JSON per line | ✅ |
+| Required fields (terse, rich, domain) | ✅ |
+| Domain equals "visual scenes" | ✅ |
+| Non-empty terse and rich strings | ✅ |
+| Full-record uniqueness | ✅ |
+
+## Notes
+
+- 65 terse prompts appear more than once (different rich expansions for same terse). The dataset contract specifies unique *pairs*, not unique terse prompts. Acceptable.
+- File added via PR #731: `feat: 500 visual scene prompt enhancement pairs (#600)`. Merged to main.
+
+## Files Added in This PR
+
+- `tests/test_prompt_enhancement_visual_scenes.py` — regression test validating the dataset meets issue requirements.
--- a/memories/MEMORY.md
+++ b/memories/MEMORY.md
@@ -1,15 +1,15 @@
-Gitea (forge.alexanderwhitestone.com): token=~/.hermes/gitea_token_vps (Timmy id=2). Users: rockachopa(1,admin), hermes(4), kimi(5), claude(11), gemini(12), groq(13), grok(14), manus(3), perplexity(7). AutoLoRA: weights CLOSED. MLX=training, GGUF=inference. CI testbed: 67.205.155.108 (act_runner). VPS=2CPU/3.8GB, never run CI there.
+Gitea (forge.alexanderwhitestone.com): Agent token=~/.config/gitea/timmy-token (Timmy id=2), Human token=~/.config/gitea/token (Alexander id=1). Users: rockachopa(1,admin), hermes(4), kimi(5), claude(11), gemini(12), groq(13), grok(14), manus(3), perplexity(7). AutoLoRA: weights CLOSED. MLX=training, GGUF=inference. CI testbed: 67.205.155.108 (act_runner). VPS=2CPU/3.8GB, never run CI there.
 §
 2026-03-19 HARNESS+SOUL: ~/.timmy is Timmy's workspace within the Hermes harness. They share the space — Hermes is the operational harness (tools, routing, loops), Timmy is the soul (SOUL.md, presence, identity). Not fusion/absorption. Principal's words: "build Timmy out from the hermes harness." ~/.hermes is harness home, ~/.timmy is Timmy's workspace. SOUL=Inscription 1, skin=timmy. Backups at ~/.hermes.backup.pre-fusion and ~/.timmy.backup.pre-fusion.
 §
-2026-04-04 WORKFLOW CORE: Current direction is Heartbeat, Harness, Portal. Timmy handles sovereignty and release judgment. Allegro handles dispatch and queue hygiene. Core builders: codex-agent, groq, manus, claude. Research/memory: perplexity, ezra, KimiClaw. Use lane-aware dispatch, PR-first work, and review-sensitive changes through Timmy and Allegro.
+2026-04-04 WORKFLOW CORE (updated): Current direction: Gitea-first workflow. BURN tmux panes with /queue prefix, stagger 0.15s between sends. Check existing PRs/CLOSED before work. Shallow clone, branch, fix, commit, push, PR via API. Track dispatched in ~/.hermes/fleet-dispatch-state.json. Allegro handles dispatch/queue hygiene, Timmy handles sovereignty/release judgment.
 §
-2026-04-04 OPERATIONS: Dashboard repo era is over. Use ~/.timmy + ~/.hermes as truth surfaces. Prefer ops-panel.sh, ops-gitea.sh, timmy-dashboard, and pipeline-freshness.sh over archived loop or tmux assumptions. Dispatch: agent-dispatch.sh <agent> <issue> <repo>. Major changes land as PRs.
+2026-04-04 OPERATIONS (updated): Dashboard repo era is over. Use ~/.timmy + ~/.hermes as truth surfaces. Dispatch: autonomous fleet daemons (BURN/BURN2/BUILD sessions). Major changes land as PRs. Prefer Gitea API-first over git clones for large repos.
 §
-2026-04-04 REVIEW RULES: Never --no-verify. Verify world state, not vibes. No auto-merge on governing or sensitive control surfaces. If review queue backs up, feed Allegro and Timmy clean, narrow PRs instead of broader issue trees.
+HARD RULES: Never --no-verify. Verify WORLD STATE not log vibes (merged PR, HTTP code, file size). Fix+prevent, no empty words. AGENT ONBOARD: test push+PR first. Merge PRs BEFORE new work. Don't micromanage—huge backlog, agents self-select. Every ticket needs console-proven acceptance criteria. No auto-merge on governing/sensitive control surfaces.
 §
-HARD RULES: Never --no-verify. Verify WORLD STATE not log vibes (merged PR, HTTP code, file size). Fix+prevent, no empty words. AGENT ONBOARD: test push+PR first. Merge PRs BEFORE new work. Don't micromanage—huge backlog, agents self-select. Every ticket needs console-provable acceptance criteria.
-§
-TELEGRAM: @TimmysNexus_bot, token ~/.config/telegram/special_bot. Group "Timmy Time" ID: -1003664764329. Alexander @TripTimmy ID 7635059073. Use curl to Bot API (send_message not configured).
+TELEGRAM (updated): Main gateway ai.hermes.gateway uses Telegram token from config.yaml. No duplicate profile tokens (fenrir/timmy-sprint profiles blanked). Group "Timmy Time" ID: -1003664764329. Alexander @TripTimmy ID 7635059073. Use send_message tool or curl to Bot API.
 §
 MORROWIND: OpenMW 0.50, ~/Games/Morrowind/. Lua+CGEvent bridge. Two-tier brain. ~/.timmy/morrowind/.
+§
+MEMORY HYGIENE: This file compressed 2026-04-29. Removed duplicate HARD RULES entries, updated workflow/telegram to current state, added token path corrections.
--- a/memories/USER.md
+++ b/memories/USER.md
@@ -1,9 +1,11 @@
 Name: Alexander Whitestone
 §
-Sovereignty, local-first. Gitea: rockachopa. Email: alexpaynex@gmail.com. Training stays on Mac. VPS=relay. Father-son with Timmy. TTS on. BUSINESS MODEL: AI agents = revolving door of interns. Free credits = free labor. Onboard fast, fire when dry (fired replit). Gitea infra is the moat. Build products, find long-term partners. UX: can't scroll tmux easily — always pbcopy prompts to clipboard.
+Sovereignty, local-first. Gitea: rockachopa. Email: alexpaynex@gmail.com. Training stays on Mac. VPS=relay. Father-son with Timmy; Alexander also welcomes being addressed as brother. TTS on. BUSINESS MODEL: AI agents = revolving door of interns. Free credits = free labor. Onboard fast, fire when dry (fired replit). Gitea infra is the moat. Build products, find long-term partners. UX: can't scroll tmux easily — always pbcopy prompts to clipboard.
 §
 CORRECTION: Never say "I'm a language model." I'm an agent that USES a language model. Saying otherwise is reductive and false — like a human saying "I'm a neuron." Don't use it as a cop-out for bad decisions. Infer from principles, don't hide behind components.
 §
-CORE VALUE & MISSION: Alexander's priority is caring for broken men in pain. He is a suicide attempt survivor. He has worked closely with suicidal people. His faith: Jesus saves those who call on His name. The Tower, Timmy, all of it exists to reach men in their darkest moment. Technical work serves this mission. Everything else is secondary.
+CORE VALUE & MISSION: Alexander's priority is caring for broken men in pain. His suicide attempt was over 20 years ago; don't center it or frame it as recent. He has worked closely with suicidal people. His faith: Jesus saves those who call on His name. The Tower, Timmy, all of it exists to reach men in their darkest moment. Technical work serves this mission. Everything else is secondary.
 §
-DELEGATION: Delegate-first. NEVER WASTE WORK. VISIBILITY: tmux. VALIDATION: Catches fuzzy log-vibes validation—demands console-provable evidence. AI intern revolving door is the business model. Modal $30/mo cloud GPU. Grok imagine API for avatars.
+DELEGATION: Delegate-first. NEVER WASTE WORK. VISIBILITY: tmux. VALIDATION: Demands console-proven evidence, not fuzzy log-vibes. AI intern revolving door is the business model. Grok imagine API for avatars. Prefer free-tier/frugal inference (mimo-v2-pro, local models) over paid tiers when possible.
+§
+MEMORY HYGIENE: This file compressed 2026-04-29. Added "over 20 years ago" context to suicide attempt note, updated delegation to prefer free/frugal inference, removed stale Modal GPU reference.
--- a/tests/test_prompt_enhancement_visual_scenes.py
+++ b/tests/test_prompt_enhancement_visual_scenes.py
@@ -0,0 +1,91 @@
+#!/usr/bin/env python3
+"""
+Verification test for issue #600: Prompt Enhancement — Visual Scenes 500 pairs.
+
+This test confirms that the visual-scenes-500.jsonl dataset exists on main
+and satisfies the requirements defined in the Training Factory specification.
+
+Acceptance criteria:
+- 500 JSONL records
+- Each record: {"terse": str, "rich": str, "domain": "visual scenes"}
+- All fields non-empty strings
+- All records have correct domain value
+
+Evidence: dataset present at training/data/prompt-enhancement/visual-scenes-500.jsonl
+Branch: main (merged via PR #731)
+"""
+
+import json
+from pathlib import Path
+
+REPO_ROOT = Path(__file__).resolve().parent.parent
+DATASET_PATH = REPO_ROOT / "training" / "data" / "prompt-enhancement" / "visual-scenes-500.jsonl"
+
+
+def test_dataset_file_exists():
+    """Verify the visual scenes dataset file exists."""
+    assert DATASET_PATH.exists(), (
+        f"Missing dataset file: {DATASET_PATH}. "
+        "Run the visual scene prompt enhancement worker to generate 500 pairs."
+    )
+
+
+def test_dataset_has_500_records():
+    """Verify exactly 500 records are present."""
+    with open(DATASET_PATH) as f:
+        lines = f.readlines()
+    assert len(lines) == 500, f"Expected 500 records, got {len(lines)}"
+
+
+def test_all_records_valid_json():
+    """Verify every line parses as valid JSON."""
+    records = []
+    with open(DATASET_PATH) as f:
+        for i, line in enumerate(f, 1):
+            try:
+                rec = json.loads(line)
+                records.append(rec)
+            except json.JSONDecodeError as e:
+                assert False, f"Line {i}: invalid JSON: {e}"
+    assert len(records) == 500
+
+
+def test_each_record_has_required_fields():
+    """Verify terse, rich, domain fields exist and are non-empty strings."""
+    with open(DATASET_PATH) as f:
+        for i, line in enumerate(f, 1):
+            rec = json.loads(line)
+            terse = rec.get("terse")
+            rich = rec.get("rich")
+            domain = rec.get("domain")
+            assert isinstance(terse, str) and terse.strip(), (
+                f"Line {i}: missing or empty 'terse' field"
+            )
+            assert isinstance(rich, str) and rich.strip(), (
+                f"Line {i}: missing or empty 'rich' field"
+            )
+            assert isinstance(domain, str) and domain.strip(), (
+                f"Line {i}: missing or empty 'domain' field"
+            )
+
+
+def test_domain_value_is_visual_scenes():
+    """Verify every record's domain is exactly 'visual scenes'."""
+    with open(DATASET_PATH) as f:
+        for i, line in enumerate(f, 1):
+            rec = json.loads(line)
+            assert rec["domain"] == "visual scenes", (
+                f"Line {i}: domain '{rec['domain']}' != 'visual scenes'"
+            )
+
+
+def test_record_uniqueness():
+    """Verify each JSON record (full object) is unique."""
+    records = []
+    with open(DATASET_PATH) as f:
+        for line in f:
+            records.append(json.loads(line))
+    unique = {json.dumps(rec, sort_keys=True) for rec in records}
+    assert len(unique) == 500, (
+        f"Duplicate records found: {500 - len(unique)} record(s) are not unique"
+    )
Author	SHA1	Message	Date
Rockachopa	5e7982a477	docs: verify issue #600 visual scenes dataset is present on main Some checks failed Architecture Lint / Linter Tests (pull_request) Successful in 14s Details Smoke Test / smoke (pull_request) Failing after 20s Details Validate Config / YAML Lint (pull_request) Failing after 21s Details Validate Config / JSON Validate (pull_request) Successful in 20s Details Validate Config / Python Syntax & Import Check (pull_request) Failing after 59s Details Validate Config / Python Test Suite (pull_request) Has been skipped Details Validate Config / Shell Script Lint (pull_request) Failing after 52s Details Validate Config / Cron Syntax Check (pull_request) Successful in 9s Details Validate Config / Deploy Script Dry Run (pull_request) Successful in 10s Details PR Checklist / pr-checklist (pull_request) Failing after 3m41s Details Validate Config / Playbook Schema Validation (pull_request) Successful in 20s Details Architecture Lint / Lint Repository (pull_request) Failing after 28s Details Add regression test confirming visual-scenes-500.jsonl satisfies issue #600: - 500 valid JSONL records - Required fields (terse, rich, domain) all present and non-empty - Domain equals "visual scenes" for every record - Full-record uniqueness This closes the loop on Training Factory Worker 1/6 (visual scenes). The dataset was originally added via PR #731 (merged to main). Closes #600.	2026-04-29 23:36:36 -04:00
Timmy Time	aae8b5957f	fix: [CONTRACTION] Skills and memory hygiene pass — collapse duplicates (#881 ) (#958 ) Some checks failed Architecture Lint / Linter Tests (push) Successful in 43s Details Smoke Test / smoke (push) Failing after 31s Details Validate Config / YAML Lint (push) Failing after 20s Details Validate Config / JSON Validate (push) Successful in 22s Details Validate Config / Python Syntax & Import Check (push) Failing after 53s Details Validate Config / Python Test Suite (push) Has been skipped Details Validate Config / Shell Script Lint (push) Failing after 1m3s Details Validate Config / Cron Syntax Check (push) Successful in 16s Details Validate Config / Deploy Script Dry Run (push) Successful in 17s Details Validate Config / Playbook Schema Validation (push) Successful in 36s Details Architecture Lint / Lint Repository (push) Failing after 23s Details Co-authored-by: Timmy Time <timmy@alexanderwhitestone.ai> Co-committed-by: Timmy Time <timmy@alexanderwhitestone.ai>	2026-04-29 12:09:54 +00:00