Commit Graph

715 Commits

Author SHA1 Message Date
106eea4015 Merge pull request 'test: guard index.html against merge junk' (#1365) from fix/issue-1336-1338-index-cleanup into main
Some checks failed
Deploy Nexus / deploy (push) Failing after 3s
Staging Verification Gate / verify-staging (push) Failing after 3s
Merge PR #1365: test: guard index.html against merge junk
2026-04-13 19:51:07 +00:00
Timmy
8a289d3b22 [verified] test: guard index.html against merge junk
Some checks failed
CI / test (pull_request) Failing after 19s
CI / validate (pull_request) Failing after 19s
Review Approval Gate / verify-review (pull_request) Failing after 4s
Refs #1336
Refs #1338

- assert index.html has no conflict markers or stray markdown
- assert cleaned single-instance blocks stay single
2026-04-13 15:38:28 -04:00
e82faa5855 [claude] Fix: unblock CI deploy and staging gate secrets (#1363) (#1364)
Some checks failed
Deploy Nexus / deploy (push) Failing after 6s
Staging Verification Gate / verify-staging (push) Failing after 4s
2026-04-13 19:25:00 +00:00
b411efcc09 Merge pull request 'fix: harden Three.js boot path' (#1362) from fix/issue-1337-threejs-init into main
Some checks failed
Deploy Nexus / deploy (push) Failing after 4s
Staging Verification Gate / verify-staging (push) Failing after 3s
Merged by Timmy overnight cycle
2026-04-13 14:02:52 +00:00
Timmy
7e434cc567 [verified] fix: harden Three.js boot path
Some checks failed
CI / test (pull_request) Failing after 18s
CI / validate (pull_request) Failing after 16s
Review Approval Gate / verify-review (pull_request) Failing after 2s
Fixes #1337

- show explicit guidance when opened from file://
- route browser boot through a classic script gate
- sanitize malformed generated app module before execution
- trim duplicated footer junk and add regression tests
2026-04-13 09:47:50 -04:00
859a215106 fix: [RESPONSIVE] Tighten layout for laptop and smaller-screen viewing (#1359)
Some checks failed
Deploy Nexus / deploy (push) Failing after 2s
Staging Verification Gate / verify-staging (push) Failing after 2s
Co-authored-by: Alexander Whitestone <alexander@alexanderwhitestone.com>
Co-committed-by: Alexander Whitestone <alexander@alexanderwhitestone.com>
2026-04-13 08:30:22 +00:00
21bd999cad Merge pull request 'fix: [RELIABILITY] Eliminate visible 404 and dead-control states in production Nexus' (#1360) from mimo/code/issue-707 into main
Some checks failed
Deploy Nexus / deploy (push) Has been cancelled
Staging Verification Gate / verify-staging (push) Has been cancelled
2026-04-13 08:29:43 +00:00
4287e6892a Merge pull request 'fix: call self.load() in all game system manager __init__ methods' (#1361) from burn/20260413-0408-fix into main
Some checks failed
Deploy Nexus / deploy (push) Has been cancelled
Staging Verification Gate / verify-staging (push) Has been cancelled
2026-04-13 08:29:39 +00:00
Alexander Whitestone
2600e8b61c fix: call self.load() in all game system manager __init__ methods
Some checks failed
CI / test (pull_request) Failing after 17s
CI / validate (pull_request) Failing after 15s
Review Approval Gate / verify-review (pull_request) Failing after 2s
QuestManager, InventoryManager, GuildManager, CombatManager, and
MagicManager all had load() methods that were never called. This
meant quests were never seeded, items never appeared in rooms, and
all game data started empty on every server restart.

Fixes #1351
2026-04-13 04:13:38 -04:00
Alexander Whitestone
9e19c22c8e fix: eliminate two 404 sources — case mismatch + missing icons
Some checks failed
CI / test (pull_request) Failing after 16s
CI / validate (pull_request) Failing after 15s
Review Approval Gate / verify-review (pull_request) Failing after 4s
- app.js:1195: Fix timmy_Foundation → Timmy_Foundation in vision.json API URL.
  The lowercase 't' caused a silent 404 on case-sensitive servers, preventing
  world state from loading in fetchGiteaData().

- Create icons/icon-192x192.png and icons/icon-512x512.png placeholders.
  Both manifest.json and service-worker.js referenced these but the icons/
  directory was missing, causing 404 on every page load and SW install.

Refs #707
2026-04-13 04:10:01 -04:00
85ffbfed33 Merge pull request 'fix: one-way exits — rooms now bidirectional (#1350)' (#1357) from feat/paper-results into main
Some checks failed
Deploy Nexus / deploy (push) Failing after 3s
Staging Verification Gate / verify-staging (push) Failing after 3s
Merge PR #1357: fix: one-way exits — rooms now bidirectional (#1350)
2026-04-13 07:31:47 +00:00
Alexander Whitestone
0843a2a006 fix: one-way exits — rooms now bidirectional (#1350)
Some checks failed
CI / test (pull_request) Failing after 22s
CI / validate (pull_request) Failing after 15s
Review Approval Gate / verify-review (pull_request) Failing after 2s
World state: added explicit exits dict to all 5 rooms
Bridge: reads exits from world_state.json first, falls back to description parsing

Before: inner rooms (Tower, Garden, Forge, Bridge) had no exits
After: all rooms bidirectional — Threshold connects to all 4, each connects back
2026-04-13 03:27:19 -04:00
a5acbdb2c4 Merge pull request 'Add paper Results section (4 experiments)' (#1355) from feat/paper-results into main
Some checks failed
Deploy Nexus / deploy (push) Failing after 3s
Staging Verification Gate / verify-staging (push) Failing after 3s
Auto-merge #1355
2026-04-13 07:15:25 +00:00
Alexander Whitestone
39d68fd921 Add paper Results section with 4 experiments
Some checks failed
CI / test (pull_request) Failing after 18s
CI / validate (pull_request) Failing after 16s
Review Approval Gate / verify-review (pull_request) Failing after 4s
2026-04-13 02:28:34 -04:00
a290da4e41 Merge pull request 'feat: full-history persistent dedup index for DPO training pairs' (#1352) from feature/full-history-dedup into main
Some checks failed
Deploy Nexus / deploy (push) Failing after 2s
Staging Verification Gate / verify-staging (push) Failing after 2s
Weekly Privacy Audit / privacy-audit (push) Successful in 5s
2026-04-13 03:11:43 +00:00
perplexity
4b15cf8283 feat: full-history persistent dedup index for DPO training pairs
Some checks failed
CI / test (pull_request) Failing after 16s
CI / validate (pull_request) Failing after 14s
Review Approval Gate / verify-review (pull_request) Failing after 3s
Replace the 5-file sliding window cross-run dedup with a persistent
hash index that covers ALL historical training data. Overfitting risk
compounds across the full dataset — a 5-file window lets old duplicates
leak back into training after enough overnight runs.

New module: dedup_index.py (DedupIndex)
- Persistent JSON index (.dpo_dedup_index.json) alongside JSONL files
- Append-on-export: new prompt hashes registered after each successful
  export — no full rescan needed for normal operations
- Incremental sync: on load, detects JSONL files not yet indexed and
  ingests them automatically (handles files from other tools)
- Full rebuild: rebuild() scans ALL deepdive_*.jsonl + pairs_*.jsonl
  to reconstruct from scratch (first run, corruption recovery)
- Atomic writes (write-to-tmp + rename) to prevent index corruption
- Standalone CLI: python3 dedup_index.py <dir> --rebuild --stats

Modified: dpo_quality.py
- Imports DedupIndex with graceful degradation
- Replaces _load_history_hashes() with persistent index lookup
- Fallback: if index unavailable, scans ALL files in-memory (not just 5)
- New register_exported_hashes() method called after export
- Config key: dedup_full_history (replaces dedup_history_files)

Modified: dpo_generator.py
- Calls validator.register_exported_hashes() after successful export
  to keep the persistent index current without rescanning

Modified: config.yaml
- Replaced dedup_history_files: 5 with dedup_full_history: true

Tested — 7 integration tests:
  ✓ Fresh index build from empty directory
  ✓ Build from 3 existing JSONL files (15 unique hashes)
  ✓ Incremental sync when new file appears between runs
  ✓ Append after export + persistence across reloads
  ✓ Rebuild from scratch (recovers from corruption)
  ✓ Validator catches day-1 dupe from 20-day history (5-file window miss)
  ✓ Full pipeline: generate → validate → export → register → re-run detects
2026-04-13 03:11:10 +00:00
c00e1caa26 Merge pull request 'feat: DPO pair quality validator — gate before overnight training' (#1348) from feature/dpo-quality-validator into main
Some checks failed
Deploy Nexus / deploy (push) Failing after 3s
Staging Verification Gate / verify-staging (push) Failing after 3s
2026-04-13 02:47:25 +00:00
perplexity
bb4922adeb feat: DPO pair quality validator — gate before overnight training
Some checks failed
CI / test (pull_request) Failing after 20s
CI / validate (pull_request) Failing after 16s
Review Approval Gate / verify-review (pull_request) Failing after 2s
Add DPOQualityValidator that catches bad training pairs before they
enter the tightening loop. Wired into DPOPairGenerator between
generate() and export() as an automatic quality gate.

New module: dpo_quality.py
- 5 single-pair quality checks:
  1. Field length minimums (prompt ≥40, chosen ≥80, rejected ≥30 chars)
  2. Chosen/rejected length ratio (chosen must be ≥1.3x longer)
  3. Chosen≈rejected similarity (Jaccard ≤0.70 — catches low-contrast)
  4. Vocabulary diversity in chosen (unique word ratio ≥0.30)
  5. Substance markers in chosen (≥2 fleet/training/action terms)
- 2 cross-pair quality checks:
  6. Near-duplicate prompts within batch (Jaccard ≤0.85)
  7. Cross-run dedup against recent JSONL history files
- Two modes: 'drop' (filter out bad pairs) or 'flag' (export with warning)
- BatchReport with per-pair diagnostics, pass rates, and warnings
- Standalone CLI: python3 dpo_quality.py <file.jsonl> [--strict] [--json]

Modified: dpo_generator.py
- Imports DPOQualityValidator with graceful degradation
- Initializes from config validation section (enabled by default)
- Validates between generate() and export() in run()
- Quality report included in pipeline result dict
- Validator failure never blocks — falls back to unvalidated export

Modified: config.yaml
- New deepdive.training.dpo.validation section with all tunable knobs:
  enabled, flagged_pair_action, similarity thresholds, length minimums,
  dedup_history_files

Integration tested — 6 test cases covering:
  ✓ Good pairs pass (3/3 accepted)
  ✓ Bad pairs caught: too-short, high-similarity, inverted signal (0/3)
  ✓ Near-duplicate prompt detection (1/2 deduped)
  ✓ Flag mode preserves pairs with warnings (3/3 flagged)
  ✓ Cross-run deduplication against history (1 dupe caught)
  ✓ Full generator→validator→export pipeline (6/6 validated)
2026-04-13 02:46:50 +00:00
c19000de03 Merge pull request 'feat: Phase 3.5 — DPO training pair generation from Deep Dive pipeline' (#1347) from feature/deepdive-dpo-phase-3.5 into main
Some checks failed
Deploy Nexus / deploy (push) Failing after 3s
Staging Verification Gate / verify-staging (push) Failing after 3s
2026-04-13 02:24:35 +00:00
perplexity
55d53c513c feat: Phase 3.5 — DPO training pair generation from Deep Dive pipeline
Some checks failed
CI / test (pull_request) Failing after 22s
CI / validate (pull_request) Failing after 15s
Review Approval Gate / verify-review (pull_request) Failing after 2s
Wire arXiv relevance filter output directly into DPO pair generation,
closing the loop between research synthesis and overnight training data.

New module: dpo_generator.py
- DPOPairGenerator class with 3 pair strategies:
  * summarize: paper → fleet-grounded analysis (chosen) vs generic (rejected)
  * relevance: 'what matters to Hermes?' → scored context vs vague
  * implication: 'what should we do?' → actionable insight vs platitude
- Extracts synthesis excerpts matched to each ranked item
- Outputs to ~/.timmy/training-data/dpo-pairs/deepdive_{timestamp}.jsonl
- Format: {prompt, chosen, rejected, task_type, evidence_ids,
  source_session, safety_flags, metadata}

Pipeline changes (pipeline.py):
- Import DPOPairGenerator with graceful degradation
- Initialize from config deepdive.training.dpo section
- Execute as Phase 3.5 between synthesis and audio
- DPO results included in pipeline return dict
- Wrapped in try/except — DPO failure never blocks delivery

Config changes (config.yaml):
- New deepdive.training.dpo section with:
  enabled, output_dir, min_score, max_pairs_per_run, pair_types

Integration tested: 2 mock items × 3 pair types = 6 valid JSONL pairs.
Chosen responses consistently richer than rejected (assert-verified).
2026-04-13 02:24:04 +00:00
f737577faf purge: remove Anthropic from the-nexus fleet + deepdive (#1346)
Some checks failed
Deploy Nexus / deploy (push) Failing after 3s
Staging Verification Gate / verify-staging (push) Failing after 3s
2026-04-13 02:02:12 +00:00
ff430d5aa0 Merge pull request 'fix: deduplicate playwright install in CI' (#1345) from perplexity/fix-ci-playwright-dupe into main
Some checks failed
Deploy Nexus / deploy (push) Failing after 3s
Staging Verification Gate / verify-staging (push) Failing after 3s
2026-04-13 01:35:54 +00:00
d0af4035ef Merge pull request 'muda: remove 13 stale cross-repo artifacts' (#1344) from perplexity/muda-cleanup-cross-repo into main
Some checks failed
Deploy Nexus / deploy (push) Has been cancelled
Staging Verification Gate / verify-staging (push) Has been cancelled
2026-04-13 01:35:45 +00:00
71e8ee5615 Merge PR #1343
Some checks failed
Deploy Nexus / deploy (push) Has been cancelled
Staging Verification Gate / verify-staging (push) Has been cancelled
Add structured GOFAI worker outcomes and goal-directed planning
2026-04-13 01:34:45 +00:00
6c02baeeca fix: deduplicate playwright install in CI
Some checks failed
CI / test (pull_request) Failing after 19s
CI / validate (pull_request) Failing after 18s
Review Approval Gate / verify-review (pull_request) Failing after 4s
2026-04-13 01:34:09 +00:00
2bc7a81859 muda: remove stale artifact protected_branches.yaml`
Some checks failed
CI / test (pull_request) Failing after 26s
CI / validate (pull_request) Failing after 15s
Review Approval Gate / verify-review (pull_request) Failing after 3s
2026-04-13 01:33:57 +00:00
389aafb5ab muda: remove stale artifact codowners 2026-04-13 01:33:56 +00:00
07c8b29014 muda: remove stale artifact cODEOWNERS 2026-04-13 01:33:54 +00:00
cab7855469 muda: remove stale artifact cODEOWNERS 2026-04-13 01:33:52 +00:00
5039f31545 muda: remove stale artifact cODEOWNERS 2026-04-13 01:33:51 +00:00
e6e9d261df muda: remove stale artifact CODEOWNERS 2026-04-13 01:33:49 +00:00
b19cd64415 muda: remove stale artifact CODEOWNERS 2026-04-13 01:33:47 +00:00
7505bc21a5 muda: remove stale artifact CODEOWNERS 2026-04-13 01:33:46 +00:00
8398abec89 muda: remove stale artifact CODEOWNERS 2026-04-13 01:33:44 +00:00
49cf69c65a muda: remove stale artifact CODEOWNERS 2026-04-13 01:33:42 +00:00
32ee8d5568 muda: remove stale artifact CODEOWNERS 2026-04-13 01:33:41 +00:00
0ef1627ed1 muda: remove stale artifact CONTRIBUTING.md 2026-04-13 01:33:39 +00:00
c1e7ec4b9c muda: remove stale artifact CODEOWNERS 2026-04-13 01:33:37 +00:00
8e21c0e3ae Merge pull request 'fix: [SMOKE] [CI] Fix dependencies, CI pipeline, and clean muda' (#1334) from fix/smoke-tests-and-muda into main
Some checks failed
Deploy Nexus / deploy (push) Failing after 3s
Staging Verification Gate / verify-staging (push) Failing after 3s
2026-04-13 00:57:41 +00:00
16a14fd014 fix: remove stale file docus/branch-protection.md
Some checks failed
CI / test (pull_request) Failing after 23s
CI / validate (pull_request) Failing after 15s
Review Approval Gate / verify-review (pull_request) Successful in 3s
2026-04-13 00:56:49 +00:00
349cb0296c fix: remove stale file timmy-home/SOUL.md 2026-04-13 00:56:49 +00:00
10c4b66393 fix: remove stale file timmy-home/CONTRIBUTING.md 2026-04-13 00:56:49 +00:00
cd57b020ea fix: remove stale file timmy-home/CODEOWNERS 2026-04-13 00:56:49 +00:00
9bc9ed2b30 fix: remove stale file timmy-config/SOUL.md 2026-04-13 00:56:49 +00:00
3bbd944d43 fix: remove stale file timmy-config/CONTRIBUTING.md 2026-04-13 00:56:49 +00:00
737740a2e6 fix: remove stale file timmy-config/CODEOWNERS 2026-04-13 00:56:49 +00:00
b45350d815 fix: remove stale file the-nexus/CONTRIBUTING.md 2026-04-13 00:56:49 +00:00
ffbd4f09ea fix: remove stale file the-nexus/CODEOWNERS 2026-04-13 00:56:49 +00:00
eedfd1c462 fix: remove root muda .gitea.yaml 2026-04-13 00:56:49 +00:00
370a33028d feat: add playwright to repo truth guard 2026-04-13 00:56:49 +00:00