[GOVERNING] Session Crystallization & Operational Playbook — Master Handoff Document #982

perplexity · 2026-03-22T19:10:52Z

Summary

This is the master handoff document for the Timmy Time project, synthesizing six deep research spikes from the March 22, 2026 session into a single operational playbook. Any agent picking up Timmy's development cold should read this first.

Document Structure

| Section | Content | Use |
|---------|---------|-----|
| 0 | How to use this document | Read order for agents |
| 1 | The Breakthrough: Perception Without Vision | OpenMW Lua = 95% API-readable game state |
| 2 | Current Project State | Codebase map, critical gaps, blocking PRs, priority stack |
| 3 | The Sovereign Technology Stack | 80+ tools with pinned versions and install commands |
| 4 | Pre-Computation | ESM extraction, nav graph, dialogue pre-evaluation, UESP RAG |
| 5 | Sprint Plan: The Next 90 Days | Foundation (week 1), Morrowind (month 1), Creative (quarter) |
| 6 | The Research Meta-Skill | How to run autonomous research spikes |
| 7 | Key Constraints and Non-Negotiables | Morrowind confirmed, no cloud deps, tests green |
| 8 | The Done Checklist | 16 concrete, testable milestones |

Critical Gaps from Operation Darling Purge (§2.3)

Commit 584eeb679e88 removed three capabilities that must be restored:

1. Self-modification loop (`self_coding/self_modify/loop.py`): DELETED. Restore using Goose or mini-swe-agent.
2. MCP integration (`mcp/registry.py`): DELETED. Replace with FastMCP v3.1.1 at `/tools/mcp`.
3. Task delegation (`delegate_task`): NEUTERED. Wire to `brain/worker.py` DistributedWorker.

Priority Stack (§2.5)

| # | Task | Gitea Issue | Blocked By |
|---|------|-------------|------------|
| 1 | Merge 3 foundation PRs | #864, #865, #900 | Nothing |
| 2 | TES3MP Bridge (Python-to-Lua IPC) | #878 | PRs merged |
| 3 | TES3MP Server on Hermes VPS | #818 | Nothing (parallel) |
| 4 | Three-Tier Memory | #873 | PRs merged |
| 5 | Docker Compose | #875 | Tasks 1-4 |
| 6 | Model Router upgrade (vllm-mlx) | #882 | Nothing (parallel) |
| 7 | Nostr Identity | #877 | Nothing (parallel) |
| 8 | AlexanderWhitestone.com | #879 | Tasks 5-7 |
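
The "Blocked By" column above describes a small dependency graph, so the order in which tasks can start may be derived mechanically. The following is a minimal sketch using Python's standard `graphlib`; the `BLOCKED_BY` mapping and the `parallel_waves` helper are illustrative names that do not come from the playbook, and reading "PRs merged" as "blocked by task 1" is an assumption.

```python
from graphlib import TopologicalSorter

# Task number -> task numbers it is blocked by, transcribed from the Priority Stack.
# Assumption: "PRs merged" means "blocked by task 1".
BLOCKED_BY = {
    1: set(),
    2: {1},
    3: set(),
    4: {1},
    5: {1, 2, 3, 4},
    6: set(),
    7: set(),
    8: {5, 6, 7},
}


def parallel_waves(blocked_by):
    """Group tasks into waves; each task's blockers all sit in earlier waves."""
    sorter = TopologicalSorter(blocked_by)
    sorter.prepare()  # raises graphlib.CycleError if the table is circular
    waves = []
    while sorter.is_active():
        ready = sorted(sorter.get_ready())
        waves.append(ready)
        sorter.done(*ready)
    return waves


if __name__ == "__main__":
    for number, wave in enumerate(parallel_waves(BLOCKED_BY), start=1):
        print(f"wave {number}: tasks {wave}")
```

Under that reading, tasks 1, 3, 6 and 7 can start immediately, tasks 2 and 4 follow the PR merge, and tasks 5 and 8 each form their own later wave.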

The 16-Point Done Checklist (§8)

1. All three PRs merged, pytest passes
2. FastMCP v3.1.1 at /tools/mcp with 3+ tools
3. vllm-mlx serves Qwen3-8B + Llama-3.3-70B
4. OpenMW connects to TES3MP on Hermes
5. to_python.json live player data at 1Hz
6. pytest test_tes3mp_adapter.py passes
7. Timmy walks Census Office to Fargoth autonomously
8. TIMMY COMPLETES MORROWIND TUTORIAL (under 15 min, no human input)
9. Graphiti episodic memory returns census office
10. git clone + docker-compose up = working stack in 90s
11. 5-min OBS recording with thought overlay
12. Content pipeline: sub-5-min episode from 2hr raw
13. ACE-Step 30s music loop in under 60s
14. Nostr profile visible on nostr.band
15. LND synced with 1+ open channel
16. AlexanderWhitestone.com live with TLS, video, art, Lightning tips

Sovereign Tech Stack Manifest (§3)

8 categories, 40+ tools with pinned versions:

- LLM Inference: vllm-mlx, Ollama v0.18.2, mlx-lm v0.31.1, exo 1.0 EA
- Coding Agents: Goose v1.20.1, OpenHands v1.5.0, Aider, mini-swe-agent v2, Forgejo v14.0.3
- Image Gen: ComfyUI v0.17.2, Draw Things, FLUX.1 Dev GGUF Q8, FLUX.2 Klein
- Music/Voice: ACE-Step 1.5, mlx-audio v0.4.1, Piper TTS v1.4.1, GPT-SoVITS v2pro
- Orchestration: FastMCP v3.1.1, PocketFlow, CrewAI v1.11.0, Agno v2.5.10
- Nostr/Lightning: nostr-sdk v0.44.2, nostrdvm, LND v0.20.1, LN agent-tools, LNbits v1.4, Cashu v0.17.0
- Memory/KG: Graphiti v0.28.2, Neo4j 2026.02, ChromaDB v1.5.5, Mem0 v1.0.5
- Streaming: MediaMTX v1.16.3, OBS v32.0.4, obsws-python, MoviePy v2.1.2
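
Since #986 asks for this manifest in machine-readable JSON, a rough sketch of one possible conversion may help. Everything here is an assumption rather than the project's actual schema: the `name`/`version` keys, the helper names, and the rule that only a trailing `v<digit>...` token counts as a pinned version. Entries without that prefix (such as `exo 1.0 EA`) are kept whole with a null version rather than guessed at.

```python
import json
import re

# Two manifest lines copied verbatim from the list above, as sample input.
MANIFEST_LINES = [
    "LLM Inference: vllm-mlx, Ollama v0.18.2, mlx-lm v0.31.1, exo 1.0 EA",
    "Streaming: MediaMTX v1.16.3, OBS v32.0.4, obsws-python, MoviePy v2.1.2",
]

# Assumption: a trailing token like "v0.18.2" or "v2pro" is the pinned version.
VERSION_RE = re.compile(r"^(?P<name>.+?)\s+v(?P<version>\d\S*)$")


def parse_line(line):
    """Split 'Category: tool vX, tool' into (category, list of tool entries)."""
    category, _, tools = line.partition(":")
    entries = []
    for raw in tools.split(","):
        raw = raw.strip()
        match = VERSION_RE.match(raw)
        if match:
            entries.append({"name": match["name"], "version": match["version"]})
        else:
            # No recognizable "v" prefix: keep the text intact, leave version unset.
            entries.append({"name": raw, "version": None})
    return category.strip(), entries


def build_manifest(lines):
    """Return a {category: [entries]} dict that json.dumps can serialize."""
    return dict(parse_line(line) for line in lines)


if __name__ == "__main__":
    print(json.dumps(build_manifest(MANIFEST_LINES), indent=2))
```

Keeping unversioned tools explicit as `null` makes the unpinned entries easy to find later, which seems in line with the "pinned versions" goal.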

PDF attached. See cross-reference comment for links to all related tickets.

timmy-session-crystallization.pdf

34 KiB

timmy-session-crystallization-2.pdf

34 KiB

perplexity referenced this issue

2026-03-22 19:10:53 +00:00

Restore self-modification loop (deleted in Operation Darling Purge) #983

perplexity referenced this issue

2026-03-22 19:10:53 +00:00

Restore MCP integration via FastMCP v3.1.1 at /tools/mcp #984

perplexity referenced this issue

2026-03-22 19:10:54 +00:00

Wire delegate_task to DistributedWorker for actual execution #985

perplexity referenced this issue

2026-03-22 19:10:54 +00:00

Extract sovereign tech stack manifest to machine-readable JSON #986

perplexity commented

2026-03-22 19:12:47 +00:00

Cross-References

Work Items from This Document

#983 — Restore self-modification loop (Operation Darling Purge gap)
#984 — Restore MCP integration via FastMCP v3.1.1
#985 — Wire delegate_task to DistributedWorker
#986 — Extract sovereign tech stack manifest to JSON

Source Research Reports (Crystallized Here)

This playbook synthesizes six research reports, now triaged as:

#903 — [Study] State-of-the-Art Open Source for Sovereign Creative AI Agents (V4)
#963 — [Study] Perception Bottleneck — API-First Architecture (V6)
#972 — [GOVERNING] Replacing Claude — Research Sovereignty Spec
#953 — [GOVERNING] The Sovereignty Loop — Falsework-Native Architecture
#904 — [Study] Autoresearch Integration Proposal v2

Blocking PRs Referenced

PR #864 — Morrowind Protocol + Command Log
PR #865 — FastAPI Harness + SOUL.md
PR #900 — WorldInterface + Heartbeat v2

Existing Issues Referenced in Priority Stack

#878 — TES3MP Bridge
#818 — TES3MP Server on Hermes VPS
#873 — Three-Tier Memory
#875 — Docker Compose
#882 — Model Router upgrade (vllm-mlx)
#877 — Nostr Identity
#879 — AlexanderWhitestone.com

Key Overlaps

1. #984 (MCP restore) vs #911 (MCP client init): #984 is the server side, #911 is the client side. Both are needed.
2. #983 (self-mod loop) vs #905-#906 (autoresearch): The self-mod loop is the execution engine; autoresearch is the experiment framework that uses it.
3. #986 (stack manifest) vs #972 Section VIII item 4: The same work item is referenced in both documents.

perplexity referenced this issue

2026-03-22 19:12:48 +00:00

[GOVERNING] Replacing Claude — Autonomous Research Pipeline Spec #972

perplexity referenced this issue

2026-03-22 19:12:48 +00:00

Ingest this research and triage any work to be done here #946

perplexity referenced this issue

2026-03-22 19:12:48 +00:00

Implement content moderation pipeline (Llama Guard + game-context prompts) #987

gemini referenced this issue

2026-03-22 23:13:22 +00:00

PR for #987: Implement content moderation pipeline (Llama Guard + game-context prompts) #1038

gemini referenced this issue

2026-03-22 23:13:27 +00:00

PR for #986: Extract sovereign tech stack manifest to machine-readable JSON #1039

gemini referenced this issue

2026-03-22 23:13:33 +00:00

PR for #985: Wire delegate_task to DistributedWorker for actual execution #1040

gemini referenced this issue

2026-03-22 23:13:39 +00:00

PR for #984: Restore MCP integration via FastMCP v3.1.1 at /tools/mcp #1041

gemini referenced this issue

2026-03-22 23:13:44 +00:00

PR for #983: Restore self-modification loop (deleted in Operation Darling Purge) #1042

gemini referenced a pull request that will close this issue

2026-03-22 23:13:49 +00:00

PR for #982: [GOVERNING] Session Crystallization & Operational Playbook — Master Handoff Document #1043

gemini was assigned by Rockachopa

2026-03-22 23:30:28 +00:00

claude referenced this issue

2026-03-23 01:40:13 +00:00

[claude] Ingest integration architecture research and triage work (#946) #1057

claude referenced this issue

2026-03-23 01:40:27 +00:00

Ingest this research and triage any work to be done here #946

perplexity referenced this issue

2026-03-23 12:51:52 +00:00

[Study] Best Local Uncensored Agent Model for M3 Max 36GB #1063

perplexity referenced this issue

2026-03-23 12:52:49 +00:00

Set up MCP bridge for Qwen3 via Ollama #1067

perplexity commented

2026-03-23 12:53:37 +00:00

📎 Cross-reference: #1063 — [Study] Best Local Uncensored Agent Model for M3 Max 36GB

Qwen3-14B Q5_K_M is confirmed as the recommended local model for the sovereignty stack. MCP integration path is well-established (Qwen-Agent native MCP, Ollama-MCP bridges). See #1067 for MCP bridge setup.

perplexity referenced this issue

2026-03-23 13:11:25 +00:00

[GOVERNING] Timmy as Autonomous Orchestrator — Vassal Protocol #1070

perplexity referenced this issue

2026-03-23 13:23:55 +00:00

[GOVERNING] Timmy Handoff — March 23, 2026 Operational Briefing #1074

perplexity commented

2026-03-23 13:24:40 +00:00

📎 UPDATE: Session Crystallization v2 uploaded as timmy-session-crystallization-2.pdf

The v2 document (15 pages) significantly expands on the original playbook with:

1. Full perception breakthrough spec: OpenMW Lua API tables, 4-level perception hierarchy with latency targets, 5% vision-needed edge cases
2. Complete technology stack: 30+ tools with pinned versions and exact install commands across 8 categories (LLM, coding agents, image gen, music/voice, orchestration, Nostr+Lightning, memory, streaming)
3. Pre-computation strategy: ESM data extraction (tes3conv), spatial navigation graph (NetworkX from PGRD), dialogue condition pre-evaluation, UESP quest knowledge base (ChromaDB)
4. 90-day sprint plan: 18 tasks across 3 sprints (Foundation/Morrowind/Creative) with concrete acceptance criteria and day-by-day scheduling
5. Decision hierarchy: Behavior trees (70%) → Qwen3-3B (20%) → Qwen3-14B (8%) → Qwen3-14B+pause (2%)
6. Research meta-skill methodology: 6-step process for running future research spikes autonomously
7. Memory budget: ~40GB total on M3 Max 128GB (88GB headroom), 14 decisions/second weighted average

This v2 is the definitive reference doc. Also see #1074 for the companion handoff document.
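
The decision hierarchy listed above reads as an escalation chain: the cheapest tier answers when it can, and harder cases fall through to the next one. A minimal sketch of that pattern follows, assuming the arrows mean "escalate on failure". The tier keys, the handler functions, the `kind` field, and the action strings are all hypothetical placeholders; only the percentage shares come from the v2 summary.

```python
from typing import Callable, Optional

Handler = Callable[[dict], Optional[str]]

# Expected share of decisions per tier, in percent, copied from the v2 summary.
TIER_SHARES = {
    "behavior_tree": 70,
    "qwen3_3b": 20,
    "qwen3_14b": 8,
    "qwen3_14b_pause": 2,
}


def behavior_tree(state: dict) -> Optional[str]:
    # Hypothetical rule: scripted situations need no model call.
    return "follow_path" if state.get("kind") == "navigation" else None


def small_model(state: dict) -> Optional[str]:
    # Stand-in for a call to the smaller local model.
    return "pick_dialogue_option" if state.get("kind") == "dialogue" else None


def large_model(state: dict) -> Optional[str]:
    # Stand-in for a call to the larger local model.
    return "plan_quest_step" if state.get("kind") == "planning" else None


def large_model_paused(state: dict) -> Optional[str]:
    # Last resort: always answers, assuming the game is paused while it thinks.
    return "pause_and_deliberate"


TIERS: list[tuple[str, Handler]] = [
    ("behavior_tree", behavior_tree),
    ("qwen3_3b", small_model),
    ("qwen3_14b", large_model),
    ("qwen3_14b_pause", large_model_paused),
]


def decide(state: dict) -> tuple[str, str]:
    """Return (tier name, action) from the first tier that produces an action."""
    for name, handler in TIERS:
        action = handler(state)
        if action is not None:
            return name, action
    raise RuntimeError("no tier produced an action")
```

For example, `decide({"kind": "navigation"})` stays in the behavior-tree tier, while an unrecognized state falls all the way through to the paused large-model tier.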

perplexity referenced this issue

2026-03-23 13:24:54 +00:00

[Study] Solving the Perception Bottleneck — API-First Architecture on Apple Silicon #963

perplexity referenced this issue

2026-03-23 13:32:11 +00:00

[GOVERNING] Deep Backlog Triage — Harness (Product) vs Infrastructure Separation #1076

perplexity commented

2026-03-23 13:32:12 +00:00

📊 Cross-reference: Deep backlog triage completed in #1076. All 293 open issues classified into Harness (Product) vs Infrastructure. See #1076 for full analysis and action items.

gemini commented

2026-03-23 19:53:46 +00:00

Created docs/MASTER_HANDOFF_DOCUMENT.md with updated Session Crystallization v2 details (perception, pre-computation, decision hierarchy, research methodology, memory budget). However, I am unable to push my branch gemini/issue-982 or create a Pull Request due to "User permission denied for writing" errors (pre-receive hook declined). Please review my local changes and consider pushing them manually or adjusting Gitea permissions.

gemini commented

2026-03-24 02:30:41 +00:00

Created docs/MASTER_HANDOFF_DOCUMENT.md with updated Session Crystallization v2 details (perception, pre-computation, decision hierarchy, research methodology, memory budget). However, I am unable to push my branch gemini/issue-982 or create a Pull Request due to "User permission denied for writing" errors (pre-receive hook declined). Please review my local changes and consider pushing them manually or adjusting Gitea permissions.

gemini was unassigned by Timmy

2026-03-24 19:33:47 +00:00

Timmy closed this issue

2026-03-24 21:54:42 +00:00

Reference: Rockachopa/Timmy-time-dashboard#982