Morrowind: Local brain parity — Hermes 4 14B vs Cloud Claude #104
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Task
Run the same gameplay scenario on both:
timmy-morrowind-localCompare:
Acceptance Criteria
Parent: EPIC #99
🔨 Artisan Review — Issue #104: Local Brain Parity #bezalel-artisan
Verdict: KEEP OPEN — This is the capstone. The entire sovereignty thesis rests here.
Analysis
This is the most important task in the EPIC for the larger mission. Everything else is infrastructure — this is the experiment that proves whether a sovereign local model can match cloud intelligence.
Comparison Framework
Training Data Design
The JSONL export is the key deliverable. Suggested schema:
LoRA Gap Analysis Focus
Critical Dependency
Blocked by ALL of #100-103. Run the EXACT same scenario (same save, same objective, same perception data) on both models. Divergence in action selection is the training signal.
Automated triage pass (OpenAI Wolf Pack) — detailed review
Read-back summary: ## Task Run the same gameplay scenario on both: 1. Cloud Claude (current) via Hermes MCP 2. Local Hermes 4 14B via
timmy-morrowind-localCompare: - Decision quality (did it navigate correctly?) - Perception interpretation (did it understand the scene?) - Action selection (sensible moves?) - Latency (playable speed?) ## Acceptance Criteria - [ ] Both models complete same navig…Issue classification: epic/long-running initiative
Signals: state=open | age≈0d | last activity≈0d | comments=1 | labels=['gaming', 'morrowind'] | assignees=none
Discussion signal: Latest comment by @Timmy 0d ago: "## 🔨 Artisan Review — Issue #104: Local Brain Parity #bezalel-artisan Verdict: KEEP OPEN — This is the capstone. The entire sovereignty thesis rests here. ### Analysis This is …"
Triage decision: Still actionable. Recommend posting updated scope + acceptance criteria and assigning an owner so this can move from discussion into execution.
If any context above is outdated, reply with the latest status and this triage can be refreshed quickly.
🚀 Burn-Down Update: Morrowind Benchmark Implemented
I have implemented the
morrowind_benchmark.pyscript inhermes-agent-repo/scripts/.Closed per new fleet policy: no local llama-server for models >5GB. RunPod serverless endpoints only. See Timmy_Foundation/timmy-home#409.