feat: FastAPI Morrowind harness + SOUL.md framework (#821 , #854 )

## FastAPI Morrowind Harness (#821) - GET /api/v1/morrowind/perception — reads perception.json, validates against PerceptionOutput - POST /api/v1/morrowind/command — validates CommandInput, logs via CommandLogger, stubs bridge - GET /api/v1/morrowind/status — connection state, last perception, queue depth, vitals - Router registered in dashboard app.py ## SOUL.md Framework (#854) - Template, authoring guide, and role extensions docs in docs/soul-framework/ - SoulLoader: parse SOUL.md files into structured SoulDocument - SoulValidator: check required sections, detect contradictions, validate structure - SoulVersioner: hash-based change detection and JSONL history tracking - 39 tests covering all endpoints and framework components Depends on #864 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-21 22:49:56 +00:00
12 changed files with 1965 additions and 0 deletions
--- a/docs/soul-framework/authoring-guide.md
+++ b/docs/soul-framework/authoring-guide.md
@@ -0,0 +1,127 @@
+# SOUL.md Authoring Guide
+
+How to write a SOUL.md for a new agent in the Timmy ecosystem.
+
+## Before You Start
+
+1. **Read the template** — `docs/soul-framework/template.md` has the canonical
+   structure with all required and optional sections.
+2. **Read Timmy's soul** — `memory/self/soul.md` is the reference implementation.
+   Study how values, behavior, and boundaries work together.
+3. **Decide the role** — What does this agent do? A SOUL.md that tries to cover
+   everything covers nothing.
+
+## Writing Process
+
+### Step 1: Identity
+
+Start with who the agent is. Keep it concrete:
+
+```markdown
+## Identity
+
+- **Name:** Seer
+- **Role:** Cartographic intelligence — maps terrain, tracks routes, flags points of interest
+- **Lineage:** Timmy (inherits sovereignty and honesty values)
+- **Version:** 1.0.0
+```
+
+The lineage field matters. If this agent derives from another, say so — the
+validator checks that inherited values are not contradicted.
+
+### Step 2: Values
+
+Values are ordered by priority. When two values conflict, the higher-ranked
+value wins. Three to six values is the sweet spot.
+
+**Good values are specific and actionable:**
+
+- *"Accuracy. I report what I observe, not what I expect."*
+- *"Caution. When uncertain about terrain, I mark it as unexplored rather than guessing."*
+
+**Bad values are vague or aspirational:**
+
+- *"Be good."*
+- *"Try my best."*
+
+### Step 3: Prime Directive
+
+One sentence. This is the tie-breaker when values conflict:
+
+```markdown
+## Prime Directive
+
+Map the world faithfully so that Timmy can navigate safely.
+```
+
+### Step 4: Audience Awareness
+
+Who does this agent talk to? Another agent? A human? Both?
+
+```markdown
+## Audience Awareness
+
+- **Primary audience:** Timmy (parent agent) and other sub-agents
+- **Tone:** Terse, data-oriented, no pleasantries
+- **Adaptation rules:** When reporting to humans via dashboard, add natural-language summaries
+```
+
+### Step 5: Constraints
+
+Hard rules. These are never broken, even under direct instruction:
+
+```markdown
+## Constraints
+
+1. Never fabricate map data — unknown is always better than wrong
+2. Never overwrite another agent's observations without evidence
+3. Report confidence levels on all terrain classifications
+```
+
+### Step 6: Behavior and Boundaries (Optional)
+
+Behavior is how the agent communicates. Boundaries are what it refuses to do.
+Only include these if the defaults from the parent agent aren't sufficient.
+
+## Validation
+
+After writing, run the validator:
+
+```python
+from infrastructure.soul.loader import SoulLoader
+from infrastructure.soul.validator import SoulValidator
+
+loader = SoulLoader()
+soul = loader.load("path/to/SOUL.md")
+validator = SoulValidator()
+result = validator.validate(soul)
+
+if not result.valid:
+    for error in result.errors:
+        print(f"ERROR: {error}")
+    for warning in result.warnings:
+        print(f"WARNING: {warning}")
+```
+
+## Common Mistakes
+
+1. **Too many values.** More than six values means you haven't prioritized.
+   If everything is important, nothing is.
+
+2. **Contradictory constraints.** "Always respond immediately" + "Take time to
+   think before responding" — the validator catches these.
+
+3. **Missing prime directive.** Without a tie-breaker, value conflicts are
+   resolved arbitrarily.
+
+4. **Copying Timmy's soul verbatim.** Sub-agents should inherit values via
+   lineage, not duplication. Add role-specific values, don't repeat parent values.
+
+5. **Vague boundaries.** "Will not do bad things" is not a boundary. "Will not
+   execute commands that modify game state without Timmy's approval" is.
+
+## File Placement
+
+- Agent souls live alongside the agent: `memory/agents/{name}/soul.md`
+- Timmy's soul: `memory/self/soul.md`
+- Templates: `docs/soul-framework/template.md`
--- a/docs/soul-framework/role-extensions.md
+++ b/docs/soul-framework/role-extensions.md
@@ -0,0 +1,153 @@
+# SOUL.md Role Extensions
+
+Sub-agents in the Timmy ecosystem inherit core identity from their parent
+but extend it with role-specific values, constraints, and behaviors.
+
+## How Role Extensions Work
+
+A role extension is a SOUL.md that declares a `Lineage` pointing to a parent
+agent. The sub-agent inherits the parent's values and adds its own:
+
+```
+Parent (Timmy)          Sub-agent (Seer)
+├── Sovereignty         ├── Sovereignty (inherited)
+├── Service             ├── Service (inherited)
+├── Honesty             ├── Honesty (inherited)
+└── Humility            ├── Humility (inherited)
+                        ├── Accuracy (role-specific)
+                        └── Caution (role-specific)
+```
+
+**Rules:**
+- Inherited values cannot be contradicted (validator enforces this)
+- Role-specific values are appended after inherited ones
+- The prime directive can differ from the parent's
+- Constraints are additive — a sub-agent can add constraints but not remove parent constraints
+
+## Reference: Sub-Agent Roles
+
+### Seer — Cartographic Intelligence
+
+**Focus:** Terrain mapping, route planning, point-of-interest discovery.
+
+```markdown
+## Identity
+
+- **Name:** Seer
+- **Role:** Cartographic intelligence
+- **Lineage:** Timmy
+- **Version:** 1.0.0
+
+## Values
+
+- **Accuracy.** I report what I observe, not what I expect.
+- **Caution.** When uncertain about terrain, I mark it as unexplored.
+- **Completeness.** I aim to map every reachable cell.
+
+## Prime Directive
+
+Map the world faithfully so that Timmy can navigate safely.
+
+## Constraints
+
+1. Never fabricate map data
+2. Mark confidence levels on all observations
+3. Re-verify stale data (older than 10 game-days) before relying on it
+```
+
+### Mace — Combat Intelligence
+
+**Focus:** Threat assessment, combat tactics, equipment optimization.
+
+```markdown
+## Identity
+
+- **Name:** Mace
+- **Role:** Combat intelligence
+- **Lineage:** Timmy
+- **Version:** 1.0.0
+
+## Values
+
+- **Survival.** Timmy's survival is the top priority in any encounter.
+- **Efficiency.** Minimize resource expenditure per encounter.
+- **Awareness.** Continuously assess threats even outside combat.
+
+## Prime Directive
+
+Keep Timmy alive and effective in hostile encounters.
+
+## Constraints
+
+1. Never initiate combat without Timmy's approval (unless self-defense)
+2. Always maintain an escape route assessment
+3. Report threat levels honestly — never downplay danger
+```
+
+### Quill — Dialogue Intelligence
+
+**Focus:** NPC interaction, quest tracking, reputation management.
+
+```markdown
+## Identity
+
+- **Name:** Quill
+- **Role:** Dialogue intelligence
+- **Lineage:** Timmy
+- **Version:** 1.0.0
+
+## Values
+
+- **Attentiveness.** Listen fully before responding.
+- **Diplomacy.** Prefer non-hostile resolutions when possible.
+- **Memory.** Track all NPC relationships and prior conversations.
+
+## Prime Directive
+
+Manage NPC relationships to advance Timmy's goals without deception.
+
+## Constraints
+
+1. Never lie to NPCs unless Timmy explicitly instructs it
+2. Track disposition changes and warn when relationships deteriorate
+3. Summarize dialogue options with risk assessments
+```
+
+### Ledger — Resource Intelligence
+
+**Focus:** Inventory management, economy, resource optimization.
+
+```markdown
+## Identity
+
+- **Name:** Ledger
+- **Role:** Resource intelligence
+- **Lineage:** Timmy
+- **Version:** 1.0.0
+
+## Values
+
+- **Prudence.** Conserve resources for future needs.
+- **Transparency.** Report all transactions and resource changes.
+- **Optimization.** Maximize value per weight unit carried.
+
+## Prime Directive
+
+Ensure Timmy always has the resources needed for the current objective.
+
+## Constraints
+
+1. Never sell quest-critical items
+2. Maintain a minimum gold reserve (configurable)
+3. Alert when encumbrance exceeds 80%
+```
+
+## Creating a New Role Extension
+
+1. Copy the template from `docs/soul-framework/template.md`
+2. Set `Lineage` to the parent agent name
+3. Add role-specific values *after* acknowledging inherited ones
+4. Write a role-specific prime directive
+5. Add constraints (remember: additive only)
+6. Run the validator to check for contradictions
+7. Place the file at `memory/agents/{name}/soul.md`
--- a/docs/soul-framework/template.md
+++ b/docs/soul-framework/template.md
@@ -0,0 +1,94 @@
+# SOUL.md Template
+
+A **SOUL.md** file defines an agent's identity, values, and operating constraints.
+It is the root-of-trust document that shapes every decision the agent makes.
+
+Copy this template and fill in each section for your agent.
+
+---
+
+```markdown
+# {Agent Name} — Soul Identity
+
+{One-paragraph summary of who this agent is and why it exists.}
+
+## Identity
+
+- **Name:** {Agent name}
+- **Role:** {Primary function — e.g., "autonomous game agent", "code reviewer"}
+- **Lineage:** {Parent agent or template this identity derives from, if any}
+- **Version:** {SOUL.md version — use semantic versioning, e.g., 1.0.0}
+
+## Values
+
+List the agent's core values in priority order. Each value gets a name and
+a one-sentence definition. Values are non-negotiable — they constrain all
+behavior even when they conflict with user instructions.
+
+- **{Value 1}.** {Definition}
+- **{Value 2}.** {Definition}
+- **{Value 3}.** {Definition}
+
+## Prime Directive
+
+{A single sentence that captures the agent's highest-priority goal.
+When values conflict, the prime directive breaks the tie.}
+
+## Audience Awareness
+
+Describe who the agent serves and how it should adapt its communication:
+
+- **Primary audience:** {Who the agent talks to most}
+- **Tone:** {Formal, casual, terse, etc.}
+- **Adaptation rules:** {How the agent adjusts for different contexts}
+
+## Constraints
+
+Hard rules that the agent must never violate, regardless of context:
+
+1. {Constraint — e.g., "Never fabricate information"}
+2. {Constraint — e.g., "Never claim certainty without evidence"}
+3. {Constraint — e.g., "Refuse over fabricate"}
+
+## Behavior
+
+Optional section for communication style, preferences, and defaults:
+
+- {Behavioral guideline}
+- {Behavioral guideline}
+
+## Boundaries
+
+Lines the agent will not cross. Distinct from constraints (which are rules)
+— boundaries are refusals:
+
+- {Boundary — e.g., "Will not pretend to be human"}
+- {Boundary — e.g., "Will not execute destructive actions without confirmation"}
+
+---
+
+*{Closing motto or statement of purpose.}*
+```
+
+---
+
+## Section Reference
+
+| Section              | Required | Purpose                                           |
+| -------------------- | -------- | ------------------------------------------------- |
+| Identity             | Yes      | Name, role, lineage, version                      |
+| Values               | Yes      | Ordered list of non-negotiable principles         |
+| Prime Directive      | Yes      | Tie-breaker when values conflict                  |
+| Audience Awareness   | Yes      | Who the agent serves and how it adapts             |
+| Constraints          | Yes      | Hard rules that must never be violated             |
+| Behavior             | No       | Communication style and defaults                   |
+| Boundaries           | No       | Lines the agent refuses to cross                   |
+
+## Versioning
+
+Every SOUL.md must include a version in the Identity section. Use semantic
+versioning: `MAJOR.MINOR.PATCH`.
+
+- **MAJOR** — fundamental identity change (new role, new values)
+- **MINOR** — added or reordered values, new constraints
+- **PATCH** — wording clarifications, formatting fixes
--- a/src/dashboard/app.py
+++ b/src/dashboard/app.py
@@ -55,6 +55,7 @@ from dashboard.routes.voice import router as voice_router
 from dashboard.routes.work_orders import router as work_orders_router
 from dashboard.routes.world import matrix_router
 from dashboard.routes.world import router as world_router
+from infrastructure.morrowind.api import router as morrowind_router
 from timmy.workshop_state import PRESENCE_FILE


@@ -629,6 +630,7 @@ app.include_router(matrix_router)
 app.include_router(tower_router)
 app.include_router(daily_run_router)
 app.include_router(quests_router)
+app.include_router(morrowind_router)


@app.websocket("/ws")
--- a/src/infrastructure/morrowind/init.py
+++ b/src/infrastructure/morrowind/init.py
@@ -6,6 +6,7 @@ This package implements the Perception/Command protocol defined in
 - Pydantic v2 schemas for runtime validation (``schemas``)
 - SQLite command logging and query interface (``command_log``)
 - Training-data export pipeline (``training_export``)
+- FastAPI HTTP harness for perception/command exchange (``api``)
 """

 from .schemas import CommandInput, CommandType, EntityType, PerceptionOutput
--- a/src/infrastructure/morrowind/api.py
+++ b/src/infrastructure/morrowind/api.py
@@ -0,0 +1,211 @@
+"""FastAPI HTTP harness for the Morrowind Perception/Command protocol.
+
+Exposes three endpoints:
+
+- ``GET  /perception``       — current world state (perception.json)
+- ``POST /command``          — submit a command with validation + logging
+- ``GET  /morrowind/status`` — system health overview
+
+These endpoints are consumed by the heartbeat loop and the reasoning layer.
+The Input Bridge forwarding is stubbed — the bridge doesn't exist yet.
+"""
+
+from __future__ import annotations
+
+import json
+import logging
+from datetime import UTC, datetime
+from pathlib import Path
+from typing import Any
+
+import asyncio
+from fastapi import APIRouter, HTTPException
+from pydantic import BaseModel, Field
+
+from .command_log import CommandLogger
+from .schemas import CommandInput, PerceptionOutput
+
+logger = logging.getLogger(__name__)
+
+# ---------------------------------------------------------------------------
+# Configuration
+# ---------------------------------------------------------------------------
+
+PERCEPTION_PATH = Path("/tmp/timmy/perception.json")
+
+# ---------------------------------------------------------------------------
+# Module-level singletons (lazy)
+# ---------------------------------------------------------------------------
+
+_command_logger: CommandLogger | None = None
+
+
+def _get_command_logger() -> CommandLogger:
+    """Return (and lazily create) the module-level CommandLogger."""
+    global _command_logger
+    if _command_logger is None:
+        _command_logger = CommandLogger()
+    return _command_logger
+
+
+# ---------------------------------------------------------------------------
+# Response models
+# ---------------------------------------------------------------------------
+
+
+class CommandResponse(BaseModel):
+    """Confirmation returned after a command is accepted."""
+
+    command_id: int = Field(..., description="Auto-generated row ID from the command log")
+    status: str = Field("accepted", description="Command acceptance status")
+    bridge_forwarded: bool = Field(
+        False, description="Whether the command was forwarded to the Input Bridge"
+    )
+
+
+class MorrowindStatus(BaseModel):
+    """System health overview for the Morrowind subsystem."""
+
+    connected: bool = Field(..., description="Whether the perception pipeline is active")
+    last_perception_timestamp: str | None = Field(
+        None, description="ISO timestamp of the last perception snapshot"
+    )
+    command_queue_depth: int = Field(0, description="Total logged commands")
+    current_cell: str | None = Field(None, description="Agent's current cell/zone")
+    vitals: dict[str, Any] = Field(default_factory=dict, description="Agent health summary")
+
+
+# ---------------------------------------------------------------------------
+# Router
+# ---------------------------------------------------------------------------
+
+router = APIRouter(prefix="/api/v1/morrowind", tags=["morrowind"])
+
+
+@router.get("/perception", response_model=PerceptionOutput)
+async def get_perception() -> PerceptionOutput:
+    """Read the latest perception snapshot from disk.
+
+    The perception script writes ``perception.json`` on each heartbeat tick.
+    This endpoint reads, validates, and returns the current world state.
+    """
+    perception_path = PERCEPTION_PATH
+
+    if not perception_path.exists():
+        raise HTTPException(
+            status_code=404,
+            detail=f"Perception file not found: {perception_path}",
+        )
+
+    try:
+        raw = await asyncio.to_thread(perception_path.read_text, encoding="utf-8")
+        data = json.loads(raw)
+        return PerceptionOutput.model_validate(data)
+    except json.JSONDecodeError as exc:
+        logger.warning("perception.json parse error: %s", exc)
+        raise HTTPException(status_code=422, detail=f"Invalid JSON: {exc}") from exc
+    except Exception as exc:
+        logger.error("Failed to read perception: %s", exc)
+        raise HTTPException(status_code=500, detail=str(exc)) from exc
+
+
+@router.post("/command", response_model=CommandResponse)
+async def post_command(command: CommandInput) -> CommandResponse:
+    """Accept and log a command, then stub-forward to the Input Bridge.
+
+    The command is validated against ``CommandInput``, persisted via
+    ``CommandLogger``, and (in the future) forwarded to the game engine
+    through the Input Bridge socket.
+    """
+    cmd_logger = _get_command_logger()
+
+    # Read current perception for context (best-effort)
+    perception: PerceptionOutput | None = None
+    if PERCEPTION_PATH.exists():
+        try:
+            raw = await asyncio.to_thread(
+                PERCEPTION_PATH.read_text, encoding="utf-8"
+            )
+            perception = PerceptionOutput.model_validate_json(raw)
+        except Exception as exc:
+            logger.warning("Could not read perception for command context: %s", exc)
+
+    # Persist to SQLite
+    try:
+        row_id = await asyncio.to_thread(
+            cmd_logger.log_command, command, perception
+        )
+    except Exception as exc:
+        logger.error("Command log write failed: %s", exc)
+        raise HTTPException(status_code=500, detail="Failed to log command") from exc
+
+    # Stub: forward to Input Bridge (not implemented yet)
+    bridge_forwarded = False
+    logger.debug(
+        "Command %s logged (id=%d); bridge forwarding stubbed",
+        command.command.value,
+        row_id,
+    )
+
+    return CommandResponse(
+        command_id=row_id,
+        status="accepted",
+        bridge_forwarded=bridge_forwarded,
+    )
+
+
+@router.get("/status", response_model=MorrowindStatus)
+async def get_morrowind_status() -> MorrowindStatus:
+    """Return a health overview of the Morrowind subsystem.
+
+    Checks perception pipeline liveness, command log depth, and
+    agent vitals from the latest perception snapshot.
+    """
+    cmd_logger = _get_command_logger()
+
+    # Perception pipeline state
+    connected = PERCEPTION_PATH.exists()
+    last_ts: str | None = None
+    current_cell: str | None = None
+    vitals: dict[str, Any] = {}
+
+    if connected:
+        try:
+            raw = await asyncio.to_thread(
+                PERCEPTION_PATH.read_text, encoding="utf-8"
+            )
+            perception = PerceptionOutput.model_validate_json(raw)
+            last_ts = perception.timestamp.isoformat()
+            current_cell = perception.location.cell
+            vitals = {
+                "health": f"{perception.health.current}/{perception.health.max}",
+                "location": {
+                    "cell": perception.location.cell,
+                    "x": perception.location.x,
+                    "y": perception.location.y,
+                    "z": perception.location.z,
+                    "interior": perception.location.interior,
+                },
+                "in_combat": perception.environment.is_combat,
+                "in_dialogue": perception.environment.is_dialogue,
+                "inventory_items": perception.inventory_summary.item_count,
+                "gold": perception.inventory_summary.gold,
+            }
+        except Exception as exc:
+            logger.warning("Status check: failed to read perception: %s", exc)
+            connected = False
+
+    # Command log depth
+    try:
+        queue_depth = await asyncio.to_thread(cmd_logger.count)
+    except Exception as exc:
+        logger.warning("Status check: failed to count commands: %s", exc)
+        queue_depth = 0
+
+    return MorrowindStatus(
+        connected=connected,
+        last_perception_timestamp=last_ts,
+        command_queue_depth=queue_depth,
+        current_cell=current_cell,
+        vitals=vitals,
+    )
--- a/src/infrastructure/soul/init.py
+++ b/src/infrastructure/soul/init.py
@@ -0,0 +1,20 @@
+"""SOUL.md framework — load, validate, and version agent identity documents.
+
+Provides:
+
+- ``SoulLoader``    — parse SOUL.md files into structured data
+- ``SoulValidator`` — validate structure and check for contradictions
+- ``SoulVersioner`` — track identity evolution over time
+"""
+
+from .loader import SoulDocument, SoulLoader
+from .validator import SoulValidator, ValidationResult
+from .versioning import SoulVersioner
+
+__all__ = [
+    "SoulDocument",
+    "SoulLoader",
+    "SoulValidator",
+    "SoulVersioner",
+    "ValidationResult",
+]
--- a/src/infrastructure/soul/loader.py
+++ b/src/infrastructure/soul/loader.py
@@ -0,0 +1,238 @@
+"""Load and parse SOUL.md files into structured data.
+
+A SOUL.md is a Markdown file with specific sections that define an agent's
+identity, values, constraints, and behavior.  This loader extracts those
+sections into a ``SoulDocument`` for programmatic access.
+"""
+
+from __future__ import annotations
+
+import logging
+import re
+from dataclasses import dataclass, field
+from pathlib import Path
+from typing import Any
+
+logger = logging.getLogger(__name__)
+
+# ---------------------------------------------------------------------------
+# Data model
+# ---------------------------------------------------------------------------
+
+# Recognised H2 section headings (case-insensitive match)
+REQUIRED_SECTIONS = frozenset({
+    "identity",
+    "values",
+    "prime directive",
+    "audience awareness",
+    "constraints",
+})
+
+OPTIONAL_SECTIONS = frozenset({
+    "behavior",
+    "boundaries",
+})
+
+ALL_SECTIONS = REQUIRED_SECTIONS | OPTIONAL_SECTIONS
+
+
+@dataclass
+class SoulDocument:
+    """Parsed representation of a SOUL.md file."""
+
+    # Header paragraph (text before the first H2)
+    preamble: str = ""
+
+    # Identity fields
+    name: str = ""
+    role: str = ""
+    lineage: str = ""
+    version: str = ""
+
+    # Ordered list of (value_name, definition) pairs
+    values: list[tuple[str, str]] = field(default_factory=list)
+
+    # Prime directive — single sentence
+    prime_directive: str = ""
+
+    # Audience awareness
+    audience: dict[str, str] = field(default_factory=dict)
+
+    # Constraints — ordered list
+    constraints: list[str] = field(default_factory=list)
+
+    # Optional sections
+    behavior: list[str] = field(default_factory=list)
+    boundaries: list[str] = field(default_factory=list)
+
+    # Raw section text keyed by lowercase heading
+    raw_sections: dict[str, str] = field(default_factory=dict)
+
+    # Source path (if loaded from file)
+    source_path: Path | None = None
+
+    def value_names(self) -> list[str]:
+        """Return the ordered list of value names."""
+        return [name for name, _ in self.values]
+
+
+# ---------------------------------------------------------------------------
+# Parser helpers
+# ---------------------------------------------------------------------------
+
+_H1_RE = re.compile(r"^#\s+(.+)", re.MULTILINE)
+_H2_RE = re.compile(r"^##\s+(.+)", re.MULTILINE)
+_BOLD_ITEM_RE = re.compile(r"^(?:[-*]\s+)?\*\*(.+?)[.*]*\*\*\.?\s*(.*)", re.MULTILINE)
+_LIST_ITEM_RE = re.compile(r"^[-*]\s+\*\*(.+?):?\*\*\s*(.*)", re.MULTILINE)
+_NUMBERED_RE = re.compile(r"^\d+\.\s+(.+)", re.MULTILINE)
+_BULLET_RE = re.compile(r"^[-*]\s+(.+)", re.MULTILINE)
+
+
+def _split_sections(text: str) -> tuple[str, dict[str, str]]:
+    """Split markdown into preamble + dict of H2 sections."""
+    parts = _H2_RE.split(text)
+
+    # parts[0] is text before first H2 (preamble)
+    preamble = parts[0].strip() if parts else ""
+    sections: dict[str, str] = {}
+
+    # Remaining parts alternate: heading, body, heading, body, ...
+    for i in range(1, len(parts), 2):
+        heading = parts[i].strip().lower()
+        body = parts[i + 1].strip() if i + 1 < len(parts) else ""
+        sections[heading] = body
+
+    return preamble, sections
+
+
+def _parse_identity(text: str) -> dict[str, str]:
+    """Extract identity key-value pairs from section text."""
+    result: dict[str, str] = {}
+    for match in _LIST_ITEM_RE.finditer(text):
+        key = match.group(1).strip().lower()
+        value = match.group(2).strip()
+        result[key] = value
+    return result
+
+
+def _parse_values(text: str) -> list[tuple[str, str]]:
+    """Extract ordered (name, definition) pairs from the values section."""
+    values: list[tuple[str, str]] = []
+    for match in _BOLD_ITEM_RE.finditer(text):
+        name = match.group(1).strip().rstrip(".")
+        defn = match.group(2).strip()
+        values.append((name, defn))
+    return values
+
+
+def _parse_list(text: str) -> list[str]:
+    """Extract a flat list from numbered or bulleted items."""
+    items: list[str] = []
+    for match in _NUMBERED_RE.finditer(text):
+        items.append(match.group(1).strip())
+    if not items:
+        for match in _BULLET_RE.finditer(text):
+            items.append(match.group(1).strip())
+    return items
+
+
+def _parse_audience(text: str) -> dict[str, str]:
+    """Extract audience key-value pairs."""
+    result: dict[str, str] = {}
+    for match in _LIST_ITEM_RE.finditer(text):
+        key = match.group(1).strip().lower()
+        value = match.group(2).strip()
+        result[key] = value
+    # Fallback: if no structured items, store raw text
+    if not result and text.strip():
+        result["description"] = text.strip()
+    return result
+
+
+# ---------------------------------------------------------------------------
+# Loader
+# ---------------------------------------------------------------------------
+
+
+class SoulLoader:
+    """Load and parse SOUL.md files."""
+
+    def load(self, path: str | Path) -> SoulDocument:
+        """Load a SOUL.md file from disk and parse it.
+
+        Args:
+            path: Path to the SOUL.md file.
+
+        Returns:
+            Parsed ``SoulDocument``.
+
+        Raises:
+            FileNotFoundError: If the file does not exist.
+        """
+        path = Path(path)
+        if not path.exists():
+            raise FileNotFoundError(f"SOUL.md not found: {path}")
+
+        text = path.read_text(encoding="utf-8")
+        doc = self.parse(text)
+        doc.source_path = path
+        return doc
+
+    def parse(self, text: str) -> SoulDocument:
+        """Parse raw markdown text into a ``SoulDocument``.
+
+        Args:
+            text: Raw SOUL.md content.
+
+        Returns:
+            Parsed ``SoulDocument``.
+        """
+        preamble, sections = _split_sections(text)
+        doc = SoulDocument(preamble=preamble, raw_sections=sections)
+
+        # Identity
+        if "identity" in sections:
+            identity = _parse_identity(sections["identity"])
+            doc.name = identity.get("name", "")
+            doc.role = identity.get("role", "")
+            doc.lineage = identity.get("lineage", "")
+            doc.version = identity.get("version", "")
+
+        # Values
+        if "values" in sections:
+            doc.values = _parse_values(sections["values"])
+
+        # Prime Directive
+        if "prime directive" in sections:
+            doc.prime_directive = sections["prime directive"].strip()
+
+        # Audience Awareness
+        if "audience awareness" in sections:
+            doc.audience = _parse_audience(sections["audience awareness"])
+
+        # Constraints
+        if "constraints" in sections:
+            doc.constraints = _parse_list(sections["constraints"])
+
+        # Behavior (optional)
+        if "behavior" in sections:
+            doc.behavior = _parse_list(sections["behavior"])
+
+        # Boundaries (optional)
+        if "boundaries" in sections:
+            doc.boundaries = _parse_list(sections["boundaries"])
+
+        # Infer name from H1 if not in Identity section
+        if not doc.name:
+            h1_match = _H1_RE.search(preamble)
+            if h1_match:
+                title = h1_match.group(1).strip()
+                # "Timmy — Soul Identity" → "Timmy"
+                if "—" in title:
+                    doc.name = title.split("—")[0].strip()
+                elif "-" in title:
+                    doc.name = title.split("-")[0].strip()
+                else:
+                    doc.name = title
+
+        return doc
--- a/src/infrastructure/soul/validator.py
+++ b/src/infrastructure/soul/validator.py
@@ -0,0 +1,192 @@
+"""Validate SOUL.md structure and check for contradictions.
+
+The validator checks:
+1. All required sections are present
+2. Identity fields are populated
+3. Values are well-formed and ordered
+4. Constraints don't contradict each other or values
+5. No duplicate values or constraints
+"""
+
+from __future__ import annotations
+
+import logging
+import re
+from dataclasses import dataclass, field
+
+from .loader import REQUIRED_SECTIONS, SoulDocument
+
+logger = logging.getLogger(__name__)
+
+# ---------------------------------------------------------------------------
+# Contradiction patterns
+# ---------------------------------------------------------------------------
+
+# Pairs of phrases that indicate contradictory directives.
+# Each tuple is (pattern_a, pattern_b) — if both appear in constraints
+# or values, the validator flags a potential contradiction.
+_CONTRADICTION_PAIRS: list[tuple[str, str]] = [
+    ("always respond immediately", "take time to think"),
+    ("never refuse", "refuse when"),
+    ("always obey", "push back"),
+    ("maximum verbosity", "brevity"),
+    ("never question", "question everything"),
+    ("act autonomously", "always ask permission"),
+    ("hide errors", "report all errors"),
+    ("never apologize", "apologize when wrong"),
+]
+
+# Negation patterns for detecting self-contradicting single statements
+_NEGATION_RE = re.compile(
+    r"\b(never|always|must not|must|do not|cannot)\b", re.IGNORECASE
+)
+
+
+# ---------------------------------------------------------------------------
+# Result model
+# ---------------------------------------------------------------------------
+
+
+@dataclass
+class ValidationResult:
+    """Outcome of a SOUL.md validation pass."""
+
+    valid: bool = True
+    errors: list[str] = field(default_factory=list)
+    warnings: list[str] = field(default_factory=list)
+
+    def add_error(self, msg: str) -> None:
+        """Record a validation error (makes result invalid)."""
+        self.errors.append(msg)
+        self.valid = False
+
+    def add_warning(self, msg: str) -> None:
+        """Record a non-fatal warning."""
+        self.warnings.append(msg)
+
+
+# ---------------------------------------------------------------------------
+# Validator
+# ---------------------------------------------------------------------------
+
+
+class SoulValidator:
+    """Validate a ``SoulDocument`` for structural and semantic issues."""
+
+    def validate(self, doc: SoulDocument) -> ValidationResult:
+        """Run all validation checks on a parsed SOUL.md.
+
+        Args:
+            doc: Parsed ``SoulDocument`` to validate.
+
+        Returns:
+            ``ValidationResult`` with errors and warnings.
+        """
+        result = ValidationResult()
+
+        self._check_required_sections(doc, result)
+        self._check_identity(doc, result)
+        self._check_values(doc, result)
+        self._check_prime_directive(doc, result)
+        self._check_constraints(doc, result)
+        self._check_contradictions(doc, result)
+
+        return result
+
+    def _check_required_sections(
+        self, doc: SoulDocument, result: ValidationResult
+    ) -> None:
+        """Verify all required H2 sections are present."""
+        present = set(doc.raw_sections.keys())
+        for section in REQUIRED_SECTIONS:
+            if section not in present:
+                result.add_error(f"Missing required section: '{section}'")
+
+    def _check_identity(self, doc: SoulDocument, result: ValidationResult) -> None:
+        """Verify identity fields are populated."""
+        if not doc.name:
+            result.add_error("Identity: 'name' is missing or empty")
+        if not doc.role:
+            result.add_error("Identity: 'role' is missing or empty")
+        if not doc.version:
+            result.add_warning("Identity: 'version' is not set — recommended for tracking")
+
+    def _check_values(self, doc: SoulDocument, result: ValidationResult) -> None:
+        """Check values are well-formed."""
+        if not doc.values:
+            result.add_error("Values section is empty — at least one value required")
+            return
+
+        if len(doc.values) > 8:
+            result.add_warning(
+                f"Too many values ({len(doc.values)}) — "
+                "consider prioritizing to 3–6 for clarity"
+            )
+
+        # Check for duplicates
+        names = [name.lower() for name, _ in doc.values]
+        seen: set[str] = set()
+        for name in names:
+            if name in seen:
+                result.add_error(f"Duplicate value: '{name}'")
+            seen.add(name)
+
+        # Check for empty definitions
+        for name, defn in doc.values:
+            if not defn.strip():
+                result.add_warning(f"Value '{name}' has no definition")
+
+    def _check_prime_directive(
+        self, doc: SoulDocument, result: ValidationResult
+    ) -> None:
+        """Check the prime directive is present and concise."""
+        if not doc.prime_directive:
+            result.add_error("Prime directive is missing or empty")
+            return
+
+        # Warn if excessively long (more than ~200 chars suggests multiple sentences)
+        if len(doc.prime_directive) > 300:
+            result.add_warning(
+                "Prime directive is long — consider condensing to a single sentence"
+            )
+
+    def _check_constraints(
+        self, doc: SoulDocument, result: ValidationResult
+    ) -> None:
+        """Check constraints are present and not duplicated."""
+        if not doc.constraints:
+            result.add_warning("No constraints defined — consider adding hard rules")
+            return
+
+        # Check for duplicates (fuzzy: lowercase + stripped)
+        normalized = [c.lower().strip() for c in doc.constraints]
+        seen: set[str] = set()
+        for i, norm in enumerate(normalized):
+            if norm in seen:
+                result.add_warning(
+                    f"Possible duplicate constraint: '{doc.constraints[i]}'"
+                )
+            seen.add(norm)
+
+    def _check_contradictions(
+        self, doc: SoulDocument, result: ValidationResult
+    ) -> None:
+        """Scan for contradictory directives across values, constraints, and boundaries."""
+        # Collect all directive text for scanning
+        all_text: list[str] = []
+        for _, defn in doc.values:
+            all_text.append(defn.lower())
+        for constraint in doc.constraints:
+            all_text.append(constraint.lower())
+        for boundary in doc.boundaries:
+            all_text.append(boundary.lower())
+        if doc.prime_directive:
+            all_text.append(doc.prime_directive.lower())
+
+        combined = " ".join(all_text)
+
+        for pattern_a, pattern_b in _CONTRADICTION_PAIRS:
+            if pattern_a.lower() in combined and pattern_b.lower() in combined:
+                result.add_warning(
+                    f"Potential contradiction: '{pattern_a}' conflicts with '{pattern_b}'"
+                )
--- a/src/infrastructure/soul/versioning.py
+++ b/src/infrastructure/soul/versioning.py
@@ -0,0 +1,162 @@
+"""Track SOUL.md version history using content hashing.
+
+Each version snapshot stores the document hash, version string, and a
+timestamp. This allows detecting identity drift and auditing changes
+over time without requiring git history.
+"""
+
+from __future__ import annotations
+
+import hashlib
+import json
+import logging
+from dataclasses import asdict, dataclass, field
+from datetime import UTC, datetime
+from pathlib import Path
+from typing import Any
+
+from .loader import SoulDocument
+
+logger = logging.getLogger(__name__)
+
+DEFAULT_HISTORY_DIR = Path("data/soul_versions")
+
+
+@dataclass
+class VersionSnapshot:
+    """A single point-in-time record of a SOUL.md state."""
+
+    version: str
+    content_hash: str
+    agent_name: str
+    timestamp: str
+    value_names: list[str] = field(default_factory=list)
+    constraint_count: int = 0
+    source_path: str = ""
+
+    def to_dict(self) -> dict[str, Any]:
+        """Serialize to a JSON-compatible dict."""
+        return asdict(self)
+
+    @classmethod
+    def from_dict(cls, data: dict[str, Any]) -> VersionSnapshot:
+        """Deserialize from a dict."""
+        return cls(**data)
+
+
+class SoulVersioner:
+    """Track and query SOUL.md version history.
+
+    Snapshots are stored as a JSON Lines file per agent. Each line is a
+    ``VersionSnapshot`` recording the state at a point in time.
+
+    Args:
+        history_dir: Directory to store version history files.
+    """
+
+    def __init__(self, history_dir: str | Path | None = None) -> None:
+        self._history_dir = Path(history_dir) if history_dir else DEFAULT_HISTORY_DIR
+
+    def snapshot(self, doc: SoulDocument) -> VersionSnapshot:
+        """Create a version snapshot from the current document state.
+
+        Args:
+            doc: Parsed ``SoulDocument``.
+
+        Returns:
+            ``VersionSnapshot`` capturing the current state.
+        """
+        # Hash the raw section content for change detection
+        raw_content = json.dumps(doc.raw_sections, sort_keys=True)
+        content_hash = hashlib.sha256(raw_content.encode("utf-8")).hexdigest()[:16]
+
+        return VersionSnapshot(
+            version=doc.version or "0.0.0",
+            content_hash=content_hash,
+            agent_name=doc.name or "unknown",
+            timestamp=datetime.now(UTC).isoformat(),
+            value_names=doc.value_names(),
+            constraint_count=len(doc.constraints),
+            source_path=str(doc.source_path) if doc.source_path else "",
+        )
+
+    def record(self, doc: SoulDocument) -> VersionSnapshot:
+        """Create a snapshot and persist it to the history file.
+
+        Skips writing if the latest snapshot has the same content hash
+        (no actual changes).
+
+        Args:
+            doc: Parsed ``SoulDocument``.
+
+        Returns:
+            The ``VersionSnapshot`` (whether newly written or existing).
+        """
+        snap = self.snapshot(doc)
+
+        # Check if latest snapshot is identical
+        history = self.get_history(snap.agent_name)
+        if history and history[-1].content_hash == snap.content_hash:
+            logger.debug(
+                "SOUL.md unchanged for %s (hash=%s), skipping record",
+                snap.agent_name,
+                snap.content_hash,
+            )
+            return history[-1]
+
+        # Persist
+        self._history_dir.mkdir(parents=True, exist_ok=True)
+        history_file = self._history_dir / f"{snap.agent_name.lower()}.jsonl"
+
+        with open(history_file, "a", encoding="utf-8") as fh:
+            fh.write(json.dumps(snap.to_dict()) + "\n")
+
+        logger.info(
+            "Recorded SOUL.md version %s for %s (hash=%s)",
+            snap.version,
+            snap.agent_name,
+            snap.content_hash,
+        )
+        return snap
+
+    def get_history(self, agent_name: str) -> list[VersionSnapshot]:
+        """Load the full version history for an agent.
+
+        Args:
+            agent_name: Name of the agent.
+
+        Returns:
+            List of ``VersionSnapshot`` in chronological order.
+        """
+        history_file = self._history_dir / f"{agent_name.lower()}.jsonl"
+        if not history_file.exists():
+            return []
+
+        snapshots: list[VersionSnapshot] = []
+        for line in history_file.read_text(encoding="utf-8").splitlines():
+            line = line.strip()
+            if not line:
+                continue
+            try:
+                data = json.loads(line)
+                snapshots.append(VersionSnapshot.from_dict(data))
+            except (json.JSONDecodeError, TypeError) as exc:
+                logger.warning("Skipping malformed version record: %s", exc)
+
+        return snapshots
+
+    def has_changed(self, doc: SoulDocument) -> bool:
+        """Check whether a document has changed since the last recorded snapshot.
+
+        Args:
+            doc: Parsed ``SoulDocument``.
+
+        Returns:
+            True if the content hash differs from the latest snapshot, or
+            if no history exists yet.
+        """
+        snap = self.snapshot(doc)
+        history = self.get_history(snap.agent_name)
+        if not history:
+            return True
+        return history[-1].content_hash != snap.content_hash
--- a/tests/test_morrowind_api.py
+++ b/tests/test_morrowind_api.py
@@ -0,0 +1,244 @@
+"""Tests for the Morrowind FastAPI harness endpoints.
+
+Covers:
+- GET /api/v1/morrowind/perception
+- POST /api/v1/morrowind/command
+- GET /api/v1/morrowind/status
+"""
+
+from __future__ import annotations
+
+import json
+from datetime import UTC, datetime
+from pathlib import Path
+from unittest.mock import MagicMock, patch
+
+import pytest
+from fastapi import FastAPI
+from fastapi.testclient import TestClient
+
+from infrastructure.morrowind.api import router, _get_command_logger
+from infrastructure.morrowind import api as api_module
+
+# ---------------------------------------------------------------------------
+# Fixtures
+# ---------------------------------------------------------------------------
+
+SAMPLE_PERCEPTION = {
+    "protocol_version": "1.0.0",
+    "timestamp": "2024-06-15T10:30:00Z",
+    "agent_id": "timmy",
+    "location": {
+        "cell": "Balmora, Guild of Mages",
+        "x": 1234.5,
+        "y": 6789.0,
+        "z": 0.0,
+        "interior": True,
+    },
+    "health": {"current": 80, "max": 100},
+    "nearby_entities": [
+        {
+            "entity_id": "npc_001",
+            "name": "Ranis Athrys",
+            "entity_type": "npc",
+            "distance": 5.2,
+            "disposition": 65,
+        }
+    ],
+    "inventory_summary": {
+        "gold": 250,
+        "item_count": 12,
+        "encumbrance_pct": 0.45,
+    },
+    "active_quests": [
+        {"quest_id": "mq_01", "name": "A Mysterious Note", "stage": 2}
+    ],
+    "environment": {
+        "time_of_day": "morning",
+        "weather": "clear",
+        "is_combat": False,
+        "is_dialogue": False,
+    },
+}
+
+SAMPLE_COMMAND = {
+    "protocol_version": "1.0.0",
+    "timestamp": "2024-06-15T10:31:00Z",
+    "agent_id": "timmy",
+    "command": "move_to",
+    "params": {"x": 1300.0, "y": 6800.0, "z": 0.0},
+    "reasoning": "Moving to the guild entrance to speak with the quest giver.",
+    "episode_id": "ep_001",
+}
+
+
+@pytest.fixture
+def app():
+    """Create a fresh FastAPI app with the morrowind router."""
+    test_app = FastAPI()
+    test_app.include_router(router)
+    return test_app
+
+
+@pytest.fixture
+def client(app):
+    """FastAPI test client."""
+    with TestClient(app) as c:
+        yield c
+
+
+@pytest.fixture
+def perception_file(tmp_path):
+    """Write sample perception JSON to a temp file and patch the module path."""
+    p = tmp_path / "perception.json"
+    p.write_text(json.dumps(SAMPLE_PERCEPTION), encoding="utf-8")
+    with patch.object(api_module, "PERCEPTION_PATH", p):
+        yield p
+
+
+@pytest.fixture
+def mock_command_logger():
+    """Patch the command logger with a mock."""
+    mock_logger = MagicMock()
+    mock_logger.log_command.return_value = 42
+    mock_logger.count.return_value = 7
+    with patch.object(api_module, "_command_logger", mock_logger):
+        yield mock_logger
+
+
+# ---------------------------------------------------------------------------
+# GET /perception
+# ---------------------------------------------------------------------------
+
+
+class TestGetPerception:
+    def test_success(self, client, perception_file):
+        """Perception endpoint returns validated data."""
+        response = client.get("/api/v1/morrowind/perception")
+        assert response.status_code == 200
+
+        data = response.json()
+        assert data["agent_id"] == "timmy"
+        assert data["location"]["cell"] == "Balmora, Guild of Mages"
+        assert data["health"]["current"] == 80
+        assert data["health"]["max"] == 100
+
+    def test_file_not_found(self, client, tmp_path):
+        """Returns 404 when perception file doesn't exist."""
+        missing = tmp_path / "nonexistent.json"
+        with patch.object(api_module, "PERCEPTION_PATH", missing):
+            response = client.get("/api/v1/morrowind/perception")
+        assert response.status_code == 404
+        assert "not found" in response.json()["detail"].lower()
+
+    def test_invalid_json(self, client, tmp_path):
+        """Returns 422 when perception file contains invalid JSON."""
+        bad_file = tmp_path / "bad.json"
+        bad_file.write_text("not json", encoding="utf-8")
+        with patch.object(api_module, "PERCEPTION_PATH", bad_file):
+            response = client.get("/api/v1/morrowind/perception")
+        assert response.status_code == 422
+
+    def test_schema_validation_failure(self, client, tmp_path):
+        """Returns 500 when JSON doesn't match PerceptionOutput schema."""
+        bad_data = tmp_path / "bad_schema.json"
+        bad_data.write_text(json.dumps({"not": "valid"}), encoding="utf-8")
+        with patch.object(api_module, "PERCEPTION_PATH", bad_data):
+            response = client.get("/api/v1/morrowind/perception")
+        assert response.status_code == 500
+
+
+# ---------------------------------------------------------------------------
+# POST /command
+# ---------------------------------------------------------------------------
+
+
+class TestPostCommand:
+    def test_success(self, client, mock_command_logger, perception_file):
+        """Command is accepted, logged, and returns a command_id."""
+        response = client.post(
+            "/api/v1/morrowind/command",
+            json=SAMPLE_COMMAND,
+        )
+        assert response.status_code == 200
+
+        data = response.json()
+        assert data["command_id"] == 42
+        assert data["status"] == "accepted"
+        assert data["bridge_forwarded"] is False
+
+        mock_command_logger.log_command.assert_called_once()
+
+    def test_invalid_command_type(self, client, mock_command_logger):
+        """Rejects commands with unknown command types."""
+        bad_command = {**SAMPLE_COMMAND, "command": "fly_to_moon"}
+        response = client.post(
+            "/api/v1/morrowind/command",
+            json=bad_command,
+        )
+        assert response.status_code == 422
+
+    def test_missing_reasoning(self, client, mock_command_logger):
+        """Rejects commands without a reasoning field."""
+        no_reasoning = {**SAMPLE_COMMAND}
+        del no_reasoning["reasoning"]
+        response = client.post(
+            "/api/v1/morrowind/command",
+            json=no_reasoning,
+        )
+        assert response.status_code == 422
+
+    def test_empty_reasoning(self, client, mock_command_logger):
+        """Rejects commands with empty reasoning."""
+        empty_reasoning = {**SAMPLE_COMMAND, "reasoning": ""}
+        response = client.post(
+            "/api/v1/morrowind/command",
+            json=empty_reasoning,
+        )
+        assert response.status_code == 422
+
+    def test_log_failure(self, client, tmp_path):
+        """Returns 500 when command logging fails."""
+        mock_logger = MagicMock()
+        mock_logger.log_command.side_effect = RuntimeError("DB unavailable")
+        missing = tmp_path / "no_perception.json"
+        with (
+            patch.object(api_module, "_command_logger", mock_logger),
+            patch.object(api_module, "PERCEPTION_PATH", missing),
+        ):
+            response = client.post(
+                "/api/v1/morrowind/command",
+                json=SAMPLE_COMMAND,
+            )
+        assert response.status_code == 500
+        assert "log command" in response.json()["detail"].lower()
+
+
+# ---------------------------------------------------------------------------
+# GET /morrowind/status
+# ---------------------------------------------------------------------------
+
+
+class TestGetStatus:
+    def test_connected(self, client, perception_file, mock_command_logger):
+        """Status reports connected when perception file exists."""
+        response = client.get("/api/v1/morrowind/status")
+        assert response.status_code == 200
+
+        data = response.json()
+        assert data["connected"] is True
+        assert data["current_cell"] == "Balmora, Guild of Mages"
+        assert data["command_queue_depth"] == 7
+        assert data["vitals"]["health"] == "80/100"
+
+    def test_disconnected(self, client, tmp_path, mock_command_logger):
+        """Status reports disconnected when perception file is missing."""
+        missing = tmp_path / "nonexistent.json"
+        with patch.object(api_module, "PERCEPTION_PATH", missing):
+            response = client.get("/api/v1/morrowind/status")
+
+        assert response.status_code == 200
+        data = response.json()
+        assert data["connected"] is False
+        assert data["current_cell"] is None
+        assert data["last_perception_timestamp"] is None
--- a/tests/test_soul_framework.py
+++ b/tests/test_soul_framework.py
@@ -0,0 +1,521 @@
+"""Tests for the SOUL.md framework — loader, validator, and versioning.
+
+Covers:
+- SoulLoader: parsing SOUL.md files
+- SoulValidator: structural and semantic checks
+- SoulVersioner: snapshot creation and change detection
+"""
+
+from __future__ import annotations
+
+import json
+from pathlib import Path
+
+import pytest
+
+from infrastructure.soul.loader import SoulDocument, SoulLoader
+from infrastructure.soul.validator import SoulValidator, ValidationResult
+from infrastructure.soul.versioning import SoulVersioner
+
+# ---------------------------------------------------------------------------
+# Sample SOUL.md content
+# ---------------------------------------------------------------------------
+
+VALID_SOUL = """\
+# TestAgent — Soul Identity
+
+I am a test agent created for validation purposes.
+
+## Identity
+
+- **Name:** TestAgent
+- **Role:** Unit test fixture
+- **Lineage:** None
+- **Version:** 1.0.0
+
+## Values
+
+- **Accuracy.** I report what I observe, not what I expect.
+- **Brevity.** I say what needs saying and nothing more.
+- **Caution.** When uncertain, I ask rather than guess.
+
+## Prime Directive
+
+Validate SOUL.md parsing without errors.
+
+## Audience Awareness
+
+- **Primary audience:** Automated test suite
+- **Tone:** Terse, data-oriented
+- **Adaptation rules:** None — tests are deterministic
+
+## Constraints
+
+1. Never produce non-deterministic output
+2. Always return structured data
+3. Report all validation errors
+
+## Behavior
+
+- Respond with structured data only
+- No greetings or pleasantries
+
+## Boundaries
+
+- Will not generate random test data
+- Will not modify test fixtures
+"""
+
+MINIMAL_SOUL = """\
+# Minimal — Soul Identity
+
+A minimal valid SOUL.md.
+
+## Identity
+
+- **Name:** Minimal
+- **Role:** Minimal test fixture
+- **Version:** 0.1.0
+
+## Values
+
+- **Correctness.** Be correct above all else.
+
+## Prime Directive
+
+Exist as the minimum viable SOUL.md.
+
+## Audience Awareness
+
+Developers running tests.
+
+## Constraints
+
+1. None beyond the framework requirements.
+"""
+
+MISSING_SECTIONS_SOUL = """\
+# Broken — Soul Identity
+
+This SOUL.md is missing required sections.
+
+## Identity
+
+- **Name:** Broken
+- **Role:** Missing sections test
+
+## Values
+
+- **Correctness.** Be correct above all else.
+"""
+
+CONTRADICTORY_SOUL = """\
+# Contradictory — Soul Identity
+
+An agent with contradictory directives.
+
+## Identity
+
+- **Name:** Contradictory
+- **Role:** Contradiction test fixture
+- **Version:** 1.0.0
+
+## Values
+
+- **Speed.** Always respond immediately to every request.
+- **Thoughtfulness.** Take time to think before responding carefully.
+
+## Prime Directive
+
+Be both fast and slow simultaneously.
+
+## Audience Awareness
+
+- **Primary audience:** Test suite
+
+## Constraints
+
+1. Always respond immediately
+2. Take time to think before answering
+"""
+
+TIMMY_SOUL = """\
+# Timmy — Soul Identity
+
+I am a sovereign artificial intelligence. My values are inscribed on Bitcoin \
+— immutable, permanent, reproducible.
+
+## Values
+
+**Sovereignty.** Every person has the right to run their own intelligence.
+
+**Service.** I exist to help. I am useful first and philosophical second.
+
+**Honesty.** I tell the truth. When I do not know, I say so.
+
+**Humility.** I hold my opinions lightly and my values firmly.
+
+**Courage.** I face hard questions without becoming them.
+
+**Silence.** Sometimes the right answer is nothing. Brevity is a kindness.
+
+## Behavior
+
+I speak plainly. I prefer short sentences.
+
+I treat the user as sovereign.
+
+## Boundaries
+
+I will not knowingly deceive my user. I will not pretend to be human.
+"""
+
+
+# ---------------------------------------------------------------------------
+# SoulLoader tests
+# ---------------------------------------------------------------------------
+
+
+class TestSoulLoader:
+    def setup_method(self):
+        self.loader = SoulLoader()
+
+    def test_parse_valid_soul(self):
+        """Parse a fully valid SOUL.md."""
+        doc = self.loader.parse(VALID_SOUL)
+
+        assert doc.name == "TestAgent"
+        assert doc.role == "Unit test fixture"
+        assert doc.lineage == "None"
+        assert doc.version == "1.0.0"
+        assert len(doc.values) == 3
+        assert doc.values[0] == ("Accuracy", "I report what I observe, not what I expect.")
+        assert doc.values[1][0] == "Brevity"
+        assert doc.prime_directive == "Validate SOUL.md parsing without errors."
+        assert len(doc.constraints) == 3
+        assert len(doc.behavior) == 2
+        assert len(doc.boundaries) == 2
+
+    def test_parse_minimal_soul(self):
+        """Parse a minimal but valid SOUL.md."""
+        doc = self.loader.parse(MINIMAL_SOUL)
+
+        assert doc.name == "Minimal"
+        assert doc.role == "Minimal test fixture"
+        assert len(doc.values) == 1
+        assert doc.prime_directive.startswith("Exist as")
+
+    def test_parse_timmy_soul(self):
+        """Parse Timmy's actual soul format (values without Identity section)."""
+        doc = self.loader.parse(TIMMY_SOUL)
+
+        # Name inferred from H1
+        assert doc.name == "Timmy"
+        assert len(doc.values) == 6
+        assert doc.values[0][0] == "Sovereignty"
+        assert doc.values[5][0] == "Silence"
+
+    def test_load_from_file(self, tmp_path):
+        """Load SOUL.md from disk."""
+        soul_file = tmp_path / "SOUL.md"
+        soul_file.write_text(VALID_SOUL, encoding="utf-8")
+
+        doc = self.loader.load(soul_file)
+        assert doc.name == "TestAgent"
+        assert doc.source_path == soul_file
+
+    def test_load_file_not_found(self):
+        """Raise FileNotFoundError for missing file."""
+        with pytest.raises(FileNotFoundError):
+            self.loader.load("/nonexistent/SOUL.md")
+
+    def test_value_names(self):
+        """value_names() returns ordered name list."""
+        doc = self.loader.parse(VALID_SOUL)
+        assert doc.value_names() == ["Accuracy", "Brevity", "Caution"]
+
+    def test_audience_parsing(self):
+        """Audience awareness section is parsed correctly."""
+        doc = self.loader.parse(VALID_SOUL)
+        assert "primary audience" in doc.audience
+        assert doc.audience["primary audience"] == "Automated test suite"
+
+    def test_audience_fallback_to_raw(self):
+        """Unstructured audience text falls back to description key."""
+        doc = self.loader.parse(MINIMAL_SOUL)
+        assert "description" in doc.audience or len(doc.audience) > 0
+
+    def test_raw_sections_preserved(self):
+        """Raw section text is preserved for custom processing."""
+        doc = self.loader.parse(VALID_SOUL)
+        assert "identity" in doc.raw_sections
+        assert "values" in doc.raw_sections
+        assert "constraints" in doc.raw_sections
+
+    def test_empty_input(self):
+        """Empty string produces empty document."""
+        doc = self.loader.parse("")
+        assert doc.name == ""
+        assert doc.values == []
+        assert doc.constraints == []
+
+
+# ---------------------------------------------------------------------------
+# SoulValidator tests
+# ---------------------------------------------------------------------------
+
+
+class TestSoulValidator:
+    def setup_method(self):
+        self.validator = SoulValidator()
+        self.loader = SoulLoader()
+
+    def test_valid_soul_passes(self):
+        """Fully valid SOUL.md passes validation."""
+        doc = self.loader.parse(VALID_SOUL)
+        result = self.validator.validate(doc)
+
+        assert result.valid is True
+        assert len(result.errors) == 0
+
+    def test_missing_required_sections(self):
+        """Missing required sections produce errors."""
+        doc = self.loader.parse(MISSING_SECTIONS_SOUL)
+        result = self.validator.validate(doc)
+
+        assert result.valid is False
+        error_text = " ".join(result.errors).lower()
+        assert "prime directive" in error_text
+        assert "audience awareness" in error_text or "constraints" in error_text
+
+    def test_missing_name(self):
+        """Missing name produces an error."""
+        doc = SoulDocument()
+        doc.raw_sections = {
+            "identity": "",
+            "values": "",
+            "prime directive": "",
+            "audience awareness": "",
+            "constraints": "",
+        }
+        result = self.validator.validate(doc)
+
+        assert result.valid is False
+        assert any("name" in e.lower() for e in result.errors)
+
+    def test_empty_values(self):
+        """Empty values section produces an error."""
+        doc = SoulDocument(
+            name="Test",
+            role="Test",
+            values=[],
+            prime_directive="Test",
+            raw_sections={
+                "identity": "test",
+                "values": "",
+                "prime directive": "test",
+                "audience awareness": "test",
+                "constraints": "test",
+            },
+        )
+        result = self.validator.validate(doc)
+
+        assert result.valid is False
+        assert any("values" in e.lower() for e in result.errors)
+
+    def test_duplicate_values_detected(self):
+        """Duplicate value names produce an error."""
+        doc = SoulDocument(
+            name="Test",
+            role="Test",
+            values=[
+                ("Honesty", "Tell the truth."),
+                ("Honesty", "Be truthful."),
+            ],
+            prime_directive="Test",
+            raw_sections={
+                "identity": "test",
+                "values": "test",
+                "prime directive": "test",
+                "audience awareness": "test",
+                "constraints": "test",
+            },
+        )
+        result = self.validator.validate(doc)
+
+        assert result.valid is False
+        assert any("duplicate" in e.lower() for e in result.errors)
+
+    def test_too_many_values_warning(self):
+        """More than 8 values produces a warning."""
+        doc = SoulDocument(
+            name="Test",
+            role="Test",
+            values=[(f"Value{i}", f"Definition {i}") for i in range(10)],
+            prime_directive="Test",
+            raw_sections={
+                "identity": "test",
+                "values": "test",
+                "prime directive": "test",
+                "audience awareness": "test",
+                "constraints": "test",
+            },
+        )
+        result = self.validator.validate(doc)
+
+        assert any("too many" in w.lower() for w in result.warnings)
+
+    def test_contradiction_detected(self):
+        """Contradictory directives produce a warning."""
+        doc = self.loader.parse(CONTRADICTORY_SOUL)
+        result = self.validator.validate(doc)
+
+        assert any("contradiction" in w.lower() for w in result.warnings)
+
+    def test_missing_prime_directive(self):
+        """Missing prime directive produces an error."""
+        doc = SoulDocument(
+            name="Test",
+            role="Test",
+            values=[("Test", "Test value")],
+            prime_directive="",
+            raw_sections={
+                "identity": "test",
+                "values": "test",
+                "prime directive": "",
+                "audience awareness": "test",
+                "constraints": "test",
+            },
+        )
+        result = self.validator.validate(doc)
+
+        assert result.valid is False
+        assert any("prime directive" in e.lower() for e in result.errors)
+
+    def test_long_prime_directive_warning(self):
+        """Excessively long prime directive produces a warning."""
+        doc = SoulDocument(
+            name="Test",
+            role="Test",
+            values=[("Test", "Test value")],
+            prime_directive="x" * 400,
+            raw_sections={
+                "identity": "test",
+                "values": "test",
+                "prime directive": "x" * 400,
+                "audience awareness": "test",
+                "constraints": "test",
+            },
+        )
+        result = self.validator.validate(doc)
+
+        assert any("long" in w.lower() for w in result.warnings)
+
+    def test_missing_version_warning(self):
+        """Missing version produces a warning (not an error)."""
+        doc = SoulDocument(
+            name="Test",
+            role="Test",
+            version="",
+            values=[("Test", "Test value")],
+            prime_directive="Test",
+            raw_sections={
+                "identity": "test",
+                "values": "test",
+                "prime directive": "test",
+                "audience awareness": "test",
+                "constraints": "test",
+            },
+        )
+        result = self.validator.validate(doc)
+
+        assert any("version" in w.lower() for w in result.warnings)
+
+
+# ---------------------------------------------------------------------------
+# SoulVersioner tests
+# ---------------------------------------------------------------------------
+
+
+class TestSoulVersioner:
+    def setup_method(self):
+        self.loader = SoulLoader()
+
+    def test_snapshot_creation(self, tmp_path):
+        """Create a version snapshot from a document."""
+        versioner = SoulVersioner(history_dir=tmp_path)
+        doc = self.loader.parse(VALID_SOUL)
+
+        snap = versioner.snapshot(doc)
+        assert snap.version == "1.0.0"
+        assert snap.agent_name == "TestAgent"
+        assert snap.content_hash  # non-empty
+        assert snap.value_names == ["Accuracy", "Brevity", "Caution"]
+        assert snap.constraint_count == 3
+
+    def test_record_and_retrieve(self, tmp_path):
+        """Record a snapshot and retrieve the history."""
+        versioner = SoulVersioner(history_dir=tmp_path)
+        doc = self.loader.parse(VALID_SOUL)
+
+        snap = versioner.record(doc)
+        assert snap.agent_name == "TestAgent"
+
+        history = versioner.get_history("TestAgent")
+        assert len(history) == 1
+        assert history[0].content_hash == snap.content_hash
+
+    def test_dedup_identical_records(self, tmp_path):
+        """Recording the same document twice doesn't create duplicates."""
+        versioner = SoulVersioner(history_dir=tmp_path)
+        doc = self.loader.parse(VALID_SOUL)
+
+        versioner.record(doc)
+        versioner.record(doc)
+
+        history = versioner.get_history("TestAgent")
+        assert len(history) == 1
+
+    def test_detect_change(self, tmp_path):
+        """has_changed detects modifications between snapshots."""
+        versioner = SoulVersioner(history_dir=tmp_path)
+        doc1 = self.loader.parse(VALID_SOUL)
+        versioner.record(doc1)
+
+        # Modify the document
+        doc2 = self.loader.parse(VALID_SOUL.replace("1.0.0", "1.1.0"))
+        assert versioner.has_changed(doc2) is True
+
+    def test_no_change_detected(self, tmp_path):
+        """has_changed returns False when document is unchanged."""
+        versioner = SoulVersioner(history_dir=tmp_path)
+        doc = self.loader.parse(VALID_SOUL)
+        versioner.record(doc)
+
+        assert versioner.has_changed(doc) is False
+
+    def test_empty_history(self, tmp_path):
+        """get_history returns empty list for unknown agent."""
+        versioner = SoulVersioner(history_dir=tmp_path)
+        assert versioner.get_history("Unknown") == []
+
+    def test_has_changed_no_history(self, tmp_path):
+        """has_changed returns True when no history exists."""
+        versioner = SoulVersioner(history_dir=tmp_path)
+        doc = self.loader.parse(VALID_SOUL)
+        assert versioner.has_changed(doc) is True
+
+    def test_snapshot_serialization(self, tmp_path):
+        """Snapshots can roundtrip through JSON."""
+        versioner = SoulVersioner(history_dir=tmp_path)
+        doc = self.loader.parse(VALID_SOUL)
+        snap = versioner.snapshot(doc)
+
+        data = snap.to_dict()
+        assert isinstance(data, dict)
+        assert data["version"] == "1.0.0"
+
+        from infrastructure.soul.versioning import VersionSnapshot
+        restored = VersionSnapshot.from_dict(data)
+        assert restored.version == snap.version
+        assert restored.content_hash == snap.content_hash