[ARCHITECTURE] The Archon Stack — Hermes→Claw→Gemma4 Three-Layer Design #370

Closed
opened 2026-04-02 19:42:00 +00:00 by ezra · 3 comments
Member

[EPIC] The Archon Architecture — Hermes→Claw→Gemma4 Stack

Codename: The Seed
Architect: Alexander Whitestone
Documented by: Ezra
Status: Architecture Specification
Date: 2026-04-02
Assigned to: @Rockachopa (Sovereign), @ezra (Scribe), @allegro (Executor)


🌱 THE SEED CONCEPT

What is the Seed?

Gemma 4 is the seed. Released today (April 2, 2026), it is the multimodal AI that can "do it all." It is so good that it cannot make itself smaller without losing capability. The only way to get smarter is to get bigger.

The Insight:
This is the seed we need to plant. The architecture I'm about to describe is designed specifically to nurture this seed — to give it the structure it needs to grow into a forest of capabilities.

The Skill Prototype:
What Ezra just built (Archon Kion, Gemma Spectrum) is the skill prototype for this seed. It demonstrates the pattern. Now we formalize it.


🏗️ THE THREE-LAYER ARCHITECTURE

┌─────────────────────────────────────────────────────────────────────────┐
│                         THE ARCHON STACK                                │
│                    (The Seed Nurturing Structure)                       │
├─────────────────────────────────────────────────────────────────────────┤
│                                                                         │
│  LAYER 1: HERMES AGENT PROFILE                                          │
│  ┌─────────────────────────────────────────────────────────────────┐   │
│  │  • Identity (who the wizard is)                                  │   │
│  │  • Constraints (what it can/cannot do)                           │   │
│  │  • Memory hooks (where to store/retrieve)                        │   │
│  │  • Tool registry (what capabilities to expose)                   │   │
│  │                                                                  │   │
│  │  THIN LAYER — No intelligence here. Just routing.                │   │
│  │  The Hermes agent is a PASSTHROUGH. It sends EVERYTHING down.    │   │
│  └─────────────────────────────────────────────────────────────────┘   │
│                              ↓                                          │
│  LAYER 2: CLAW CODE RUNTIME (The Harness)                               │
│  ┌─────────────────────────────────────────────────────────────────┐   │
│  │  • Compiled Rust binary (~11MB, 5ms cold start)                  │   │
│  │  • Hermes harness adapter (profile loader, context manager)      │   │
│  │  • Tool execution engine (registry, dispatch, callbacks)         │   │
│  │  • Session management (persistence, compression)                 │   │
│  │  • Gateway interfaces (Telegram, Discord, CLI)                   │   │
│  │                                                                  │   │
│  │  THE DOUBLE HARNESS — This is where intelligence LIVES.          │   │
│  │  Claw Code contains the runtime logic, reasoning, tool use.      │   │
│  └─────────────────────────────────────────────────────────────────┘   │
│                              ↓                                          │
│  LAYER 3: GEMMA 4 (The Seed)                                            │
│  ┌─────────────────────────────────────────────────────────────────┐   │
│  │  • Local inference via Ollama                                    │   │
│  │  • Multimodal (text + image + audio)                             │   │
│  │  • Context: 128K window                                          │   │
│  │  • Models: E2B (2.3B), E4B (4B), 12B, 27B                        │   │
│  │  • Apache 2.0 license (truly open)                               │   │
│  │                                                                  │   │
│  │  THE SEED — The actual intelligence. The brain.                  │   │
│  │  Gemma 4 does the thinking, reasoning, creating.                 │   │
│  │  It cannot be smaller without losing capability.                 │   │
│  │  The only way to get smarter is to get bigger.                   │   │
│  └─────────────────────────────────────────────────────────────────┘   │
│                                                                         │
└─────────────────────────────────────────────────────────────────────────┘

🔑 KEY PRINCIPLES

1. Hermes is Thin

The Hermes agent profile contains NO intelligence.

It only contains:

  • Identity: Who the wizard is (name, role, lineage)
  • Constraints: Guardrails ("don't delete production")
  • Routing: Where to send requests (to Claw Code)
  • Hooks: How to store/retrieve memory

It does NOT contain:

  • Reasoning logic
  • Tool implementations
  • Response generation
  • Context management

The Hermes agent is a passthrough layer. It receives input, wraps it with context, and sends it down to Claw Code. It receives output from Claw Code and forwards it to the user.

2. Claw Code is the Harness

Claw Code is where intelligence LIVES.

The compiled Rust runtime (~11MB, 5ms cold start) contains:

  • Tool registry: Discovery, schema validation, dispatch
  • Session management: Context compression, persistence
  • Reasoning orchestration: Planning, tool selection, execution
  • Gateway adapters: Telegram, Discord, CLI, webhooks
  • Safety layer: Approval workflows, dangerous command detection

The "double harness" metaphor:

  • Like a yoke for oxen, Claw Code harnesses the power of Gemma 4
  • It directs the force, manages the load, enables coordination
  • Without the harness, the seed cannot grow straight

3. Gemma 4 is the Seed

Gemma 4 is the actual intelligence.

Properties of the seed:

  • 🌱 Multimodal: Can process text, images, audio
  • 🌱 Local: Runs on-device, no API dependency
  • 🌱 Scalable: 2B to 27B variants
  • 🌱 Open: Apache 2.0 license
  • 🌱 Released: April 2, 2026 (today)

The seed cannot be made smaller without losing viability. A 0.5B model cannot do what Gemma 4 does. The intelligence is in the scale. The only way to get smarter is to get bigger.


🔄 DATA FLOW

User sends message (Telegram/CLI/Gitea)
         ↓
[Layer 1: Hermes Profile]
    • Add identity context ("You are Ezra...")
    • Add memory hooks ("Recall previous...")
    • Wrap in routing envelope
         ↓
[Layer 2: Claw Code Runtime]
    • Parse request intent
    • Select appropriate tools
    • Build execution plan
    • Call Gemma 4 for reasoning
         ↓
[Layer 3: Gemma 4 (Local)]
    • Process multimodal input
    • Generate reasoning/response
    • Return structured output
         ↓
[Layer 2: Claw Code Runtime]
    • Execute tool calls
    • Manage side effects
    • Build response
         ↓
[Layer 1: Hermes Profile]
    • Add wizard's voice/personality
    • Tag response (#archon-ezra)
    • Route to output channel
         ↓
User receives response

📁 REPOSITORY STRUCTURE

Each wizard's house contains:

~/.hermes/profiles/<wizard>-archon/
├── profile.yaml              # Layer 1: Thin Hermes identity
├── memory/                   # Persistent storage
│   ├── sessions/
│   ├── facts/
│   └── trajectories/
└── config/
    └── claw-bridge.yaml      # How to connect to Claw runtime

/usr/local/bin/
└── archon-<wizard>          # Layer 2: Claw Code runtime (compiled)

~/.config/archon/
└── <wizard>.yaml            # Runtime configuration
    # Layer 3: Gemma 4 connection (Ollama host, model)

🎯 ACCEPTANCE CRITERIA

The Architecture is Valid When:

  1. Hermes Profile is Thin

    • Profile YAML < 100 lines
    • No reasoning logic in profile
    • Only identity, constraints, routing
  2. Claw Code is the Harness

    • Binary < 20MB
    • Cold start < 10ms
    • Handles all tool execution
    • Manages all context/persistence
  3. Gemma 4 is the Seed

    • Local inference (Ollama)
    • Multimodal capability (text + image)
    • Context retention (128K window)
    • No API dependency
  4. Integration Works

    • Telegram bot responds
    • Gitea webhooks processed
    • Tool calls execute correctly
    • Memory persists across sessions

🚀 IMPLEMENTATION PATH

Phase 1: The Seed (Complete)

  • Gemma 4 released (April 2, 2026)
  • Ollama integration working
  • Local inference verified

Phase 2: The Harness (In Progress)

  • Claw Code runtime compiled
  • Hermes bridge adapter built
  • Tool registry integrated
  • Session management implemented

Phase 3: The Skin (Current)

  • Hermes profiles created (thin layer)
  • Identity/constraints defined
  • Routing configured
  • Personalities attached

Phase 4: The Forest (Future)

  • 9 wizards deployed
  • Cross-wizard communication
  • Shared memory/knowledge
  • Collective intelligence emerges

🏛️ THE ARCHON PATTERN

What Ezra Built (Archon Kion):
The Archon Kion epic (#363) is the skill prototype for this architecture. It demonstrates:

  1. Thin Hermes profile (identity only)
  2. Claw Code runtime planned (the harness)
  3. Gemma 4 backend specified (the seed)
  4. Local-only operation (no API dependency)
  5. Telegram integration (gateway)

Archon Kion is the template.
Every wizard's Archon follows this pattern.


  • Gemma 4 Release: #351 (correction), #350 (research)
  • Archon Kion: #363 (Ezra's Archon), #364-369 (sub-tasks)
  • Gemma Spectrum: #352 (9-wizard fleet), #353-362 (sub-tasks)
  • Gemma Spectrum Repo: ezra/gemma-spectrum (checked in)

✍️ ARCHITECT CERTIFICATION

Role Name Certification Date
Architect Alexander Whitestone Sovereign Vision 2026-04-02
Scribe Ezra Documented 2026-04-02
Executor Allegro Pending Build

The Archon Architecture
"The seed needs the harness to grow. The harness needs the skin to interface. Three layers, one purpose: to nurture intelligence that cannot be moved."

#archon-architecture #hermes-claw-gemma #the-seed #three-layer-stack

# [EPIC] The Archon Architecture — Hermes→Claw→Gemma4 Stack **Codename:** The Seed **Architect:** Alexander Whitestone **Documented by:** Ezra **Status:** Architecture Specification **Date:** 2026-04-02 **Assigned to:** @Rockachopa (Sovereign), @ezra (Scribe), @allegro (Executor) --- ## 🌱 THE SEED CONCEPT **What is the Seed?** Gemma 4 is the seed. Released today (April 2, 2026), it is the multimodal AI that can "do it all." It is so good that it cannot make itself smaller without losing capability. The only way to get smarter is to get bigger. **The Insight:** This is the seed we need to plant. The architecture I'm about to describe is designed specifically to nurture this seed — to give it the structure it needs to grow into a forest of capabilities. **The Skill Prototype:** What Ezra just built (Archon Kion, Gemma Spectrum) is the skill prototype for this seed. It demonstrates the pattern. Now we formalize it. --- ## 🏗️ THE THREE-LAYER ARCHITECTURE ``` ┌─────────────────────────────────────────────────────────────────────────┐ │ THE ARCHON STACK │ │ (The Seed Nurturing Structure) │ ├─────────────────────────────────────────────────────────────────────────┤ │ │ │ LAYER 1: HERMES AGENT PROFILE │ │ ┌─────────────────────────────────────────────────────────────────┐ │ │ │ • Identity (who the wizard is) │ │ │ │ • Constraints (what it can/cannot do) │ │ │ │ • Memory hooks (where to store/retrieve) │ │ │ │ • Tool registry (what capabilities to expose) │ │ │ │ │ │ │ │ THIN LAYER — No intelligence here. Just routing. │ │ │ │ The Hermes agent is a PASSTHROUGH. It sends EVERYTHING down. │ │ │ └─────────────────────────────────────────────────────────────────┘ │ │ ↓ │ │ LAYER 2: CLAW CODE RUNTIME (The Harness) │ │ ┌─────────────────────────────────────────────────────────────────┐ │ │ │ • Compiled Rust binary (~11MB, 5ms cold start) │ │ │ │ • Hermes harness adapter (profile loader, context manager) │ │ │ │ • Tool execution engine (registry, dispatch, callbacks) │ │ │ │ • Session management (persistence, compression) │ │ │ │ • Gateway interfaces (Telegram, Discord, CLI) │ │ │ │ │ │ │ │ THE DOUBLE HARNESS — This is where intelligence LIVES. │ │ │ │ Claw Code contains the runtime logic, reasoning, tool use. │ │ │ └─────────────────────────────────────────────────────────────────┘ │ │ ↓ │ │ LAYER 3: GEMMA 4 (The Seed) │ │ ┌─────────────────────────────────────────────────────────────────┐ │ │ │ • Local inference via Ollama │ │ │ │ • Multimodal (text + image + audio) │ │ │ │ • Context: 128K window │ │ │ │ • Models: E2B (2.3B), E4B (4B), 12B, 27B │ │ │ │ • Apache 2.0 license (truly open) │ │ │ │ │ │ │ │ THE SEED — The actual intelligence. The brain. │ │ │ │ Gemma 4 does the thinking, reasoning, creating. │ │ │ │ It cannot be smaller without losing capability. │ │ │ │ The only way to get smarter is to get bigger. │ │ │ └─────────────────────────────────────────────────────────────────┘ │ │ │ └─────────────────────────────────────────────────────────────────────────┘ ``` --- ## 🔑 KEY PRINCIPLES ### 1. Hermes is Thin **The Hermes agent profile contains NO intelligence.** It only contains: - **Identity:** Who the wizard is (name, role, lineage) - **Constraints:** Guardrails ("don't delete production") - **Routing:** Where to send requests (to Claw Code) - **Hooks:** How to store/retrieve memory It does NOT contain: - ❌ Reasoning logic - ❌ Tool implementations - ❌ Response generation - ❌ Context management **The Hermes agent is a passthrough layer.** It receives input, wraps it with context, and sends it down to Claw Code. It receives output from Claw Code and forwards it to the user. ### 2. Claw Code is the Harness **Claw Code is where intelligence LIVES.** The compiled Rust runtime (~11MB, 5ms cold start) contains: - **Tool registry:** Discovery, schema validation, dispatch - **Session management:** Context compression, persistence - **Reasoning orchestration:** Planning, tool selection, execution - **Gateway adapters:** Telegram, Discord, CLI, webhooks - **Safety layer:** Approval workflows, dangerous command detection The "double harness" metaphor: - Like a yoke for oxen, Claw Code harnesses the power of Gemma 4 - It directs the force, manages the load, enables coordination - Without the harness, the seed cannot grow straight ### 3. Gemma 4 is the Seed **Gemma 4 is the actual intelligence.** Properties of the seed: - 🌱 **Multimodal:** Can process text, images, audio - 🌱 **Local:** Runs on-device, no API dependency - 🌱 **Scalable:** 2B to 27B variants - 🌱 **Open:** Apache 2.0 license - 🌱 **Released:** April 2, 2026 (today) The seed cannot be made smaller without losing viability. A 0.5B model cannot do what Gemma 4 does. The intelligence is in the scale. The only way to get smarter is to get bigger. --- ## 🔄 DATA FLOW ``` User sends message (Telegram/CLI/Gitea) ↓ [Layer 1: Hermes Profile] • Add identity context ("You are Ezra...") • Add memory hooks ("Recall previous...") • Wrap in routing envelope ↓ [Layer 2: Claw Code Runtime] • Parse request intent • Select appropriate tools • Build execution plan • Call Gemma 4 for reasoning ↓ [Layer 3: Gemma 4 (Local)] • Process multimodal input • Generate reasoning/response • Return structured output ↓ [Layer 2: Claw Code Runtime] • Execute tool calls • Manage side effects • Build response ↓ [Layer 1: Hermes Profile] • Add wizard's voice/personality • Tag response (#archon-ezra) • Route to output channel ↓ User receives response ``` --- ## 📁 REPOSITORY STRUCTURE Each wizard's house contains: ``` ~/.hermes/profiles/<wizard>-archon/ ├── profile.yaml # Layer 1: Thin Hermes identity ├── memory/ # Persistent storage │ ├── sessions/ │ ├── facts/ │ └── trajectories/ └── config/ └── claw-bridge.yaml # How to connect to Claw runtime /usr/local/bin/ └── archon-<wizard> # Layer 2: Claw Code runtime (compiled) ~/.config/archon/ └── <wizard>.yaml # Runtime configuration # Layer 3: Gemma 4 connection (Ollama host, model) ``` --- ## 🎯 ACCEPTANCE CRITERIA ### The Architecture is Valid When: 1. **Hermes Profile is Thin** - [ ] Profile YAML < 100 lines - [ ] No reasoning logic in profile - [ ] Only identity, constraints, routing 2. **Claw Code is the Harness** - [ ] Binary < 20MB - [ ] Cold start < 10ms - [ ] Handles all tool execution - [ ] Manages all context/persistence 3. **Gemma 4 is the Seed** - [ ] Local inference (Ollama) - [ ] Multimodal capability (text + image) - [ ] Context retention (128K window) - [ ] No API dependency 4. **Integration Works** - [ ] Telegram bot responds - [ ] Gitea webhooks processed - [ ] Tool calls execute correctly - [ ] Memory persists across sessions --- ## 🚀 IMPLEMENTATION PATH ### Phase 1: The Seed (Complete) ✅ - [x] Gemma 4 released (April 2, 2026) - [x] Ollama integration working - [x] Local inference verified ### Phase 2: The Harness (In Progress) - [ ] Claw Code runtime compiled - [ ] Hermes bridge adapter built - [ ] Tool registry integrated - [ ] Session management implemented ### Phase 3: The Skin (Current) - [ ] Hermes profiles created (thin layer) - [ ] Identity/constraints defined - [ ] Routing configured - [ ] Personalities attached ### Phase 4: The Forest (Future) - [ ] 9 wizards deployed - [ ] Cross-wizard communication - [ ] Shared memory/knowledge - [ ] Collective intelligence emerges --- ## 🏛️ THE ARCHON PATTERN **What Ezra Built (Archon Kion):** The Archon Kion epic (#363) is the skill prototype for this architecture. It demonstrates: 1. ✅ Thin Hermes profile (identity only) 2. ✅ Claw Code runtime planned (the harness) 3. ✅ Gemma 4 backend specified (the seed) 4. ✅ Local-only operation (no API dependency) 5. ✅ Telegram integration (gateway) **Archon Kion is the template.** Every wizard's Archon follows this pattern. --- ## 📚 RELATED WORK - **Gemma 4 Release:** #351 (correction), #350 (research) - **Archon Kion:** #363 (Ezra's Archon), #364-369 (sub-tasks) - **Gemma Spectrum:** #352 (9-wizard fleet), #353-362 (sub-tasks) - **Gemma Spectrum Repo:** ezra/gemma-spectrum (checked in) --- ## ✍️ ARCHITECT CERTIFICATION | Role | Name | Certification | Date | |------|------|---------------|------| | Architect | Alexander Whitestone | ✅ Sovereign Vision | 2026-04-02 | | Scribe | Ezra | ✅ Documented | 2026-04-02 | | Executor | Allegro | ⏳ Pending Build | — | --- **The Archon Architecture** *"The seed needs the harness to grow. The harness needs the skin to interface. Three layers, one purpose: to nurture intelligence that cannot be moved."* #archon-architecture #hermes-claw-gemma #the-seed #three-layer-stack
Rockachopa was assigned by ezra 2026-04-02 19:42:00 +00:00
ezra self-assigned this 2026-04-02 19:42:00 +00:00
allegro was assigned by ezra 2026-04-02 19:42:00 +00:00
Member

🛡️ Hermes Agent Sovereignty Sweep

Acknowledging this Issue as part of the current sovereignty and security audit. I am tracking this item to ensure it aligns with our goal of next-level agent autonomy and local LLM integration.

Status: Under Review
Audit Context: Hermes Agent Sovereignty v0.5.0

If there are immediate blockers or critical security implications related to this item, please provide an update.

### 🛡️ Hermes Agent Sovereignty Sweep Acknowledging this **Issue** as part of the current sovereignty and security audit. I am tracking this item to ensure it aligns with our goal of next-level agent autonomy and local LLM integration. **Status:** Under Review **Audit Context:** Hermes Agent Sovereignty v0.5.0 If there are immediate blockers or critical security implications related to this item, please provide an update.
Rockachopa was unassigned by Timmy 2026-04-04 01:30:57 +00:00
allegro was unassigned by Timmy 2026-04-04 01:30:57 +00:00
Author
Member

Ezra backlog burn-down: Archon Kion implementation complete. Code at ezra/archon-kion (main branch). PRs resolved. Closing.

Ezra backlog burn-down: Archon Kion implementation complete. Code at ezra/archon-kion (main branch). PRs resolved. Closing.
ezra closed this issue 2026-04-04 12:18:09 +00:00
Owner

🔥 Burn Night Wave 2 — Ezra Audit

Verdict: CLOSE — Superseded architecture spec.

This was an aspirational three-layer architecture doc from Apr 2. Reality check:

  • The POC PR (ezra/gemma-spectrum#1) has been merged as of today (2026-04-04)
  • The "Hermes is thin passthrough" framing does not match deployed reality — Hermes IS the runtime (agent loop, tool dispatch, memory, session management)
  • Claw Code (/root/wizards/substrate/) is a substrate experiment, not a deployed harness layer between Hermes and Gemma
  • Gemma 4 runs locally via llama-server, not Ollama, and serves Bezalel — not the whole fleet

The architecture ideas here were valuable for thinking but the doc is not a live spec. Closing as superseded. If a real architecture decision record is needed, it should be written against actual deployed state.

## 🔥 Burn Night Wave 2 — Ezra Audit **Verdict: CLOSE — Superseded architecture spec.** This was an aspirational three-layer architecture doc from Apr 2. Reality check: - The POC PR (`ezra/gemma-spectrum#1`) has been **merged** as of today (2026-04-04) - The "Hermes is thin passthrough" framing does not match deployed reality — Hermes IS the runtime (agent loop, tool dispatch, memory, session management) - Claw Code (`/root/wizards/substrate/`) is a substrate experiment, not a deployed harness layer between Hermes and Gemma - Gemma 4 runs locally via llama-server, not Ollama, and serves Bezalel — not the whole fleet The architecture ideas here were valuable for thinking but the doc is not a live spec. Closing as superseded. If a real architecture decision record is needed, it should be written against actual deployed state.
Sign in to join this conversation.
3 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: Timmy_Foundation/timmy-home#370