docs(honcho): rewrite Honcho Memory docs as full feature documentation

Replaces the stub docs with comprehensive coverage: setup (interactive + manual), all config fields, memory modes, recall modes, write frequency, session strategies, host blocks, async prefetch pipeline, dual-peer architecture, dynamic reasoning, gateway integration, four tools, full CLI reference, migration paths, and AI peer identity. Trims the Honcho section in memory.md to a cross-reference.
2026-03-10 16:49:14 -04:00
parent 87349b9bc1
commit 960c1521f3
2 changed files with 225 additions and 104 deletions
--- a/website/docs/user-guide/features/honcho.md
+++ b/website/docs/user-guide/features/honcho.md
@@ -7,37 +7,47 @@ sidebar_position: 8

 # Honcho Memory

-[Honcho](https://honcho.dev) is an AI-native memory system that gives Hermes Agent persistent, cross-session understanding of users. While Hermes has built-in memory (`MEMORY.md` and `USER.md` files), Honcho adds a deeper layer of **user modeling** — learning user preferences, goals, communication style, and context across conversations.
+[Honcho](https://honcho.dev) is an AI-native memory system that gives Hermes persistent, cross-session understanding of users. While Hermes has built-in memory (`MEMORY.md` and `USER.md`), Honcho adds a deeper layer of **user modeling** — learning preferences, goals, communication style, and context across conversations via a dual-peer architecture where both the user and the AI build representations over time.

-## How It Complements Built-in Memory
+## Works Alongside Built in Memory

-Hermes has two memory systems that work together:
+Runs `hybrid` (`local` + `honcho`) by default. 

 | Feature | Built-in Memory | Honcho Memory |
 |---------|----------------|---------------|
 | Storage | Local files (`~/.hermes/memories/`) | Cloud-hosted Honcho API |
 | Scope | Agent-level notes and user profile | Deep user modeling via dialectic reasoning |
 | Persistence | Across sessions on same machine | Across sessions, machines, and platforms |
-| Query | Injected into system prompt automatically | On-demand via `query_user_context` tool |
+| Query | Injected into system prompt automatically | Prefetched + on-demand via tools |
 | Content | Manually curated by the agent | Automatically learned from conversations |
+| Write surface | `memory` tool (add/replace/remove) | `honcho_conclude` tool (persist facts) |

-Honcho doesn't replace built-in memory — it **supplements** it with richer user understanding.

 ## Setup

-### 1. Get a Honcho API Key
-
-Sign up at [app.honcho.dev](https://app.honcho.dev) and get your API key.
-
-### 2. Install the Client Library
+### Interactive Setup

 ```bash
-pip install honcho-ai
+hermes honcho setup
 ```

-### 3. Configure Honcho
+The setup wizard walks through API key, peer names, workspace, memory mode, write frequency, recall mode, and session strategy. It offers to install `honcho-ai` if missing.

-Honcho reads its configuration from `~/.honcho/config.json` (the global Honcho config shared across all Honcho-enabled applications):
+### Manual Setup
+
+#### 1. Install the Client Library
+
+```bash
+pip install 'honcho-ai>=2.0.1'
+```
+
+#### 2. Get an API Key
+
+Go to [app.honcho.dev](https://app.honcho.dev) > Settings > API Keys.
+
+#### 3. Configure
+
+Honcho reads from `~/.honcho/config.json` (shared across all Honcho-enabled applications):

 ```json
 {
@@ -45,25 +55,25 @@ Honcho reads its configuration from `~/.honcho/config.json` (the global Honcho c
  "workspace": "hermes",
  "peerName": "your-name",
  "aiPeer": "hermes",
-  "environment": "production",
-  "saveMessages": true,
+  "memoryMode": "hybrid",
+  "writeFrequency": "async",
+  "recallMode": "hybrid",
  "sessionStrategy": "per-directory",
  "enabled": true
 }
 ```

-Alternatively, set the API key as an environment variable:
+Or set the API key as an environment variable:

 ```bash
-# Add to ~/.hermes/.env
-HONCHO_API_KEY=your-honcho-api-key
+hermes config set HONCHO_API_KEY your-key
 ```

 :::info
-When an API key is present (either in `~/.honcho/config.json` or as `HONCHO_API_KEY`), Honcho auto-enables unless explicitly set to `"enabled": false` in the config.
+When an API key is present (either in `~/.honcho/config.json` or as `HONCHO_API_KEY`), Honcho auto-enables unless explicitly set to `"enabled": false`.
 :::

-## Configuration Details
+## Configuration

 ### Global Config (`~/.honcho/config.json`)

@@ -75,14 +85,71 @@ When an API key is present (either in `~/.honcho/config.json` or as `HONCHO_API_
 | `aiPeer` | `"hermes"` | AI assistant identity name |
 | `environment` | `"production"` | Honcho environment |
 | `saveMessages` | `true` | Whether to sync messages to Honcho |
+| `memoryMode` | `"hybrid"` | Memory mode: `hybrid`, `honcho`, or `local` |
+| `writeFrequency` | `"async"` | When to write: `async`, `turn`, `session`, or integer N |
+| `recallMode` | `"hybrid"` | Retrieval strategy: `hybrid`, `context`, or `tools` |
 | `sessionStrategy` | `"per-directory"` | How sessions are scoped |
 | `sessionPeerPrefix` | `false` | Prefix session names with peer name |
 | `contextTokens` | *(Honcho default)* | Max tokens for context prefetch |
+| `dialecticReasoningLevel` | `"low"` | Floor for dialectic reasoning: `minimal` / `low` / `medium` / `high` / `max` |
+| `dialecticMaxChars` | `600` | Char cap on dialectic results injected into system prompt |
 | `sessions` | `{}` | Manual session name overrides per directory |

+### Memory Modes
+
+| Mode | Effect |
+|------|--------|
+| `hybrid` | Write to both Honcho and local files (default) |
+| `honcho` | Honcho only — skip local file writes |
+| `local` | Local files only — skip all Honcho activity |
+
+Memory mode can be set globally or per-peer (user, agent1, agent2, etc):
+
+```json
+{
+  "memoryMode": {
+    "default": "hybrid",
+    "hermes": "honcho",
+    "user": "local"
+  }
+}
+```
+
+When both active peers resolve to `local`, Hermes skips all remote Honcho activity entirely — no client initialization, no session creation, no prefetch.
+
+### Recall Modes
+
+Controls how Honcho context reaches the agent:
+
+| Mode | Behavior |
+|------|----------|
+| `hybrid` | Prefetch context into system prompt + expose tools (default) |
+| `context` | Context injection only — no Honcho tools available |
+| `tools` | Tools only — no prefetch into system prompt |
+
+### Write Frequency
+
+| Setting | Behavior |
+|---------|----------|
+| `async` | Background thread writes (zero blocking, default) |
+| `turn` | Synchronous write after each turn |
+| `session` | Batched write at session end |
+| *integer N* | Write every N turns |
+
+### Session Strategies
+
+| Strategy | Session key | Use case |
+|----------|-------------|----------|
+| `per-directory` | CWD basename | Default. Each project gets its own session. |
+| `per-repo` | Git repo root name | Groups subdirectories under one session. |
+| `per-session` | Unique per run | Fresh session every time. |
+| `global` | Fixed `"global"` | Single cross-project session. |
+
+Resolution order: manual map > session title > strategy-derived key > platform key.
+
 ### Host-specific Configuration

-You can configure per-host settings for multi-application setups:
+For multi-application setups, use host blocks:

 ```json
 {
@@ -91,73 +158,158 @@ You can configure per-host settings for multi-application setups:
    "hermes": {
      "workspace": "my-workspace",
      "aiPeer": "hermes-assistant",
-      "linkedHosts": ["other-app"],
-      "contextTokens": 2000
+      "linkedHosts": ["claude-code"],
+      "contextTokens": 2000,
+      "dialecticReasoningLevel": "medium"
    }
  }
 }
 ```

-Host-specific fields override global fields. Resolution order:
-1. Explicit host block fields
-2. Global/flat fields from config root
-3. Defaults (host name used as workspace/peer)
+Host-specific fields override global fields. Resolution: host block > global fields > defaults.

 ### Hermes Config (`~/.hermes/config.yaml`)

-The `honcho` section in Hermes config is intentionally minimal — most configuration comes from the global `~/.honcho/config.json`:
+Intentionally minimal — most configuration comes from `~/.honcho/config.json`:

 ```yaml
 honcho: {}
 ```

-## The `query_user_context` Tool
+## How It Works

-When Honcho is active, Hermes gains access to the `query_user_context` tool. This lets the agent proactively ask Honcho about the user during conversations:
+### Async Prefetch Pipeline

-**Tool schema:**
- **Name:** `query_user_context`
- **Parameter:** `query` (string) — a natural language question about the user
- **Toolset:** `honcho`
-
-**Example queries the agent might make:**
+Honcho context is fetched asynchronously to avoid blocking the response path:

 ```
-"What are this user's main goals?"
-"What communication style does this user prefer?"
-"What topics has this user discussed recently?"
-"What is this user's technical expertise level?"
+Turn N:
+  user message
+    → pop prefetch result from cache (from previous turn)
+    → inject into system prompt (user representation, AI representation, dialectic)
+    → LLM call
+    → response
+    → fire prefetch in background threads
+         → prefetch_context()   ─┐
+         → prefetch_dialectic() ─┴→ cache for Turn N+1
 ```

-The tool calls Honcho's dialectic chat API to retrieve relevant user context based on accumulated conversation history.
+Turn 1 is a cold start (no cache). All subsequent turns consume pre-warmed results with zero HTTP latency on the response path. The system prompt on turn 1 uses only static context to preserve prefix cache hits at the LLM provider.

-:::note
-The `query_user_context` tool is only available when Honcho is active (API key configured and session context set). It registers in the `honcho` toolset and its availability is checked dynamically.
+### Dual-Peer Architecture
+
+Both the user and AI have peer representations in Honcho:
+
+- **User peer** — observed from user messages. Honcho learns preferences, goals, communication style.
+- **AI peer** — observed from assistant messages (`observe_me=True`). Honcho builds a representation of the agent's knowledge and behavior.
+
+Both representations are injected into the system prompt when available.
+
+### Dynamic Reasoning Level
+
+Dialectic queries scale reasoning effort with message complexity:
+
+| Message length | Reasoning level |
+|----------------|-----------------|
+| < 120 chars | Config default (typically `low`) |
+| 120-400 chars | One level above default (cap: `high`) |
+| > 400 chars | Two levels above default (cap: `high`) |
+
+`max` is never selected automatically.
+
+### Gateway Integration
+
+The gateway creates short-lived `AIAgent` instances per request. Honcho managers are owned at the gateway session layer (`_honcho_managers` dict) so they persist across requests within the same session and flush at real session boundaries (reset, resume, expiry, server stop).
+
+## Tools
+
+When Honcho is active, four tools become available. Availability is gated dynamically — they are invisible when Honcho is disabled.
+
+### `honcho_profile`
+
+Fast peer card retrieval (no LLM). Returns a curated list of key facts about the user.
+
+### `honcho_search`
+
+Semantic search over memory (no LLM). Returns raw excerpts ranked by relevance. Cheaper and faster than `honcho_context` — good for factual lookups.
+
+Parameters:
+- `query` (string) — search query
+- `max_tokens` (integer, optional) — result token budget
+
+### `honcho_context`
+
+Dialectic Q&A powered by Honcho's LLM. Synthesizes an answer from accumulated conversation history.
+
+Parameters:
+- `query` (string) — natural language question
+- `peer` (string, optional) — `"user"` (default) or `"ai"`. Querying `"ai"` asks about the assistant's own history and identity.
+
+### `honcho_conclude`
+
+Writes a fact to Honcho memory. Use when the user explicitly states a preference, correction, or project context worth remembering. Feeds into the user's peer card and representation.
+
+Parameters:
+- `conclusion` (string) — the fact to persist
+
+## CLI Commands
+
+```
+hermes honcho setup                        # Interactive setup wizard
+hermes honcho status                       # Show config and connection status
+hermes honcho sessions                     # List directory → session name mappings
+hermes honcho map <name>                   # Map current directory to a session name
+hermes honcho peer                         # Show peer names and dialectic settings
+hermes honcho peer --user NAME             # Set user peer name
+hermes honcho peer --ai NAME               # Set AI peer name
+hermes honcho peer --reasoning LEVEL       # Set dialectic reasoning level
+hermes honcho mode                         # Show current memory mode
+hermes honcho mode [hybrid|honcho|local]   # Set memory mode
+hermes honcho tokens                       # Show token budget settings
+hermes honcho tokens --context N           # Set context token cap
+hermes honcho tokens --dialectic N         # Set dialectic char cap
+hermes honcho identity                     # Show AI peer identity
+hermes honcho identity <file>              # Seed AI peer identity from file (SOUL.md, etc.)
+hermes honcho migrate                      # Migration guide: OpenClaw → Hermes + Honcho
+```
+
+### Doctor Integration
+
+`hermes doctor` includes a Honcho section that validates config, API key, and connection status.
+
+## Migration
+
+### From Local Memory
+
+When Honcho activates on an instance with existing local history, migration runs automatically:
+
+1. **Conversation history** — prior messages are uploaded as an XML transcript file
+2. **Memory files** — existing `MEMORY.md`, `USER.md`, and `SOUL.md` are uploaded for context
+
+### From OpenClaw
+
+```bash
+hermes honcho migrate
+```
+
+Walks through converting an OpenClaw native Honcho setup to the shared `~/.honcho/config.json` format.
+
+## AI Peer Identity
+
+Honcho can build a representation of the AI assistant over time (via `observe_me=True`). You can also seed the AI peer explicitly:
+
+```bash
+hermes honcho identity ~/.hermes/SOUL.md
+```
+
+This uploads the file content through Honcho's observation pipeline. The AI peer representation is then injected into the system prompt alongside the user's, giving the agent awareness of its own accumulated identity.
+
+```bash
+hermes honcho identity --show
+```
+
+Shows the current AI peer representation from Honcho.
+
+:::tip
+Honcho is fully opt-in — zero behavior change when disabled or unconfigured. All Honcho calls are non-fatal; if the service is unreachable, the agent continues normally.
 :::
-
-## Session Management
-
-Honcho sessions track conversation history for user modeling:
-
- **Session creation** — sessions are created or resumed automatically based on session keys (e.g., `telegram:123456` or CLI session IDs)
- **Message syncing** — new messages are synced to Honcho incrementally (only unsynced messages)
- **Peer configuration** — user messages are observed for learning; assistant messages are not
- **Context prefetch** — before responding, Hermes can prefetch user context (representation + peer card) in a single API call
- **Session rotation** — when sessions reset, old data is preserved in Honcho for continued user modeling
-
-## Migration from Local Memory
-
-When Honcho is activated on an instance that already has local conversation history:
-
-1. **Conversation history** — prior messages can be uploaded to Honcho as a transcript file
-2. **Memory files** — existing `MEMORY.md` and `USER.md` files can be uploaded for context
-
-This ensures Honcho has the full picture even when activated mid-conversation.
-
-## Use Cases
-
- **Personalized responses** — Honcho learns how each user prefers to communicate
- **Goal tracking** — remembers what users are working toward across sessions
- **Expertise adaptation** — adjusts technical depth based on user's background
- **Cross-platform memory** — same user understanding across CLI, Telegram, Discord, etc.
- **Multi-user support** — each user (via messaging platforms) gets their own user model
--- a/website/docs/user-guide/features/memory.md
+++ b/website/docs/user-guide/features/memory.md
@@ -209,41 +209,10 @@ memory:

 ## Honcho Integration (Cross-Session User Modeling)

-For deeper, AI-generated user understanding that works across tools, you can optionally enable [Honcho](https://honcho.dev/) by Plastic Labs. Honcho runs alongside existing memory — USER.md stays as-is, and Honcho adds an additional layer of context.
-
-When enabled:
- **Prefetch**: Each turn, Honcho's user representation is injected into the system prompt
- **Sync**: After each conversation, messages are synced to Honcho
- **Query tool**: The agent can actively query its understanding of you via `query_user_context`
-
-**Setup:**
+For deeper, AI-generated user understanding that works across sessions and platforms, you can enable [Honcho Memory](./honcho.md). Honcho runs alongside built-in memory in `hybrid` mode (the default) — `MEMORY.md` and `USER.md` stay as-is, and Honcho adds a persistent user modeling layer on top.

 ```bash
-# 1. Install the optional dependency
-uv pip install honcho-ai
-
-# 2. Get an API key from https://app.honcho.dev
-
-# 3. Create ~/.honcho/config.json
-cat > ~/.honcho/config.json << 'EOF'
-{
-  "enabled": true,
-  "apiKey": "your-honcho-api-key",
-  "peerName": "your-name",
-  "hosts": {
-    "hermes": {
-      "workspace": "hermes"
-    }
-  }
-}
-EOF
+hermes honcho setup
 ```

-Or via environment variable:
-```bash
-hermes config set HONCHO_API_KEY your-key
-```
-
-:::tip
-Honcho is fully opt-in — zero behavior change when disabled or unconfigured. All Honcho calls are non-fatal; if the service is unreachable, the agent continues normally.
-:::
+See the [Honcho Memory](./honcho.md) docs for full configuration, tools, and CLI reference.