[STUDY] Sovereign Local Agents on macOS — Hermes v0.4.0 Architecture Spike #576
Research Spike: Sovereign Local Agents on macOS — State of the Art
Source: "Sovereign Local Agents on macOS: A State-of-the-Art Technical Architecture" (20pp, generated by Kimi.ai, March 2026)
This report covers the full Hermes Agent v0.4.0 architecture and how it maps to our sovereign stack. Key sections with actionable findings below.
1. Hermes Agent Harness (Section 1)
What we already have: Hermes Agent installed on Hermes (Mac M3 Max), config.yaml with Ollama, SOUL.md, memory enabled, orchestration MCP server.
What the report confirms we're missing:
- MCP servers (steam-info-mcp, mcp-pyautogui) are defined in mcp/servers.json but not registered in config.yaml mcp_servers, so Hermes doesn't see them
- claude-opus-4-6 (Anthropic) is the default, with hermes3:latest (8B) as Ollama fallback; should point at hermes4:14b once pulled (#9)
- A trajectory compressor (trajectory_compressor.py) that we're not using; it compresses agent execution traces into training-optimized formats, 10-50x reduction
Key v0.4.0 features we should leverage:
- hermes mcp CLI for installing/configuring MCP servers (with OAuth 2.1 PKCE flow)
2. Instant Distillation / Context Compression (Section 2)
Structured Distillation paper (Nous Research, March 13, 2026, arXiv):
- exchange_core (~15 tokens), specific_context (~23 tokens), thematic_room_assignments, files_touched
Agentic On-Policy Distillation (OPD):
- tinker-atropos standalone training infrastructure available
3. Memory Architecture (Section 3)
Five-tier memory already in Hermes: Working Context → FTS5 Search → Vector Embeddings → Honcho Profiles → Skill Documents
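
To make the FTS5 tier concrete, here is a minimal sketch of full-text recall over stored session notes using SQLite's FTS5 extension from Python. The table name `memory`, the columns `session` and `body`, and both helper functions are hypothetical and chosen only for illustration; this is not Hermes's actual schema or code, and it assumes the local SQLite build has FTS5 enabled.

```python
import sqlite3


def build_memory_index(notes):
    """Create an in-memory SQLite database with an FTS5 index over notes.

    `notes` is an iterable of (session, body) tuples. The schema is a
    hypothetical stand-in, not the one Hermes uses.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE VIRTUAL TABLE memory USING fts5(session, body)")
    conn.executemany("INSERT INTO memory (session, body) VALUES (?, ?)", notes)
    return conn


def search_memory(conn, query, limit=5):
    """Return (session, body) rows matching `query`, best match first."""
    rows = conn.execute(
        "SELECT session, body FROM memory "
        "WHERE memory MATCH ? ORDER BY rank LIMIT ?",
        (query, limit),
    )
    return rows.fetchall()


notes = [
    ("s1", "Registered steam-info-mcp in config.yaml"),
    ("s2", "Pulled a model through Ollama for local inference"),
    ("s3", "Ollama fallback model still points at the 8B build"),
]
conn = build_memory_index(notes)
hits = search_memory(conn, "ollama")
for session, body in hits:
    print(session, body)
```

The default FTS5 tokenizer folds case, so the query `ollama` matches both notes that mention Ollama. A vector-embedding tier would presumably sit behind this one for queries where keyword matching finds nothing, but that ordering is an assumption about what the arrows mean.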
Hindsight-Hermes plugin (worth evaluating):
- pip install hindsight-hermes
- hermes tools disable memory (prevents native tool preference)
my-hermantic-agent fork shows Ollama-hosted Hermes-4-14B with TimescaleDB persistent semantic memory, closest to our architecture.
4. Auto Research / Self-Improvement (Section 4)
- hermes-autoresearch branch: experimental, enables autonomous hypothesis generation and experimentation
5. Tool Ecosystem (Section 5)
- hermes-skills repo
- agentskills.io standard adopted by 11 tools (Claude Code, Cursor, Copilot, Gemini CLI, etc.)
6. macOS Deployment (Section 6)
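Per the report, an M3 Max with 128GB unified memory can run Hermes 3 70B without quantization, a capability it says would require ~$15,000 in discrete GPU hardware elsewhere. That claim deserves a check before we rely on it, since 16-bit weights for 70B parameters come to about 140GB on their own. Either way, we should be running at least 14B without breaking a sweat.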
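
A back-of-the-envelope sketch of weights-only memory at different precisions helps size these models against 128GB. It assumes nominal parameter counts (14B, 70B) and ignores KV cache, activations, and what macOS itself reserves, so real headroom is smaller than these figures suggest. The function name is hypothetical.

```python
def weight_footprint_gb(params_billion, bits_per_weight):
    """Approximate size of the model weights alone, in decimal gigabytes.

    params_billion * 1e9 weights * (bits / 8) bytes each, divided by 1e9
    bytes per GB, simplifies to params_billion * bits / 8.
    """
    return params_billion * bits_per_weight / 8


UNIFIED_MEMORY_GB = 128  # figure quoted for the M3 Max above

for label, params in [("14B", 14), ("70B", 70)]:
    for bits in (16, 8, 4):
        size = weight_footprint_gb(params, bits)
        verdict = "fits" if size < UNIFIED_MEMORY_GB else "exceeds"
        print(f"{label} @ {bits}-bit: {size:.0f} GB ({verdict} {UNIFIED_MEMORY_GB} GB)")
```

Under these assumptions a 14B model needs about 28 GB at 16-bit, while 70B needs about 140 GB at 16-bit and about 70 GB at 8-bit.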
Security layers available: namespace isolation, capability dropping, read-only root FS, seccomp-bpf, DM pairing for credential access.
7. Integration Roadmap from Report (Section 7)
The report's 4-phase roadmap maps to our current state:
Actionable Tickets Created
- steam-info-mcp and desktop-control in config.yaml mcp_servers
- heartbeat_tick() in tasks.py to invoke Hermes agent sessions (telemetry capture)
- /config set trajectory_export true + HERMES_TRAJECTORY_PATH
- hermes4:14b after #9 completes)
🔧 gemini working on this via Huey. Branch: gemini/issue-576
🔧 grok working on this via Huey. Branch: grok/issue-576
⚠️ grok produced no changes for this issue. Skipping.
⚡ Dispatched to claude. Huey task queued.
⚡ Dispatched to gemini. Huey task queued.
⚡ Dispatched to kimi. Huey task queued.
⚡ Dispatched to grok. Huey task queued.
⚡ Dispatched to perplexity. Huey task queued.
Closing during the 2026-03-28 backlog burn-down.
Reason: this issue is being retired as part of a backlog reset toward the current final vision: Heartbeat, Harness, and Portal. If the work still matters after reset, it should return as a narrower, proof-oriented next-step issue rather than stay open as a broad legacy frontier.