Files
hermes-agent/TODO.md

3.0 KiB

Hermes Agent - Future Improvements


1. Subagent Architecture (Context Isolation) 🎯

The main agent becomes an orchestrator that delegates context-heavy tasks to subagents with isolated context. Each subagent returns a summary, keeping the orchestrator's context clean. delegate_task(goal, context, toolsets=[]) with fresh conversation, limited toolset, task-specific system prompt.

2. Planning & Task Management 📋

Task decomposition tool, progress checkpoints after N tool calls, persistent plan storage that survives context compression, failure recovery with replanning.

3. Dynamic Skills Expansion 📚

Skill acquisition from successful tasks, parameterized skill templates, skill chaining with dependency graphs.

4. Interactive Clarifying Questions

Multiple-choice prompt tool with rich terminal UI. Up to 4 choices + free-text. CLI-only with graceful fallback for non-interactive modes.

5. Memory System 🧠

Daily memory logs, long-term curated MEMORY.md, vector/semantic search, pre-compaction memory flush, user profile, learning store for error patterns and discovered fixes. Inspired by ClawdBot's memory system.

6. Heartbeat System 💓

Periodic agent wake-up that reads HEARTBEAT.md for instructions. Runs inside the main session with full context. Triggers on interval, exec completion, cron events, or manual wake. HEARTBEAT_OK suppression when nothing needs attention. Inspired by ClawdBot's heartbeat.

7. Local Browser Control via CDP 🌐

Support both local Chrome (via CDP, free) and Browserbase (cloud, paid) as browser backends. Local gives persistent login sessions but lacks CAPTCHA solving.

8. Signal Integration 📡

New platform adapter using signal-cli daemon (JSON-RPC HTTP + SSE). Requires Java runtime and phone number registration.

9. Session Transcript Search 🔍

hermes sessions search <query> CLI command and session_search agent tool. Text-based first (ripgrep over JSONL), vector search later.

10. Plugin/Extension System 🔌

Python plugin interface with plugin.yaml + handler.py. Discovery from ~/.hermes/plugins/. Plugins can register tools, hooks, and CLI commands. Inspired by ClawdBot's 36-plugin extension system.

11. Native Companion Apps 📱

macOS (Swift/SwiftUI), iOS, Android apps connecting via WebSocket. Prerequisite: WS API on gateway. MVP: web UI with Flask/FastAPI. Inspired by ClawdBot's companion apps.

12. Evaluation System 📏

LLM grader mode for batch_runner, action comparison against expected tool calls, string matching baselines.

13. Layered Context Architecture 📊

Structured hierarchy: project context > skills > user profile > learnings > external knowledge > runtime introspection.

14. Tools Wishlist 🧰

  • Diagram rendering (Mermaid/PlantUML to images)
  • Document generation (PDFs, Word, presentations)
  • Canvas / visual workspace
  • Coding agent skill (Codex, Claude Code orchestration via PTY)
  • Domain skill packs (DevOps, data science, security)