- docs/mcp.md: Full MCP documentation covering prerequisites, configuration, transports (stdio + HTTP), security (env filtering, credential stripping), reconnection, troubleshooting, popular servers, and advanced usage - README.md: Add MCP section with quick config example and install instructions - cli-config.yaml.example: Add commented mcp_servers section with examples for stdio, HTTP, and authenticated server configs - docs/tools.md: Add MCP to Tool Categories table and MCP Tools section - skills/mcp/native-mcp/SKILL.md: Create native MCP client skill with full configuration reference, transport types, security, troubleshooting - skills/mcp/DESCRIPTION.md: Update category description to cover both native MCP client and mcporter bridge approaches
17 KiB
MCP (Model Context Protocol) Support
MCP lets Hermes Agent connect to external tool servers — giving the agent access to databases, APIs, filesystems, and more without any code changes.
Overview
The Model Context Protocol (MCP) is an open standard for connecting AI agents to external tools and data sources. MCP servers expose tools over a lightweight RPC protocol, and Hermes Agent can connect to any compliant server automatically.
What this means for you:
- Thousands of ready-made tools — browse the MCP server directory for servers covering GitHub, Slack, databases, file systems, web scraping, and more.
- No code changes needed — add a few lines to
~/.hermes/config.yamland the tools appear alongside built-in ones. - Mix and match — run multiple MCP servers simultaneously, combining stdio-based and HTTP-based servers.
- Secure by default — environment variables are filtered and credentials are stripped from error messages returned to the LLM.
Prerequisites
Install MCP support as an optional dependency:
pip install hermes-agent[mcp]
Depending on which MCP servers you want to use, you may need additional runtimes:
| Server Type | Runtime Needed | Example |
|---|---|---|
| HTTP/remote | Nothing extra | url: "https://mcp.example.com" |
| npm-based (npx) | Node.js 18+ | command: "npx" |
| Python-based | uv (recommended) | command: "uvx" |
Most popular MCP servers are distributed as npm packages and launched via npx. Python-based servers typically use uvx (from the uv package manager).
Configuration
MCP servers are configured in ~/.hermes/config.yaml under the mcp_servers key. Each entry is a named server with its connection details.
Stdio Servers (command + args + env)
Stdio servers run as local subprocesses. Communication happens over stdin/stdout.
mcp_servers:
filesystem:
command: "npx"
args: ["-y", "@modelcontextprotocol/server-filesystem", "/home/user/projects"]
env: {}
github:
command: "npx"
args: ["-y", "@modelcontextprotocol/server-github"]
env:
GITHUB_PERSONAL_ACCESS_TOKEN: "ghp_xxxxxxxxxxxx"
| Key | Required | Description |
|---|---|---|
command |
Yes | Executable to run (e.g., npx, uvx, python) |
args |
No | List of command-line arguments |
env |
No | Environment variables to pass to the subprocess |
Note: Only explicitly listed env variables plus a safe baseline (PATH, HOME, USER, LANG, SHELL, TMPDIR, XDG_*) are passed to the subprocess. Your shell's API keys, tokens, and secrets are not leaked. See Security for details.
HTTP Servers (url + headers)
HTTP servers run remotely and are accessed over HTTP/StreamableHTTP.
mcp_servers:
remote_api:
url: "https://my-mcp-server.example.com/mcp"
headers:
Authorization: "Bearer sk-xxxxxxxxxxxx"
| Key | Required | Description |
|---|---|---|
url |
Yes | Full URL of the MCP HTTP endpoint |
headers |
No | HTTP headers to include (e.g., auth tokens) |
Per-Server Timeouts
Each server can have custom timeouts:
mcp_servers:
slow_database:
command: "npx"
args: ["-y", "@modelcontextprotocol/server-postgres"]
env:
DATABASE_URL: "postgres://user:pass@localhost/mydb"
timeout: 300 # Tool call timeout in seconds (default: 120)
connect_timeout: 90 # Initial connection timeout in seconds (default: 60)
| Key | Default | Description |
|---|---|---|
timeout |
120 | Maximum seconds to wait for a single tool call to complete |
connect_timeout |
60 | Maximum seconds to wait for the initial connection and tool discovery |
Mixed Configuration Example
You can combine stdio and HTTP servers freely:
mcp_servers:
# Local filesystem access via stdio
filesystem:
command: "npx"
args: ["-y", "@modelcontextprotocol/server-filesystem", "/tmp"]
# GitHub API via stdio with auth
github:
command: "npx"
args: ["-y", "@modelcontextprotocol/server-github"]
env:
GITHUB_PERSONAL_ACCESS_TOKEN: "ghp_xxxxxxxxxxxx"
# Remote database via HTTP
company_db:
url: "https://mcp.internal.company.com/db"
headers:
Authorization: "Bearer sk-xxxxxxxxxxxx"
timeout: 180
# Python-based server via uvx
memory:
command: "uvx"
args: ["mcp-server-memory"]
Config Translation (Claude/Cursor JSON → Hermes YAML)
Many MCP server docs show configuration in Claude Desktop JSON format. Here's how to translate:
Claude Desktop JSON (claude_desktop_config.json):
{
"mcpServers": {
"filesystem": {
"command": "npx",
"args": ["-y", "@modelcontextprotocol/server-filesystem", "/tmp"],
"env": {}
},
"github": {
"command": "npx",
"args": ["-y", "@modelcontextprotocol/server-github"],
"env": {
"GITHUB_PERSONAL_ACCESS_TOKEN": "ghp_xxxxxxxxxxxx"
}
}
}
}
Hermes Agent YAML (~/.hermes/config.yaml):
mcp_servers: # mcpServers → mcp_servers (snake_case)
filesystem:
command: "npx"
args: ["-y", "@modelcontextprotocol/server-filesystem", "/tmp"]
env: {}
github:
command: "npx"
args: ["-y", "@modelcontextprotocol/server-github"]
env:
GITHUB_PERSONAL_ACCESS_TOKEN: "ghp_xxxxxxxxxxxx"
Translation rules:
- Key name:
mcpServers→mcp_servers(snake_case) - Format: JSON → YAML (remove braces/brackets, use indentation)
- Arrays:
["a", "b"]stays the same in YAML flow style, or use block style with- a - Everything else: Keys (
command,args,env) are identical
How It Works
Startup & Discovery
When Hermes Agent starts, the tool discovery system calls discover_mcp_tools():
- Config loading — Reads
mcp_serversfrom~/.hermes/config.yaml - Background loop — Spins up a dedicated asyncio event loop in a daemon thread for MCP connections
- Connection — Connects to each configured server (stdio subprocess or HTTP)
- Session init — Initializes the MCP client session (protocol handshake)
- Tool discovery — Calls
list_tools()on each server to get available tools - Registration — Registers each MCP tool into the Hermes tool registry with a prefixed name
Tool Registration
Each discovered MCP tool is registered with a prefixed name following this pattern:
mcp_{server_name}_{tool_name}
Hyphens and dots in both server and tool names are replaced with underscores for API compatibility. For example:
| Server Name | MCP Tool Name | Registered As |
|---|---|---|
filesystem |
read_file |
mcp_filesystem_read_file |
github |
create-issue |
mcp_github_create_issue |
my-api |
query.data |
mcp_my_api_query_data |
Tools appear alongside built-in tools — the agent sees them in its tool list and can call them like any other tool.
Tool Calling
When the agent calls an MCP tool:
- The handler is invoked by the tool registry (sync interface)
- The handler schedules the actual MCP
call_tool()RPC on the background event loop - The call blocks (with timeout) until the MCP server responds
- Response content blocks are collected and returned as JSON
- Errors are sanitized to strip credentials before returning to the LLM
Shutdown
On agent exit, shutdown_mcp_servers() is called:
- All server tasks are signalled to exit via their shutdown events
- Each server's
async withcontext manager exits, cleaning up transports - The background event loop is stopped and its thread is joined
- All server state is cleared
Security
Environment Variable Filtering
When launching stdio MCP servers, Hermes does not pass your full shell environment to the subprocess. The _build_safe_env() function constructs a minimal environment:
Always passed through (from your current environment):
PATH,HOME,USER,LANG,LC_ALL,TERM,SHELL,TMPDIR- Any variable starting with
XDG_
Explicitly added: Any variables you list in the server's env config.
Everything else is excluded — your OPENAI_API_KEY, AWS_SECRET_ACCESS_KEY, database passwords, and other secrets are never leaked to MCP server subprocesses unless you explicitly add them.
mcp_servers:
github:
command: "npx"
args: ["-y", "@modelcontextprotocol/server-github"]
env:
# Only this token is passed — nothing else from your shell
GITHUB_PERSONAL_ACCESS_TOKEN: "ghp_xxxxxxxxxxxx"
Credential Stripping in Errors
If an MCP tool call fails, the error message is sanitized by _sanitize_error() before being returned to the LLM. The following patterns are replaced with [REDACTED]:
- GitHub PATs (
ghp_...) - OpenAI-style keys (
sk-...) - Bearer tokens (
Bearer ...) - Query parameters (
token=...,key=...,API_KEY=...,password=...,secret=...)
This prevents accidental credential exposure through error messages in the conversation.
Transport Types
Stdio Transport
The default transport for locally-installed MCP servers. The server runs as a subprocess and communicates over stdin/stdout.
mcp_servers:
my_server:
command: "npx" # or "uvx", "python", any executable
args: ["-y", "package"]
env:
MY_VAR: "value"
Pros: Simple setup, no network needed, works offline. Cons: Server must be installed locally, one process per server.
HTTP / StreamableHTTP Transport
For remote MCP servers accessible over HTTP. Uses the StreamableHTTP protocol from the MCP SDK.
mcp_servers:
my_remote:
url: "https://mcp.example.com/endpoint"
headers:
Authorization: "Bearer token"
Pros: No local installation needed, shared servers, cloud-hosted.
Cons: Requires network, slightly higher latency, needs mcp package with HTTP support.
Note: If HTTP transport is not available in your installed mcp package version, Hermes will log a clear error and skip that server.
Reconnection
If an MCP server connection drops after initial setup (e.g., process crash, network hiccup), Hermes automatically attempts to reconnect with exponential backoff:
| Attempt | Delay Before Retry |
|---|---|
| 1 | 1 second |
| 2 | 2 seconds |
| 3 | 4 seconds |
| 4 | 8 seconds |
| 5 | 16 seconds |
- Maximum of 5 retry attempts before giving up
- Backoff is capped at 60 seconds (relevant if the formula exceeds this)
- Reconnection only triggers for established connections that drop — initial connection failures are reported immediately without retries
- If shutdown is requested during reconnection, the retry loop exits cleanly
Troubleshooting
Common Errors
"mcp package not installed"
MCP SDK not available -- skipping MCP tool discovery
Solution: Install the MCP optional dependency:
pip install hermes-agent[mcp]
"command not found" or server fails to start
The MCP server command (npx, uvx, etc.) is not on PATH.
Solution: Install the required runtime:
# For npm-based servers
npm install -g npx # or ensure Node.js 18+ is installed
# For Python-based servers
pip install uv # then use "uvx" as the command
"MCP server 'X' has no 'command' in config"
Your stdio server config is missing the command key.
Solution: Check your ~/.hermes/config.yaml indentation and ensure command is present:
mcp_servers:
my_server:
command: "npx" # <-- required for stdio servers
args: ["-y", "package-name"]
Server connects but tools fail with authentication errors
Your API key or token is missing or invalid.
Solution: Ensure the key is in the server's env block (not your shell env):
mcp_servers:
github:
command: "npx"
args: ["-y", "@modelcontextprotocol/server-github"]
env:
GITHUB_PERSONAL_ACCESS_TOKEN: "ghp_your_actual_token" # <-- check this
"MCP server 'X' is not connected"
The server disconnected and reconnection failed (or was never established).
Solution:
- Check the Hermes logs for connection errors (
hermes --verbose) - Verify the server works standalone (e.g., run the
npxcommand manually) - Increase
connect_timeoutif the server is slow to start
Connection timeout during discovery
Failed to connect to MCP server 'X': TimeoutError
Solution: Increase the connect_timeout for slow-starting servers:
mcp_servers:
slow_server:
command: "npx"
args: ["-y", "heavy-server-package"]
connect_timeout: 120 # default is 60
HTTP transport not available
mcp.client.streamable_http is not available
Solution: Upgrade the mcp package to a version that includes HTTP support:
pip install --upgrade mcp
Popular MCP Servers
Here are some popular free MCP servers you can use immediately:
| Server | Package | Description |
|---|---|---|
| Filesystem | @modelcontextprotocol/server-filesystem |
Read/write/search local files |
| GitHub | @modelcontextprotocol/server-github |
Issues, PRs, repos, code search |
| Git | @modelcontextprotocol/server-git |
Git operations on local repos |
| Fetch | @modelcontextprotocol/server-fetch |
HTTP fetching and web content extraction |
| Memory | @modelcontextprotocol/server-memory |
Persistent key-value memory |
| SQLite | @modelcontextprotocol/server-sqlite |
Query SQLite databases |
| PostgreSQL | @modelcontextprotocol/server-postgres |
Query PostgreSQL databases |
| Brave Search | @modelcontextprotocol/server-brave-search |
Web search via Brave API |
| Puppeteer | @modelcontextprotocol/server-puppeteer |
Browser automation |
| Sequential Thinking | @modelcontextprotocol/server-sequential-thinking |
Step-by-step reasoning |
Example Configs for Popular Servers
mcp_servers:
# Filesystem — no API key needed
filesystem:
command: "npx"
args: ["-y", "@modelcontextprotocol/server-filesystem", "/home/user/projects"]
# Git — no API key needed
git:
command: "uvx"
args: ["mcp-server-git", "--repository", "/home/user/my-repo"]
# GitHub — requires a personal access token
github:
command: "npx"
args: ["-y", "@modelcontextprotocol/server-github"]
env:
GITHUB_PERSONAL_ACCESS_TOKEN: "ghp_xxxxxxxxxxxx"
# Fetch — no API key needed
fetch:
command: "uvx"
args: ["mcp-server-fetch"]
# SQLite — no API key needed
sqlite:
command: "uvx"
args: ["mcp-server-sqlite", "--db-path", "/home/user/data.db"]
# Brave Search — requires API key (free tier available)
brave_search:
command: "npx"
args: ["-y", "@modelcontextprotocol/server-brave-search"]
env:
BRAVE_API_KEY: "BSA_xxxxxxxxxxxx"
Advanced
Multiple Servers
You can run as many MCP servers as you want simultaneously. Each server gets its own subprocess (stdio) or HTTP connection, and all tools are registered into a single unified namespace.
Servers are connected sequentially during startup. If one server fails to connect, the others still work — failed servers are logged as warnings and skipped.
Tool Naming Convention
All MCP tools follow the naming pattern:
mcp_{server_name}_{tool_name}
Both the server name and tool name are sanitized: hyphens (-) and dots (.) are replaced with underscores (_). This ensures compatibility with LLM function-calling APIs that restrict tool name characters.
If you configure a server named my-api that exposes a tool called query.users, the agent will see it as mcp_my_api_query_users.
Configurable Timeouts
Fine-tune timeouts per server based on expected response times:
mcp_servers:
fast_cache:
command: "npx"
args: ["-y", "mcp-server-redis"]
timeout: 30 # Fast lookups — short timeout
connect_timeout: 15
slow_analysis:
url: "https://analysis.example.com/mcp"
timeout: 600 # Long-running analysis — generous timeout
connect_timeout: 120
Idempotent Discovery
discover_mcp_tools() is idempotent — calling it multiple times only connects to servers that aren't already running. Already-connected servers keep their existing connections and tool registrations.
Custom Toolsets
Each MCP server's tools are automatically grouped into a toolset named mcp-{server_name}. These toolsets are also injected into all hermes-* platform toolsets, so MCP tools are available in CLI, Telegram, Discord, and other platforms.
Thread Safety
The MCP subsystem is fully thread-safe. A dedicated background event loop runs in a daemon thread, and all server state is protected by a lock. This works correctly even with Python 3.13+ free-threading builds.