Files

teknium1 799114ac8b docs: clarify Anthropic Claude auth flow

2026-03-14 19:49:38 -07:00

3.1 KiB

Raw Blame History

sidebar_position, title, description

sidebar_position	title	description
4	Provider Runtime Resolution	How Hermes resolves providers, credentials, API modes, and auxiliary models at runtime

Provider Runtime Resolution

Hermes has a shared provider runtime resolver used across:

CLI
gateway
cron jobs
ACP
auxiliary model calls

Primary implementation:

hermes_cli/runtime_provider.py
hermes_cli/auth.py
agent/auxiliary_client.py

Resolution precedence

At a high level, provider resolution uses:

explicit CLI/runtime request
environment variables
config.yaml model/provider config
provider-specific defaults or auto resolution

Providers

Current provider families include:

OpenRouter
Nous Portal
OpenAI Codex
Anthropic (native)
Z.AI
Kimi / Moonshot
MiniMax
MiniMax China
custom OpenAI-compatible endpoints

Output of runtime resolution

The runtime resolver returns data such as:

provider
api_mode
base_url
api_key
source
provider-specific metadata like expiry/refresh info

Why this matters

This resolver is the main reason Hermes can share auth/runtime logic between:

hermes chat
gateway message handling
cron jobs running in fresh sessions
ACP editor sessions
auxiliary model tasks

OpenRouter vs custom OpenAI-compatible base URLs

Hermes contains logic to avoid leaking the wrong API key to a custom endpoint when both OPENROUTER_API_KEY and OPENAI_API_KEY exist.

That distinction is especially important for:

local model servers
non-OpenRouter OpenAI-compatible APIs
switching providers without re-running setup

Native Anthropic path

Anthropic is not just "via OpenRouter" anymore.

When provider resolution selects anthropic, Hermes uses:

api_mode = anthropic_messages
the native Anthropic Messages API
agent/anthropic_adapter.py for translation

Credential resolution for native Anthropic now prefers refreshable Claude Code credentials over copied env tokens when both are present. In practice that means:

Claude Code credential files are treated as the preferred source when they include refreshable auth
manual ANTHROPIC_TOKEN / CLAUDE_CODE_OAUTH_TOKEN values still work as explicit overrides
Hermes preflights Anthropic credential refresh before native Messages API calls
Hermes still retries once on a 401 after rebuilding the Anthropic client, as a fallback path

OpenAI Codex path

Codex uses a separate Responses API path:

api_mode = codex_responses
dedicated credential resolution and auth store support

Auxiliary model routing

Auxiliary tasks such as:

vision
web extraction summarization
context compression summaries
session search summarization
skills hub operations
MCP helper operations
memory flushes

can use their own provider/model routing rather than the main conversational model.

Fallback models

Hermes also supports a configured fallback model/provider, allowing runtime failover in supported error paths.

3.1 KiB Raw Blame History