hermes-agent

Author	SHA1	Message	Date
Teknium	d70e07fc45	refactor(cli): add protected TUI extension hooks for wrapper CLIs Based on PR #1749 by @erosika (reimplemented on current main). Extracts three protected methods from run() so wrapper CLIs can extend the TUI without overriding the entire method: - _get_extra_tui_widgets(): inject widgets between spacer and status bar - _register_extra_tui_keybindings(kb, input_area): add keybindings - _build_tui_layout_children(**widgets): full control over ordering Default implementations reproduce existing layout exactly. The inline HSplit in run() now delegates to _build_tui_layout_children(). 5 tests covering defaults, widget insertion position, and keybinding registration.	2026-03-21 09:42:07 -07:00
Teknium	07112e4e98	fix(mattermost): use MIME types for media attachments (#2329 ) fix(mattermost): use MIME types for media attachments	2026-03-21 09:31:53 -07:00
Himess	bc15f6cca3	fix(mattermost): use MIME types for media attachments Bare strings like "image", "audio", "document" were appended to media_types, but downstream run.py checks mtype.startswith("image/") and mtype.startswith("audio/"), which never matched. This caused all Mattermost file attachments to be silently dropped from vision/STT processing. Use the actual MIME type from file_info instead.	2026-03-21 09:31:15 -07:00
Teknium	3921fb973c	fix(gateway): load platforms section from config.yaml for webhook routes (#2328 ) fix(gateway): load platforms section from config.yaml for webhook routes	2026-03-21 09:27:40 -07:00
Teknium	6408b4ad53	Merge pull request #2327 from NousResearch/hermes/hermes-5d6932ba fix: prevent systemd restart storm on gateway connection failure	2026-03-21 09:26:57 -07:00
Teknium	326b146d68	fix: prevent systemd restart storm on gateway connection failure Cherry-picked from PR #2319 by @itenev. When the gateway fails to connect (e.g. PrivilegedIntentsRequired, missing token), systemd's default RestartSec=10 with no start rate limit causes rapid reconnect storms flooding logs and triggering platform-side rate limits. - StartLimitIntervalSec=600 + StartLimitBurst=5 in [Unit] (max 5 restarts per 10 min) - RestartSec: 10 → 30 - Applied to both templates in gateway.py and scripts/hermes-gateway	2026-03-21 09:26:39 -07:00
dieutx	1830db0476	fix(gateway): load platforms section from config.yaml into gateway config The gateway config loader read config.yaml but never merged its `platforms` key into the runtime config dict. This meant that platform-specific settings defined under `platforms.<name>.extra` (e.g. webhook routes) were silently ignored unless the user also duplicated them in the legacy gateway.json file. Merge `yaml_cfg["platforms"]` into `gw_data["platforms"]` with a shallow deep-merge of the `extra` dict so that gateway.json defaults are preserved while config.yaml values take precedence. Closes #2305	2026-03-21 09:26:24 -07:00
Teknium	3ba6043c62	feat(compressor): major context compaction improvements (#2323 ) feat(compressor): major context compaction improvements — structured summaries, iterative updates, token-budget tail protection	2026-03-21 08:51:42 -07:00
Teknium	e75f58420c	feat(compressor): major context compaction improvements Six improvements to reduce information loss during context compression, informed by analysis of Cline, OpenCode, Pi-mono, Codex, and ClawdBot: 1. Structured summary template — sections for Goal, Progress (Done/ In Progress/Blocked), Key Decisions, Relevant Files, Next Steps, and Critical Context. Forces the summarizer to preserve each category instead of writing a vague paragraph. 2. Iterative summary updates — on re-compression, the prompt says 'PRESERVE existing info, ADD new progress, UPDATE done/in-progress status.' Previous summary is stored and fed back to the summarizer so accumulated context survives across multiple compactions. 3. Token-budget tail protection — instead of fixed protect_last_n=4, walks backward keeping ~20K tokens of recent context. Adapts to message density: sessions with big tool results protect fewer messages, short exchanges protect more. Falls back to protect_last_n for small conversations. 4. Tool output pruning (pre-pass) — before the expensive LLM summary, replaces old tool result contents with a placeholder. This is free (no LLM call) and can save 30%+ of context by itself. 5. Scaled summary budget — instead of fixed 2500 tokens, allocates 20% of compressed content tokens (clamped to 2000-8000). A 50-turn conversation gets more summary space than a 10-turn one. 6. Richer summarizer input — tool calls now include arguments (up to 500 chars) and tool results keep up to 3000 chars (was 1500). The summarizer sees 'terminal(git status) → M src/config.py' instead of just '[Tool calls: terminal]'.	2026-03-21 08:14:14 -07:00
Teknium	28bb0e770f	fix(voice): enable TTS voice reply when streaming is active (#2322 ) When streaming is enabled, the base adapter receives None from _handle_message (already_sent=True) and cannot run auto-TTS for voice input. The runner was unconditionally skipping voice input TTS assuming the base adapter would handle it. Now the runner takes over TTS responsibility when streaming has already delivered the text response, so voice channel playback works with both streaming on and off. Streaming off behavior is unchanged (default already_sent=False preserves the original code path exactly). Co-authored-by: 0xbyt4 <35742124+0xbyt4@users.noreply.github.com>	2026-03-21 08:08:37 -07:00
Teknium	06f4df52f1	fix(install): add zprofile fallback and create zshrc on fresh macOS installs (#2320 ) On macOS, zsh users may not have ~/.zshrc if they haven't customized their shell yet. The installer would silently fail to add ~/.local/bin to PATH, causing 'hermes: command not found' after installation. - Check ~/.zprofile as fallback for zsh users (macOS login shell config) - Create ~/.zshrc if neither config file exists Cherry-picked from PR #2315 by erhnysr. Co-authored-by: erhnysr <erhnysr@users.noreply.github.com>	2026-03-21 07:30:43 -07:00
Teknium	a03cbcd5f9	Merge pull request #2317 from NousResearch/hermes/hermes-5d6932ba fix(cron): close abandoned coroutine when asyncio.run() raises RuntimeError	2026-03-21 07:21:18 -07:00
Teknium	df67ae730b	fix(cron): close abandoned coroutine when asyncio.run() raises RuntimeError Cherry-picked from PR #2290 by @Mibayy. Closes #2138. When asyncio.run() raises RuntimeError (running loop exists), the coroutine was created but never awaited, producing a RuntimeWarning on GC. Extract coro before try, call coro.close() in the except branch before falling back to ThreadPoolExecutor.	2026-03-21 07:20:58 -07:00
Teknium	9305164bf3	fix: add None-entry guard to tool_calls loops in run_agent, batch_runner, and mini_swe_runner (#2316 ) Co-authored-by: Dilee <uzmpsk.dilekakbas@gmail.com>	2026-03-21 07:20:41 -07:00
Teknium	453f4c5175	Merge pull request #2312 from NousResearch/hermes/hermes-31d7db3b fix(gateway): retry Telegram 409 polling conflicts before giving up	2026-03-21 07:19:43 -07:00
Teknium	37a9979459	fix(cron): stop injecting cron outputs into gateway session history (#2313 ) Cron deliveries were mirrored into the target gateway session as assistant-role messages, causing consecutive assistant messages that violate message alternation (issue #2221). Instead of fixing the role, remove the mirror injection entirely. Cron outputs already live in their own cron session and don't belong in the interactive conversation history. Delivered messages are now wrapped with a header (task name) and a footer noting the agent cannot see or respond to the message, so users have clear context about what they're reading. Closes #2221	2026-03-21 07:18:36 -07:00
Teknium	713f2f73da	fix(agent): inject model identity for Alibaba Coding Plan (#2314 ) fix(agent): inject model identity for Alibaba Coding Plan	2026-03-21 07:11:51 -07:00
Teknium	237499d102	Merge pull request #2311 from NousResearch/hermes/hermes-5d6932ba fix(toolsets): pass visited set by reference to prevent diamond dependency duplication	2026-03-21 07:11:27 -07:00
Teknium	3f811f52fd	fix(toolsets): pass visited set by reference to prevent diamond dependency duplication Cherry-picked from PR #2292 by @Mibayy. Closes #2134. resolve_toolset() called visited.copy() per sibling include, breaking dedup for diamond dependencies (D resolved twice via B and C paths) and causing duplicate cycle warnings. Fix: pass visited directly so siblings share the same set. The .copy() for the all/* alias at the top level is kept so each top-level toolset gets an independent pass. Removes the print() cycle warning since hitting a visited name now usually means diamond (not a bug).	2026-03-21 07:11:09 -07:00
ygd58	2ea8054304	fix(agent): inject model identity for Alibaba Coding Plan to work around API returning wrong model name	2026-03-21 07:11:08 -07:00
Teknium	488a30e879	fix(gateway): retry Telegram 409 polling conflicts before giving up A single Telegram 409 Conflict from getUpdates permanently killed Telegram polling with no recovery possible (retryable=False on first occurrence). This is too aggressive for production use with process supervisors. Transient 409s are expected during: - --replace handoffs where the old long-poll session lingers on Telegram servers for a few seconds after SIGTERM - systemd Restart=on-failure respawns that overlap with the dying instance cleanup Now _handle_polling_conflict() retries up to 3 times with a 10-second delay between attempts. The 30-second total retry window lets stale server-side sessions expire. If all retries fail, the error is still marked as permanently fatal — preserving the original protection against genuine dual-instance conflicts. Tests updated: split the single conflict test into two — one verifying retry on transient conflict, one verifying fatal after exhausted retries. Closes #2296	2026-03-21 07:11:06 -07:00
Teknium	bc3f425212	Merge pull request #2309 from NousResearch/hermes/hermes-5d6932ba fix(cli): correct truncated AUXILIARY_WEB_EXTRACT_API_KEY env var name	2026-03-21 07:09:47 -07:00
Teknium	fd1d6c03cb	fix(cli): correct truncated AUXILIARY_WEB_EXTRACT_API_KEY env var name Cherry-picked from PR #2295 by @dlkakbs. The web_extract auxiliary client api_key env var was literally stored as 'AUXILI..._KEY' (dots in the source) instead of the full name. Users configuring an auxiliary web_extract model with an API key would have auth failures because the key was written to a non-existent var.	2026-03-21 07:09:28 -07:00
Teknium	58b52dfb2f	Merge pull request #2303 from NousResearch/hermes/hermes-31d7db3b fix: remove synthetic error message injection, fix session resume after repeated failures	2026-03-21 07:03:54 -07:00
Teknium	651e92fbbf	fix: use git pull --ff-only in update/install to avoid divergent branch error (#2274 ) fix: use git pull --rebase in update/install to avoid divergent branch error	2026-03-21 06:33:22 -07:00
Teknium	779619f742	fix: remove synthetic error message injection, fix session resume after repeated failures Two changes to the error handler in the agent loop: 1. Remove the 'if not pending_handled' block that injected fake [System error during processing: ...] messages into conversation history. These polluted history, burned tokens on retries, and could violate role alternation by injecting as role=user. The tool_calls error-result path (role=tool) is preserved. 2. Append the error final_response as an assistant message when hitting the iteration limit, so session resume doesn't produce consecutive user messages.	2026-03-21 06:33:05 -07:00
Teknium	96a5e9fc11	feat(agent): add summary of successful tool actions in review agent Enhanced the review agent to scan and summarize successful tool actions, providing users with a compact overview of updates made during the review process. This includes actions related to memory and user profiles, improving user feedback and interaction clarity.	2026-03-21 06:31:59 -07:00
Teknium	eb537b5db4	fix(cli): prevent multiple reasoning boxes from rendering Added a check to suppress further reasoning rendering once the response box is open, preventing potential overlap of reasoning boxes during late thinking blocks. This enhances the user experience by maintaining a clean output in the CLI.	2026-03-21 06:28:47 -07:00
Teknium	2da79b13df	feat: priority-based context file selection + CLAUDE.md support (#2301 ) Previously, all project context files (AGENTS.md, .cursorrules, .hermes.md) were loaded and concatenated into the system prompt. This bloated the prompt with potentially redundant or conflicting instructions. Now only ONE project context type is loaded, using priority order: 1. .hermes.md / HERMES.md (walk to git root) 2. AGENTS.md / agents.md (recursive directory walk) 3. CLAUDE.md / claude.md (cwd only, NEW) 4. .cursorrules / .cursor/rules/*.mdc (cwd only) SOUL.md from HERMES_HOME remains independent and always loads. Also adds CLAUDE.md as a recognized context file format, matching the convention popularized by Claude Code. Refactored the monolithic function into four focused helpers: _load_hermes_md, _load_agents_md, _load_claude_md, _load_cursorrules. Tests: replaced 1 coexistence test with 10 new tests covering priority ordering, CLAUDE.md loading, case sensitivity, injection blocking.	2026-03-21 06:26:20 -07:00
Teknium	885f88fb60	feat(agent): suppress non-forced output during post-response housekeeping - Introduced a mechanism to mute output after the main response is delivered, ensuring that subsequent tool calls run without cluttering the CLI. - Redirected stdout to devnull during the review agent's execution to prevent any print statements from interfering with the main CLI display. - Added a new attribute `_mute_post_response` to manage output suppression effectively.	2026-03-20 23:54:42 -07:00
Teknium	3585019831	feat(cli): enhance user input display with consistent formatting - Added a user bar separator for improved visual clarity when displaying pasted text and user input in the HermesCLI. - Ensured consistent formatting for both multi-line and single-line user inputs, enhancing the overall user experience in the command-line interface. These changes contribute to a more organized and visually appealing output during interactions.	2026-03-20 23:36:49 -07:00
Teknium	6d7f3dbbb7	Merge pull request #2278 from NousResearch/hermes/hermes-5d6932ba fix(setup): add alibaba and deepseek to provider model selection	2026-03-20 22:50:18 -07:00
Test	71cf7ad11a	fix(setup): add alibaba to provider model selection Same bug as opencode-zen/go — alibaba fell through to the OpenRouter model list instead of using _setup_provider_model_selection() which probes the provider's own /models endpoint. All user-selectable providers now have correct model selection routing.	2026-03-20 22:48:59 -07:00
Teknium	b748fcf836	Merge pull request #2277 from NousResearch/hermes/hermes-5d6932ba fix(setup): OpenCode Zen/Go show OpenRouter models instead of their own	2026-03-20 22:42:33 -07:00
Test	7289256114	fix(setup): OpenCode Zen/Go show OpenRouter models instead of their own After selecting OpenCode Zen or Go as provider in hermes setup, the model selection page showed OpenRouter models because these providers weren't in the list that routes to _setup_provider_model_selection(). They fell through to the else branch which shows the OpenRouter catalog. Users ended up with an OpenCode API key but an OpenRouter model name, causing 'Provider resolver returned an empty API key' on first use. Fix: add opencode-zen and opencode-go to the provider list that uses _setup_provider_model_selection() for live /models detection.	2026-03-20 22:42:14 -07:00
Test	870ebb8850	fix: use git pull --ff-only in update/install to avoid divergent branch error Fresh installs without pull.rebase configured hit a git error when running hermes update because git doesn't know how to reconcile divergent branches. --ff-only is the right strategy: it works for the normal case (local branch is behind remote) and fails cleanly if the user somehow has local commits, rather than silently rebasing them.	2026-03-20 22:28:55 -07:00
Teknium	517b5c17d6	Merge pull request #2275 from NousResearch/hermes/hermes-5d6932ba chore: remove dead top-level toolsets config key	2026-03-20 22:27:35 -07:00
Test	d0ac8d9fc7	chore: remove dead top-level toolsets config key The top-level 'toolsets' key in config.yaml was never read at runtime. Tool selection uses platform_toolsets (per-platform) or the --toolsets CLI flag. The key existed in load_cli_config() defaults and the example config as 'toolsets: [all]', misleading users into thinking it controlled tool availability. - Remove from load_cli_config() hardcoded defaults - Remove from hermes config show output - Replace in cli-config.yaml.example with deprecation note pointing to platform_toolsets and hermes tools	2026-03-20 22:27:13 -07:00
Teknium	761a8ad39a	fix(display): show provider and endpoint in API error messages (#2266 ) fix(display): show provider and endpoint in API error messages	2026-03-20 21:57:53 -07:00
Teknium	52adc8873b	Merge pull request #2268 from NousResearch/hermes/hermes-5d6932ba fix(tools): disabled toolsets re-enable themselves after hermes tools	2026-03-20 21:57:39 -07:00
Test	173a5c6290	fix(tools): disabled toolsets re-enable themselves after hermes tools Two bugs in the save/load roundtrip for platform_toolsets: 1. _save_platform_tools preserved composite toolset entries (hermes-cli, hermes-telegram, etc.) because they weren't in configurable_keys. These composites include ALL _HERMES_CORE_TOOLS, so having hermes-cli in the saved list alongside individual keys negated any disables — the subset check always found the disabled toolset's tools via the composite entry. Fix: also filter out known TOOLSETS keys from preserved entries. Only truly unknown entries (MCP server names, custom entries) are kept. 2. _get_platform_tools used reverse subset inference to determine which configurable toolsets were enabled. This is inherently broken when tools appear in multiple toolsets (e.g. HA tools in both the homeassistant toolset and _HERMES_CORE_TOOLS). Fix: when the saved list contains explicit configurable keys (meaning the user has configured this platform), use direct membership instead of subset inference. The fallback path still handles legacy configs that only have a composite entry like hermes-cli.	2026-03-20 21:11:54 -07:00
Test	f3b2303428	fix(gateway): skip model auto-detection for custom/local providers Mirrors the CLI fix for the gateway /model handler. When the user is on a custom provider (provider=custom, localhost, or 127.0.0.1 endpoint), /model <name> no longer tries to auto-detect a provider switch. Previously, typing /model openrouter/nvidia/nemotron:free on Telegram while on a localhost endpoint would silently accept the model name on the local server — auto-detection failed to match the free model, so the provider stayed as custom with the localhost base_url. The user saw 'Model changed' but requests still went to localhost, which doesn't serve that model. Now shows the endpoint URL and provider:model syntax tip, matching the CLI behavior.	2026-03-20 21:07:48 -07:00
Test	1870069f80	fix(session_search): exclude current session lineage Cherry-picked from PR #2201 by @Gutslabs. session_search resolved hits to parent/root sessions but only excluded the exact current_session_id. If the active session was a child continuation (compression/delegation), its parent could still appear as a 'past' conversation result. Fix: resolve current_session_id to its lineage root before filtering, so the entire active lineage (parent and children) is excluded.	2026-03-20 21:07:48 -07:00
Test	d560f2d1f2	fix(display): show provider and endpoint in API error messages When an API call fails, the error output now shows the provider name, model, and endpoint URL so users can immediately identify which service rejected their request. Auth errors (401/403) get actionable guidance: check key validity, model access, and OpenRouter credits link. Before: 'API call failed (attempt 1/3): PermissionDeniedError' After: 'API call failed (attempt 1/3): PermissionDeniedError Provider: openrouter Model: anthropic/claude-sonnet-4 Endpoint: https://openrouter.ai/api/v1 Your API key was rejected by the provider. Check: • Is the key valid? Run: hermes setup • Does your account have access to anthropic/claude-sonnet-4? • Check credits: https://openrouter.ai/settings/credits'	2026-03-20 21:06:55 -07:00
Test	f7e2ed20fa	feat(cli): implement true-color ANSI support for response text - Added support for true-color ANSI escape codes in the HermesCLI to enhance the visual appearance of streamed content. - Introduced a fallback mechanism for text color in case of errors while retrieving the color from the active skin. - Updated the output formatting to include the new text color in both line emissions and buffer flushing. These changes improve the user experience by ensuring consistent and visually appealing text output in the command-line interface.	2026-03-20 21:02:36 -07:00
Test	10d719ac1b	fix(security): require opt-in for project plugin discovery	2026-03-20 20:50:30 -07:00
Teknium	45058b4105	feat: replace inline nudges with background memory/skill review (#2235 ) Remove the memory and skill nudges that were appended directly to user messages, causing backward-looking system instructions to compete with forward-looking user tasks. Found in 43% of user messages across 15 sessions, with confirmed cases of the agent spending tool calls on nudge responses before starting the user's actual request. Replace with a background review agent that runs AFTER the main agent finishes responding: - Spawns a background thread with a snapshot of the conversation - Uses the main model (not auxiliary) for high-precision memory/skill work - Only has memory + skill_manage tools (5 iteration budget) - Shares the memory store for direct writes - Never modifies the main conversation history - Never competes with the user's task for model attention - Zero latency impact (runs after response is delivered) - Same token cost (processes the same context, just on a separate track) The trigger conditions are unchanged (every 10 user turns for memory, after 10+ tool iterations for skills). Only the execution path changes: from inline injection to background fork. Closes #2227. Co-authored-by: Test <test@test.com>	2026-03-20 18:51:31 -07:00
Teknium	2416b2b7af	refactor(cli, banner): update gold ANSI color to true-color format (#2246 ) - Changed the ANSI escape code for gold color in cli.py and banner.py to use true-color format (#FFD700) for better visual consistency. - Enhanced the _on_tool_progress method in HermesCLI to update the TUI spinner with tool execution status, improving user feedback during operations. These changes improve the visual representation and user experience in the command-line interface. Co-authored-by: Test <test@test.com>	2026-03-20 18:17:38 -07:00
Teknium	4263350c5b	fix: remove post-compression file-read history injection (#2226 ) Remove the [Files already read — do NOT re-read these] user message that was injected into the conversation after context compression. This message used role='user' for system-generated content, creating a fake user turn that confused models about conversation state and could contribute to task-redo behavior. The file_tools.py read tracker (warn on 3rd consecutive read, block on 4th+) already handles re-read prevention inline without injecting synthetic messages. Closes #2224. Co-authored-by: Test <test@test.com>	2026-03-20 14:54:25 -07:00
Teknium	214047dee1	fix(display): suppress spinner animation in non-TTY environments (#2216 ) fix(display): suppress spinner animation in non-TTY environments	2026-03-20 12:55:54 -07:00

1 2 3 4 5 ...

2432 Commits