hermes-agent

Author	SHA1	Message	Date
Test	35558dadf4	Merge PR #2061 : fix(security): eliminate SQL string formatting in execute() calls Authored by dusterbloom. Closes #1911. Pre-computes SQL query strings at class definition time in insights.py, adds identifier quoting for ALTER TABLE DDL in hermes_state.py, and adds 4 regression tests verifying query construction safety.	2026-03-19 09:52:00 -07:00
Test	ae8059ca24	fix(delegate): move _saved_tool_names assignment to correct scope The merge at `e7844e9c` re-introduced a line in _build_child_agent() that references _saved_tool_names — a variable only defined in _run_single_child(). This caused NameError on every delegate_task call, completely breaking subagent delegation. Moves the child._delegate_saved_tool_names assignment to _run_single_child() where _saved_tool_names is actually defined, keeping the save/restore in the same scope as the try/finally block. Adds two regression tests from PR #2038 (YanSte). Also fixes the same issue reported in PR #2048 (Gutslabs). Co-authored-by: Yannick Stephan <yannick.stephan@gmail.com> Co-authored-by: Guts <gutslabs@users.noreply.github.com>	2026-03-19 09:26:05 -07:00
Peppi Littera	219af75704	fix(security): eliminate SQL string formatting in execute() calls Closes #1911 - insights.py: Pre-compute SELECT queries as class constants instead of f-string interpolation at runtime. _SESSION_COLS is now evaluated once at class definition time. - hermes_state.py: Add identifier quoting and whitelist validation for ALTER TABLE column names in schema migrations. - Add 4 tests verifying no injection vectors in SQL query construction.	2026-03-19 15:16:35 +01:00
Teknium	d76fa7fc37	fix: detect context length for custom model endpoints via fuzzy matching + config override (#2051 ) * fix: detect context length for custom model endpoints via fuzzy matching + config override Custom model endpoints (non-OpenRouter, non-known-provider) were silently falling back to 2M tokens when the model name didn't exactly match what the endpoint's /v1/models reported. This happened because: 1. Endpoint metadata lookup used exact match only — model name mismatches (e.g. 'qwen3.5:9b' vs 'Qwen3.5-9B-Q4_K_M.gguf') caused a miss 2. Single-model servers (common for local inference) required exact name match even though only one model was loaded 3. No user escape hatch to manually set context length Changes: - Add fuzzy matching for endpoint model metadata: single-model servers use the only available model regardless of name; multi-model servers try substring matching in both directions - Add model.context_length config override (highest priority) so users can explicitly set their model's context length in config.yaml - Log an informative message when falling back to 2M probe, telling users about the config override option - Thread config_context_length through ContextCompressor and AIAgent init Tests: 6 new tests covering fuzzy match, single-model fallback, config override (including zero/None edge cases). * fix: auto-detect local model name and context length for local servers Cherry-picked from PR #2043 by sudoingX. - Auto-detect model name from local server's /v1/models when only one model is loaded (no manual model name config needed) - Add n_ctx_train and n_ctx to context length detection keys for llama.cpp - Query llama.cpp /props endpoint for actual allocated context (not just training context from GGUF metadata) - Strip .gguf suffix from display in banner and status bar - _auto_detect_local_model() in runtime_provider.py for CLI init Co-authored-by: sudo <sudoingx@users.noreply.github.com> * fix: revert accidental summary_target_tokens change + add docs for context_length config - Revert summary_target_tokens from 2500 back to 500 (accidental change during patching) - Add 'Context Length Detection' section to Custom & Self-Hosted docs explaining model.context_length config override --------- Co-authored-by: Test <test@test.com> Co-authored-by: sudo <sudoingx@users.noreply.github.com>	2026-03-19 06:01:16 -07:00
Teknium	7b6d14e62a	fix(gateway): replace bare text approval with /approve and /deny commands (#2002 ) The gateway approval system previously intercepted bare 'yes'/'no' text from the user's next message to approve/deny dangerous commands. This was fragile and dangerous — if the agent asked a clarify question and the user said 'yes' to answer it, the gateway would execute the pending dangerous command instead. (Fixes #1888) Changes: - Remove bare text matching ('yes', 'y', 'approve', 'ok', etc.) from _handle_message approval check - Add /approve and /deny as gateway-only slash commands in the command registry - /approve supports scoping: /approve (one-time), /approve session, /approve always (permanent) - Add 5-minute timeout for stale approvals - Gateway appends structured instructions to the agent response when a dangerous command is pending, telling the user exactly how to respond - 9 tests covering approve, deny, timeout, scoping, and verification that bare 'yes' no longer triggers execution Credit to @solo386 and @FlyByNight69420 for identifying and reporting this security issue in PR #1971 and issue #1888. Co-authored-by: Test <test@test.com>	2026-03-18 16:58:20 -07:00
Teknium	67d707e851	fix: respect config.yaml model.base_url for Anthropic provider (#1948 ) (#1998 ) After #1675 removed ANTHROPIC_BASE_URL env var support, the Anthropic provider base URL was hardcoded to https://api.anthropic.com. Now reads model.base_url from config.yaml as an override, falling back to the default when not set. Also applies to the auxiliary client. Cherry-picked from PR #1949 by @rivercrab26. Co-authored-by: rivercrab26 <rivercrab26@users.noreply.github.com>	2026-03-18 16:51:24 -07:00
Teknium	e648863d52	docs: fix documentation inconsistencies across reference and user guides - toolsets-reference: add browser_console to browser + all platform toolsets, add missing hermes-acp, hermes-sms, messaging toolsets, correct hermes-gateway as composite, deduplicate platform toolset listings - tools-reference: add missing vision and web toolset sections - slash-commands: fix /new+/reset as alias (not separate commands), add /stop to CLI section (available in both CLI and gateway), add /plugins command, fix Notes section about messaging-only vs CLI-only - environment-variables: fix HERMES_MAX_ITERATIONS default (90 not 60), add DEEPSEEK_API_KEY/BASE_URL, OPENCODE_ZEN/GO keys, TAVILY_API_KEY, GITHUB_TOKEN, HERMES_EPHEMERAL_SYSTEM_PROMPT - configuration: remove duplicate Alibaba Cloud row, add OpenCode Zen/Go providers - cli-commands: add missing providers to --provider list (opencode-zen, opencode-go, ai-gateway, kilocode, alibaba) - quickstart: add OpenCode Zen and OpenCode Go to provider table Co-authored-by: Test <test@test.com>	2026-03-18 16:26:27 -07:00
Teknium	a7cc1cf309	fix: support Anthropic-compatible endpoints for third-party providers (#1997 ) Three bugs prevented providers like MiniMax from using their Anthropic-compatible endpoints (e.g. api.minimax.io/anthropic): 1. _VALID_API_MODES was missing 'anthropic_messages', so explicit api_mode config was silently rejected and defaulted to chat_completions. 2. API-key provider resolution hardcoded api_mode to 'chat_completions' without checking model config or detecting Anthropic-compatible URLs. 3. run_agent.py auto-detection only recognized api.anthropic.com, not third-party endpoints using the /anthropic URL convention. Fixes: - Add 'anthropic_messages' to _VALID_API_MODES - API-key providers now check model config api_mode and auto-detect URLs ending in /anthropic - run_agent.py and fallback logic detect /anthropic URL convention - 5 new tests covering all scenarios Users can now either: - Set MINIMAX_BASE_URL=https://api.minimax.io/anthropic (auto-detected) - Set api_mode: anthropic_messages in model config (explicit) - Use custom_providers with api_mode: anthropic_messages Co-authored-by: Test <test@test.com>	2026-03-18 16:26:06 -07:00
Teknium	f24db23458	fix: custom provider uses config base_url and api_key over env vars (#1760 ) (#1994 ) When provider: custom is set in config.yaml with base_url and api_key, those values are now used instead of falling back to OPENAI_BASE_URL and OPENAI_API_KEY env vars. Also reads the 'api' field as an alternative to 'api_key' for config compatibility. Cherry-picked from PR #1762 by crazywriter1. Co-authored-by: crazywriter1 <53251494+crazywriter1@users.noreply.github.com>	2026-03-18 16:00:14 -07:00
Teknium	d132e344d7	fix(agent): prevent silent tool result loss during context compression (#1993 ) _align_boundary_backward only checked messages[idx-1] to decide if the compress-end boundary splits a tool_call/result group. When an assistant issues 3+ parallel tool calls, their results span multiple consecutive messages. If the boundary fell in the middle of that group, the parent assistant was summarized away and orphaned tool results were silently deleted by _sanitize_tool_pairs. Now walks backward through all consecutive tool results to find the parent assistant, then pulls the boundary before the entire group. 6 regression tests added in tests/test_compression_boundary.py. Co-authored-by: Guts <Gutslabs@users.noreply.github.com>	2026-03-18 15:22:51 -07:00
Teknium	22f41daded	fix: send error details to user in gateway outer exception handler Previously, if an error occurred during response processing in _process_message_background (e.g. during extract_media, send, or any uncaught exception from the handler), the error was only logged to server console and the user was left with radio silence — typing indicator stops but no message arrives. Now the outer except block attempts to send the error type and detail (truncated to 300 chars) to the user's chat, matching the format already used by the inner handler in gateway/run.py. Co-authored-by: Test <test@test.com>	2026-03-18 10:42:43 -07:00
Teknium	7c7feaa033	Merge pull request #1929 from NousResearch/hermes/hermes-b29f73b2 feat: inject model and provider into system prompt	2026-03-18 04:18:41 -07:00
Teknium	2f80bd9f87	fix: whatsapp reply_prefix config.yaml bridging was dead code (#1923 ) The whatsapp reply_prefix bridging referenced config.platforms before the config object was constructed, making it a silent NameError caught by except Exception: pass. Fix: fold reply_prefix into the per-platform bridging loop (introduced in #1919) which correctly writes to gw_data dict pre-construction. Removes the broken standalone whatsapp bridging block. Co-authored-by: Test <test@test.com>	2026-03-18 04:18:33 -07:00
Teknium	23e5e8dde9	Merge pull request #1928 from NousResearch/hermes/hermes-ba3c8fa1 chore: trim huggingface-hub skill description	2026-03-18 04:18:27 -07:00
Test	e99aca98ab	feat: inject model and provider into system prompt Adds model name and provider to the system prompt metadata block, alongside the existing session ID and timestamp. These are frozen at session start and don't change mid-conversation, so they won't break prompt caching.	2026-03-18 04:18:26 -07:00
Test	7e30e97a59	chore: trim redundant trigger sentence from huggingface-hub description	2026-03-18 04:18:13 -07:00
Teknium	db4dfea7ec	docs: document SOUL.md as primary agent identity (#1927 ) Update all SOUL.md documentation to reflect that it now occupies slot #1 in the system prompt, replacing the hardcoded default identity. Updated pages: - user-guide/features/personality.md — SOUL.md is primary identity, not just a layer - developer-guide/prompt-assembly.md — updated prompt layer order, context files list - guides/use-soul-with-hermes.md — SOUL.md replaces built-in identity - user-guide/configuration.md — updated context files table and directory tree Co-authored-by: Test <test@test.com>	2026-03-18 04:18:08 -07:00
Teknium	17254a7692	Merge pull request #1926 from NousResearch/hermes/hermes-ba3c8fa1 chore: add search to huggingface-hub skill description	2026-03-18 04:15:17 -07:00
Test	adf188c439	chore: add search to huggingface-hub skill description	2026-03-18 04:15:03 -07:00
Teknium	21958a55d1	Merge pull request #1925 from NousResearch/hermes/hermes-ba3c8fa1 chore: tighten huggingface-hub skill description	2026-03-18 04:11:43 -07:00
Test	947827bba0	chore: tighten huggingface-hub skill description	2026-03-18 04:11:33 -07:00
Teknium	e4a3ffa9c1	feat: use SOUL.md as primary agent identity instead of hardcoded default (#1922 ) SOUL.md now loads in slot #1 of the system prompt, replacing the hardcoded DEFAULT_AGENT_IDENTITY. This lets users fully customize the agent's identity and personality by editing ~/.hermes/SOUL.md without it conflicting with the built-in identity text. When SOUL.md is loaded as identity, it's excluded from the context files section to avoid appearing twice. When SOUL.md is missing, empty, unreadable, or skip_context_files is set, the hardcoded DEFAULT_AGENT_IDENTITY is used as a fallback. The default SOUL.md (seeded on first run) already contains the full Hermes personality, so existing installs are unaffected. Co-authored-by: Test <test@test.com>	2026-03-18 04:11:20 -07:00
Teknium	1fa3737134	feat: GitHub Copilot provider integration (#1924 ) feat: GitHub Copilot provider integration with OAuth auth, API routing, and docs	2026-03-18 04:09:30 -07:00
Test	e7844e9c8d	Merge origin/main, resolve conflicts (self._base_url_lower)	2026-03-18 04:09:00 -07:00
Teknium	1c761ae042	feat: add huggingface-hub bundled skill (#1921 ) feat: add huggingface-hub bundled skill	2026-03-18 04:08:00 -07:00
Test	56ca84f243	feat: add huggingface-hub bundled skill Adds the Hugging Face CLI (hf) reference as a built-in skill under mlops/. Covers downloading/uploading models and datasets, repo management, SQL queries on datasets, inference endpoints, Spaces, buckets, and more. Based on the official HF skill from huggingface/skills.	2026-03-18 04:07:41 -07:00
Test	04101bc59e	docs: comprehensive GitHub Copilot provider documentation - Add dedicated GitHub Copilot section in configuration guide with: - Auth options (OAuth device code, env vars, gh CLI) - Token type table (supported vs unsupported) - API routing explanation (GPT-5+ → Responses, others → Chat) - Copilot ACP setup instructions - Environment variable reference - Add all Copilot env vars to environment-variables.md: COPILOT_GITHUB_TOKEN, HERMES_COPILOT_ACP_COMMAND, etc. - Add copilot-acp to --provider list in cli-commands.md - Docs build verified	2026-03-18 04:07:34 -07:00
Teknium	0a247a50f2	feat: support ignoring unauthorized gateway DMs (#1919 ) Add unauthorized_dm_behavior config (pair\|ignore) with global default and per-platform override. WhatsApp can silently drop unknown DMs instead of sending pairing codes. Adapted config bridging to work with gw_data dict (pre-construction) rather than config object. Dropped implementation plan document. Co-authored-by: Frederico Ribeiro <fr@tecompanytea.com>	2026-03-18 04:06:08 -07:00
Teknium	0e2714acea	fix(cron): recover recent one-shot jobs (#1918 ) Co-authored-by: Frederico Ribeiro <fr@tecompanytea.com>	2026-03-18 04:06:02 -07:00
Test	36921a3e98	fix: correct Copilot API mode selection to match opencode The previous copilot_model_api_mode() checked the catalog's supported_endpoints first and picked /chat/completions when a model supported both endpoints. This is wrong — GPT-5+ models should use the Responses API even when the catalog lists both. Replicate opencode's shouldUseCopilotResponsesApi() logic: - GPT-5+ models (gpt-5.4, gpt-5.3-codex, etc.) → Responses API - gpt-5-mini → Chat Completions (explicit exception) - Everything else (gpt-4o, claude, gemini, etc.) → Chat Completions - Model ID pattern is the primary signal, catalog is secondary The catalog fallback now only matters for non-GPT-5 models that might exclusively support /v1/messages (e.g. Claude via Copilot). Models are auto-detected from the live catalog at api.githubcopilot.com/models — no hardcoded list required for supported models, only a static fallback for when the API is unreachable.	2026-03-18 03:54:50 -07:00
Teknium	c1a127c87c	Merge pull request #1917 from NousResearch/hermes/hermes-b29f73b2 feat(cli): add /statusbar command to toggle context bar	2026-03-18 03:50:05 -07:00
Test	c1750bb32d	feat(cli): add /statusbar command to toggle context bar Adds /statusbar (alias /sb) to show/hide the bottom status bar that displays model name, context usage, and session duration. Uses ConditionalContainer so the bar takes zero space when hidden rather than leaving a blank line.	2026-03-18 03:49:49 -07:00
Teknium	4699c226da	chore: reorder OpenRouter model catalog (#1916 ) chore: reorder OpenRouter model catalog	2026-03-18 03:31:19 -07:00
Test	b05f9b6256	chore: reorder OpenRouter catalog — glm-5-turbo under glm-5, minimax under stepfun	2026-03-18 03:31:04 -07:00
Teknium	0679712d26	feat: reorder OpenRouter catalog, add haiku-4.5, fix minimax slug (#1915 ) feat: reorder OpenRouter catalog, add haiku-4.5, fix minimax slug	2026-03-18 03:26:22 -07:00
Test	cb54750e07	feat: reorder OpenRouter catalog, add haiku-4.5, fix minimax slug - Add anthropic/claude-haiku-4.5 - Move gpt-5.4-pro and gpt-5.4-nano to bottom - Fix minimax/minimax-m2.7 → minimax-m2.5 (m2.7 not on OpenRouter) - Tag hunter-alpha and healer-alpha as free - Place hunter/healer-alpha right below gpt-5.4-mini	2026-03-18 03:26:06 -07:00
Test	21c45ba0ac	feat: proper Copilot auth with OAuth device code flow and token validation Builds on PR #1879's Copilot integration with critical auth improvements modeled after opencode's implementation: - Add hermes_cli/copilot_auth.py with: - OAuth device code flow (copilot_device_code_login) using the same client_id (Ov23li8tweQw6odWQebz) as opencode and Copilot CLI - Token type validation: reject classic PATs (ghp_*) with a clear error message explaining supported token types - Proper env var priority: COPILOT_GITHUB_TOKEN > GH_TOKEN > GITHUB_TOKEN (matching Copilot CLI documentation) - copilot_request_headers() with Openai-Intent, x-initiator, and Copilot-Vision-Request headers (matching opencode) - Update auth.py: - PROVIDER_REGISTRY copilot entry uses correct env var order - _resolve_api_key_provider_secret delegates to copilot_auth for the copilot provider with proper token validation - Update models.py: - copilot_default_headers() now includes Openai-Intent and x-initiator - Update main.py: - _model_flow_copilot offers OAuth device code login when no token is found, with manual token entry as fallback - Shows supported vs unsupported token types - 22 new tests covering token validation, env var priority, header generation, and integration with existing auth infrastructure	2026-03-18 03:25:58 -07:00
Teknium	c0c14e60b4	fix: make concurrent tool batching path-aware for file mutations (#1914 ) * Improve tool batching independence checks * fix: address review feedback on path-aware batching - Log malformed/non-dict tool arguments at debug level before falling back to sequential, instead of silently swallowing the error into an empty dict - Guard empty paths in _paths_overlap (unreachable in practice due to upstream filtering, but makes the invariant explicit) - Add tests: malformed JSON args, non-dict args, _paths_overlap unit tests including empty path edge cases - web_crawl is not a registered tool (only web_search/web_extract are); no addition needed to _PARALLEL_SAFE_TOOLS --------- Co-authored-by: kshitij <82637225+kshitijk4poor@users.noreply.github.com>	2026-03-18 03:25:38 -07:00
Teknium	050b43108c	feat: add gpt-5.4-mini, gpt-5.4-nano, healer-alpha to OpenRouter catalog (#1913 ) feat: add gpt-5.4-mini, gpt-5.4-nano, healer-alpha to OpenRouter catalog	2026-03-18 03:23:36 -07:00
Test	00cc0c6a28	feat: add gpt-5.4-mini, gpt-5.4-nano, healer-alpha to OpenRouter catalog	2026-03-18 03:23:20 -07:00
Teknium	bee13d9921	Merge pull request #1912 from NousResearch/hermes/hermes-b29f73b2 fix(banner): normalize toolset labels and use skin colors	2026-03-18 03:23:15 -07:00
Test	f814787144	fix(banner): normalize toolset labels and use skin colors - Strip '_tools' suffix from internal toolset identifiers in the banner (e.g. 'web_tools' -> 'web', 'homeassistant_tools' -> 'homeassistant') - Stop appending '_tools' to unavailable toolset names - Replace 6 hardcoded hex colors (#B8860B, #FFBF00, #FFF8DC) in toolset rows, overflow line, and MCP server rows with the skin variables (dim, accent, text) already resolved at the top of the function Inspired by PR #1871 by @kshitijk4poor. Adds 4 tests.	2026-03-18 03:22:58 -07:00
Teknium	c9bb0c587f	fix: direct user message on STT failure + hermes-agent-setup skill (#1905 ) fix: direct user message on STT failure + hermes-agent-setup skill	2026-03-18 03:21:12 -07:00
Test	8422196e89	Merge PR #1879 : feat: integrate GitHub Copilot providers	2026-03-18 03:18:33 -07:00
Teknium	b70dd51cfa	fix: disabled skills respected across banner, system prompt, slash commands, and skill_view (#1897 ) * fix: banner skill count now respects disabled skills and platform filtering The banner's get_available_skills() was doing a raw rglob scan of ~/.hermes/skills/ without checking: - Whether skills are disabled (skills.disabled config) - Whether skills match the current platform (platforms: frontmatter) This caused the banner to show inflated skill counts (e.g. '100 skills' when many are disabled) and list macOS-only skills on Linux. Fix: delegate to _find_all_skills() from tools/skills_tool which already handles both platform gating and disabled-skill filtering. * fix: system prompt and slash commands now respect disabled skills Two more places where disabled skills were still surfaced: 1. build_skills_system_prompt() in prompt_builder.py — disabled skills appeared in the <available_skills> system prompt section, causing the agent to suggest/load them despite being disabled. 2. scan_skill_commands() in skill_commands.py — disabled skills still registered as /skill-name slash commands in CLI help and could be invoked. Both now load _get_disabled_skill_names() and filter accordingly. * fix: skill_view blocks disabled skills skill_view() checked platform compatibility but not disabled state, so the agent could still load and read disabled skills directly. Now returns a clear error when a disabled skill is requested, telling the user to enable it via hermes skills or inspect the files manually. --------- Co-authored-by: Test <test@test.com>	2026-03-18 03:17:37 -07:00
Test	190c07975d	fix: check skill availability before hinting at hermes-agent-setup Only mention the hermes-agent-setup skill in STT failure notes (both the direct user message and the agent context note) when the skill is actually installed. Uses _find_skill() from skill_manager_tool. Also confirmed: STT is the only user-facing failure case where the setup skill hint helps. Vision failures are transient API issues, runtime transcription errors indicate a configured-but-broken provider, and platform startup warnings are server logs.	2026-03-18 03:17:23 -07:00
Teknium	011ed540dd	Merge pull request #1909 from NousResearch/hermes/hermes-b29f73b2 docs: fix MCP install commands — use uv, not bare pip	2026-03-18 03:15:15 -07:00
Test	a9c405fac9	docs: fix MCP install commands — use uv, not bare pip The standard install already includes MCP via .[all]. For users who need to add it separately, the correct command is: cd ~/.hermes/hermes-agent && uv pip install -e ".[mcp]" The venv is created by uv, so bare 'pip' isn't available. All four occurrences across 3 docs pages updated.	2026-03-18 03:14:58 -07:00
Teknium	9c174e0940	Merge pull request #1908 from NousResearch/hermes/hermes-b29f73b2 fix(gateway): detect script-style gateway processes for --replace	2026-03-18 03:13:21 -07:00
TheSameCat2	5c4c4b8b7d	fix(gateway): detect script-style gateway processes for --replace Recognize hermes_cli/main.py gateway command lines in gateway process detection and PID validation so --replace reliably finds existing gateway instances. Adds a regression test covering script-style cmdline detection. Closes #1830	2026-03-18 03:12:59 -07:00

1 2 3 4 5 ...

2304 Commits