hermes-agent

Author	SHA1	Message	Date
Test	9c0f346258	fix: direct user message on STT failure + hermes-agent-setup skill When a user sends a voice message and STT isn't configured, the gateway now sends a clear message directly to the user explaining how to set up voice transcription, rather than relying on the agent to relay an injected context note (which often gets misinterpreted). Also adds a hermes-agent-setup bundled skill covering STT/TTS setup, tool configuration, dependency installation, and troubleshooting.	2026-03-18 03:01:41 -07:00
Teknium	11f029c311	fix(tts): document NeuTTS provider and align install guidance (#1903 ) Co-authored-by: charles-édouard <59705750+ccbbccbb@users.noreply.github.com>	2026-03-18 02:55:30 -07:00
Teknium	fb923d5efc	Merge pull request #1902 from NousResearch/hermes/hermes-b29f73b2 fix(gateway): PID-based wait with force-kill for gateway restart	2026-03-18 02:54:38 -07:00
Test	ace2cc6257	fix(gateway): PID-based wait with force-kill for gateway restart Add _wait_for_gateway_exit() that polls get_running_pid() to confirm the old gateway process has actually exited before starting a new one. If the process doesn't exit within 5s, sends SIGKILL to the specific PID. Uses the saved PID from gateway.pid (not launchd labels) so it works correctly with multiple gateway instances under separate HERMES_HOME directories. Applied to both launchd_restart() and the manual restart path (replaces the blind time.sleep(2)). Inspired by PR #1881 by @AzothZephyr (race condition diagnosis). Adds 4 tests.	2026-03-18 02:54:18 -07:00
Teknium	24ac577046	fix: respect model.default from config.yaml for openai-codex provider (#1896 ) When config.yaml had a non-default model (e.g. gpt-5.3-codex) and the provider was openai-codex, _normalize_model_for_provider() would replace it with the latest available codex model because _model_is_default only checked the CLI argument, not the config value. Now _model_is_default is False when config.yaml has a model that differs from the global fallback (anthropic/claude-opus-4.6), so the user's explicit config choice is preserved. Fixes #1887 Co-authored-by: Test <test@test.com>	2026-03-18 02:50:31 -07:00
Teknium	e86bfd7667	feat: upgrade MiniMax default to M2.7 + add new OpenRouter models (#1900 ) feat: upgrade MiniMax default to M2.7 + add new OpenRouter models	2026-03-18 02:43:19 -07:00
octo-patch	e4043633fc	feat: upgrade MiniMax default to M2.7 + add new OpenRouter models MiniMax: Add M2.7 and M2.7-highspeed as new defaults across provider model lists, auxiliary client, metadata, setup wizard, RL training tool, fallback tests, and docs. Retain M2.5/M2.1 as alternatives. OpenRouter: Add grok-4.20-beta, nemotron-3-super-120b-a12b:free, trinity-large-preview:free, glm-5-turbo, and hunter-alpha to the model catalog. MiniMax changes based on PR #1882 by @octo-patch (applied manually due to stale conflicts in refactored pricing module).	2026-03-18 02:42:58 -07:00
Teknium	927f4d3a37	fix(matrix): use correct reply_to_message_id parameter name (#1895 ) fix(matrix): use correct reply_to_message_id parameter name	2026-03-18 02:23:38 -07:00
Bartok9	66f71c1836	fix(matrix): use correct reply_to_message_id parameter name Fixes #1842 The MessageEvent dataclass expects 'reply_to_message_id' but the Matrix connector was passing 'reply_to'. This caused replies to fail with: MessageEvent.__init__() got an unexpected keyword argument 'reply_to' Changed the parameter name to match the dataclass definition.	2026-03-18 02:23:21 -07:00
Teknium	b1069196a6	Merge pull request #1894 from NousResearch/hermes/hermes-b29f73b2 fix(delegate): move _saved_tool_names save/restore to _run_single_child scope	2026-03-18 02:23:14 -07:00
Bartok9	ba7248c669	fix(delegate): move _saved_tool_names save/restore to _run_single_child scope Fixes #1802 The v0.3.0 refactor split child agent construction (_build_child_agent) and execution (_run_single_child) into separate functions. This created a scope bug where _saved_tool_names was defined in _build_child_agent but referenced in _run_single_child's finally block, causing a NameError on every delegate_task call. Solution: Move the save/restore logic entirely into _run_single_child, keeping the save and restore in the same scope as the try/finally block. This is cleaner than passing the variable through and removes the dead save from _build_child_agent.	2026-03-18 02:22:46 -07:00
Teknium	6fc4e36625	fix: search all sources by default in session_search (#1892 ) * fix: include ACP sessions in default search sources * fix: remove hardcoded source allowlist from session search The default source_filter was a hardcoded list that silently excluded any platform not explicitly listed. Instead of maintaining an ever-growing allowlist, remove it entirely so all sources are searched by default. Callers can still pass source_filter explicitly to narrow results. Follow-up to cherry-picked PR #1817. --------- Co-authored-by: someoneexistsontheinternet <154079416+someoneexistsontheinternet@users.noreply.github.com> Co-authored-by: Test <test@test.com>	2026-03-18 02:21:29 -07:00
Teknium	7d7c2a62dd	Merge pull request #1890 from NousResearch/hermes/hermes-b29f73b2 fix: OAuth flag stale after refresh/fallback, memory nudge never fires, dead code	2026-03-18 02:20:19 -07:00
Test	5b74df2bfc	fix: OAuth flag stale after refresh/fallback, memory nudge never fires, dead code - Update _is_anthropic_oauth in _try_refresh_anthropic_client_credentials() when token type changes during credential refresh - Set _is_anthropic_oauth in _try_activate_fallback() Anthropic path - Move _turns_since_memory and _iters_since_skill init to __init__ so nudge counters accumulate across run_conversation() calls in CLI mode - Remove unreachable retry_count >= max_retries block after raise Adds 7 regression tests. Salvaged from PR #1797 by @0xbyt4.	2026-03-18 02:19:57 -07:00
Teknium	f656dfcb32	Merge pull request #1840 from NousResearch/hermes/hermes-b29f73b2 fix: allow agent-created skills with caution-level findings	2026-03-17 16:33:04 -07:00
Test	0fab46f65c	fix: allow agent-created skills with caution-level findings Agent-created skills were using the same policy as community hub installs, blocking any skill with medium/high severity findings (e.g. docker pull, pip install, git clone). This meant the agent couldn't create skills that reference Docker or other common tools. Changed agent-created policy from (allow, block, block) to (allow, allow, block) — matching the trusted policy. Caution-level findings (medium/high severity) are now allowed through, while dangerous findings (critical severity like exfiltration, prompt injection, reverse shells) remain blocked. Added 4 tests covering the agent-created policy: safe allowed, caution allowed, dangerous blocked, force override.	2026-03-17 16:32:25 -07:00
Teknium	37dceb043e	fix: improve gateway error handling for 429 usage limits and 500 context overflow (#1839 ) fix: improve gateway error handling for 429 usage limits and 500 context overflow	2026-03-17 16:32:20 -07:00
silentconsensus	7ce374d3b9	Improve gateway error handling for 429 usage limits and 500 context overflow - Distinguish plan usage limits (429 with usage_limit_reached) from transient rate limits - Show approximate reset time in hours for plan limits - Treat HTTP 500 with large sessions as context overflow (same as 400) - Move history length check earlier for reuse across status codes	2026-03-17 16:32:01 -07:00
Teknium	6e4415e865	Merge pull request #1838 from NousResearch/hermes/hermes-b29f73b2 fix(context_compressor): replace print() calls with logger	2026-03-17 16:31:32 -07:00
Test	45bad9771d	fix(context_compressor): replace print() calls with logger Replaces all remaining print() calls in compress() with logger.info() and logger.warning() for consistency with the rest of the module. Inspired by PR #1822.	2026-03-17 16:31:01 -07:00
Teknium	8d60db0f6f	fix(discord): remove bugged followup messages + remove /ask command (#1836 ) fix(discord): remove bugged followup messages + remove /ask command	2026-03-17 16:28:36 -07:00
Test	1bee519a6f	fix(discord): remove redundant /ask slash command /ask was just 'send a message to the bot' via the slash command menu — completely redundant since Discord bots already listen to channel messages. Removed as part of salvaging PR #1827.	2026-03-17 16:25:09 -07:00
charliekerfoot	72bfa115a0	fix(discord): removebugged follow up messages from discord slash commands	2026-03-17 16:24:17 -07:00
Teknium	7f85b2914d	Merge pull request #1824 from cutepawss/fix/search-files-pagination Clean fix — adds pagination args to search_key for parity with read_file. Thanks @cutepawss!	2026-03-17 16:16:47 -07:00
Teknium	b8076bb0bd	feat: cron agents can suppress delivery with [SILENT] response (#1833 ) feat: cron agents can suppress delivery with [SILENT] response	2026-03-17 16:09:24 -07:00
Test	d35d923c76	feat: cron agents can suppress delivery with [SILENT] response Every cron job prompt now includes guidance that the agent can respond with [SILENT] when it has nothing new or noteworthy to report. The scheduler checks for this marker and skips delivery, while still saving output to disk for audit. Failed jobs always deliver regardless. This replaces the notify parameter approach from PR #1807 with a simpler always-on design — the model is smart enough to decide when there's nothing worth reporting without needing a per-job flag.	2026-03-17 16:06:49 -07:00
darya	a654bc04f7	fix(file_tools): include pagination args in repeated search key	2026-03-18 01:19:05 +03:00
Test	a71e3f4d98	fix: add /browser to COMMAND_REGISTRY so it shows in help and autocomplete The /browser command handler existed in cli.py but was never added to COMMAND_REGISTRY after the centralized command registry refactor. This meant: - /browser didn't appear in /help - No tab-completion or subcommand suggestions - Dispatch used _base_word fallback instead of canonical resolution Added CommandDef with connect/disconnect/status subcommands and switched dispatch to use canonical instead of _base_word.	2026-03-17 13:29:36 -07:00
Teknium	588962d24e	docs: escape {id} in api-server.md headings to fix MDX build (#1787 ) MDX v2+ interprets curly braces in regular markdown as JSX expressions. The headings 'GET /v1/responses/{id}' and 'DELETE /v1/responses/{id}' caused a ReferenceError during Docusaurus static site generation because 'id' is not a defined JavaScript variable. Escaped with backslashes. Co-authored-by: Test <test@test.com>	2026-03-17 11:04:37 -07:00
Teknium	2fa33dde81	fix: handle message length overflow in streaming mode (#1783 ) Stream consumer now splits messages that exceed the platform's MAX_MESSAGE_LENGTH. When accumulated text grows past the safe limit, the current message is finalized and a new message is started for the overflow — same as how normal sends chunk long responses. Split point prefers line boundaries (rfind newline) for clean breaks. Works for all platforms (Telegram 4096, Discord 2000, etc.) by reading the adapter's MAX_MESSAGE_LENGTH at runtime. Also added a safety net in the Telegram adapter: if edit_message_text still hits MESSAGE_TOO_LONG (e.g. markdown formatting expansion), it truncates and returns success so the stream consumer doesn't die. Co-authored-by: Test <test@test.com>	2026-03-17 11:00:52 -07:00
Teknium	7ac9088d5c	fix: Telegram streaming — config bridge, not-modified, flood control (#1782 ) * fix: NameError in OpenCode provider setup (prompt_text -> prompt) The OpenCode Zen and OpenCode Go setup sections used prompt_text() which is undefined. All other providers correctly use the local prompt() function defined in setup.py. Fixes crash during 'hermes setup' when selecting either OpenCode provider. * fix: Telegram streaming — config bridge, not-modified, flood control Three fixes for gateway streaming: 1. Bridge streaming config from config.yaml into gateway runtime. load_gateway_config() now reads the 'streaming' key from config.yaml (same pattern as session_reset, stt, etc.), matching the docs. Previously only gateway.json was read. 2. Handle 'Message is not modified' in Telegram edit_message(). This Telegram API error fires when editing with identical content — a no-op, not a real failure. Previously it returned success=False which made the stream consumer disable streaming entirely. 3. Handle RetryAfter / flood control in Telegram edit_message(). Fast providers can hit Telegram rate limits during streaming. Now waits the requested retry_after duration and retries once, instead of treating it as a fatal edit failure. Also fixed double-edit on stream finish: the consumer now tracks last-sent text and skips redundant edits, preventing the not-modified error at the source. * refactor: make config.yaml the primary gateway config source Eliminates the per-key bridge pattern in load_gateway_config(). Previously gateway.json was the primary source and each config.yaml key needed an individual bridge — easy to forget (streaming was missing, causing garl4546's bug). Now config.yaml is read first and its keys are mapped directly into the GatewayConfig.from_dict() schema. gateway.json is kept as a legacy fallback layer (loaded first, then overwritten by config.yaml keys). If gateway.json exists, a log message suggests migrating. Also: - Removed dead save_gateway_config() (never called anywhere) - Updated CLI help text and send_message error to reference config.yaml instead of gateway.json --------- Co-authored-by: Test <test@test.com>	2026-03-17 10:51:54 -07:00
Teknium	dd60bcbfb7	feat: OpenAI-compatible API server + WhatsApp configurable reply prefix (#1756 ) * feat: OpenAI-compatible API server platform adapter Salvaged from PR #956, updated for current main. Adds an HTTP API server as a gateway platform adapter that exposes hermes-agent via the OpenAI Chat Completions and Responses APIs. Any OpenAI-compatible frontend (Open WebUI, LobeChat, LibreChat, AnythingLLM, NextChat, ChatBox, etc.) can connect by pointing at http://localhost:8642/v1. Endpoints: - POST /v1/chat/completions — stateless Chat Completions API - POST /v1/responses — stateful Responses API with chaining - GET /v1/responses/{id} — retrieve stored response - DELETE /v1/responses/{id} — delete stored response - GET /v1/models — list hermes-agent as available model - GET /health — health check Features: - Real SSE streaming via stream_delta_callback (uses main's streaming) - In-memory LRU response store for Responses API conversation chaining - Named conversations via 'conversation' parameter - Bearer token auth (optional, via API_SERVER_KEY) - CORS support for browser-based frontends - System prompt layering (frontend system messages on top of core) - Real token usage tracking in responses Integration points: - Platform.API_SERVER in gateway/config.py - _create_adapter() branch in gateway/run.py - API_SERVER_* env vars in hermes_cli/config.py - Env var overrides in gateway/config.py _apply_env_overrides() Changes vs original PR #956: - Removed streaming infrastructure (already on main via stream_consumer.py) - Removed Telegram reply_to_mode (separate feature, not included) - Updated _resolve_model() -> _resolve_gateway_model() - Updated stream_callback -> stream_delta_callback - Updated connect()/disconnect() to use _mark_connected()/_mark_disconnected() - Adapted to current Platform enum (includes MATTERMOST, MATRIX, DINGTALK) Tests: 72 new tests, all passing Docs: API server guide, Open WebUI integration guide, env var reference * feat(whatsapp): make reply prefix configurable via config.yaml Reworked from PR #1764 (ifrederico) to use config.yaml instead of .env. The WhatsApp bridge prepends a header to every outgoing message. This was hardcoded to '⚕ Hermes Agent'. Users can now customize or disable it via config.yaml: whatsapp: reply_prefix: '' # disable header reply_prefix: '🤖 My Bot\n───\n' # custom prefix How it works: - load_gateway_config() reads whatsapp.reply_prefix from config.yaml and stores it in PlatformConfig.extra['reply_prefix'] - WhatsAppAdapter reads it from config.extra at init - When spawning bridge.js, the adapter passes it as WHATSAPP_REPLY_PREFIX in the subprocess environment - bridge.js handles undefined (default), empty (no header), or custom values with \\n escape support - Self-chat echo suppression uses the configured prefix Also fixes _config_version: was 9 but ENV_VARS_BY_VERSION had a key 10 (TAVILY_API_KEY), so existing users at v9 would never be prompted for Tavily. Bumped to 10 to close the gap. Added a regression test to prevent this from happening again. Credit: ifrederico (PR #1764) for the bridge.js implementation and the config version gap discovery. --------- Co-authored-by: Test <test@test.com>	2026-03-17 10:44:37 -07:00
Teknium	b5cf0f0aef	fix: preserve parent agent's tool list after subagent delegation (#1778 ) Save and restore the process-global _last_resolved_tool_names in _run_single_child() so the parent's execute_code sandbox generates correct tool imports after delegation completes. The global was already mostly mitigated (run_agent.py passes enabled_tools via self.valid_tool_names), but the global itself remained corrupted — a footgun for any code that reads it directly. Co-authored-by: shane9coy <shane9coy@users.noreply.github.com>	2026-03-17 10:31:38 -07:00
Teknium	9a1e971126	fix(stt): respect explicit provider config instead of env-var fallback (#1775 ) * fix(session): skip corrupt lines in load_transcript instead of crashing Wrap json.loads() in load_transcript() with try/except JSONDecodeError so that partial JSONL lines (from mid-write crashes like OOM/SIGKILL) are skipped with a warning instead of crashing the entire transcript load. The rest of the history loads fine. Adds a logger.warning with the session ID and truncated corrupt line content for debugging visibility. Salvaged from PR #1193 by alireza78a. Closes #1193 * fix(stt): respect explicit provider config instead of env-var fallback Rework _get_provider() to separate explicit config from auto-detect. When stt.provider is explicitly set in config.yaml, that choice is authoritative — no silent cross-provider fallback based on which env vars happen to be set. When no provider is configured, auto-detect still tries: local > groq > openai. This fixes the reported scenario where provider: local + a placeholder OPENAI_API_KEY caused the system to silently select OpenAI and fail with a 401. Closes #1774	2026-03-17 10:30:58 -07:00
Teknium	088d65605a	fix: NameError in OpenCode provider setup (prompt_text -> prompt) (#1779 ) The OpenCode Zen and OpenCode Go setup sections used prompt_text() which is undefined. All other providers correctly use the local prompt() function defined in setup.py. Fixes crash during 'hermes setup' when selecting either OpenCode provider.	2026-03-17 10:30:16 -07:00
teknium1	c881209b92	Revert "feat(cli): skin-aware light/dark theme mode with terminal auto-detection" This reverts commit `a1c81360a5`.	2026-03-17 10:04:53 -07:00
Teknium	d7a2e3ddae	fix: handle hyphenated FTS5 queries and preserve quoted literals (#1776 ) _sanitize_fts5_query() was stripping ALL double quotes (including properly paired ones), breaking user-provided quoted phrases like "exact phrase". Hyphenated terms like chat-send also silently expanded to chat AND send, returning unexpected or zero results. Fix: 1. Extract balanced quoted phrases into placeholders before stripping FTS5-special characters, then restore them. 2. Wrap unquoted hyphenated terms (word-word) in double quotes so FTS5 matches them as exact phrases instead of splitting on the hyphen. 3. Unmatched quotes are still stripped as before. Based on issue report by @bailob (#1770) and PR #1773 by @Jah-yee (whose branch contained unrelated changes and couldn't be merged directly). Closes #1770 Closes #1773 Co-authored-by: Jah-yee <Jah-yee@users.noreply.github.com>	2026-03-17 09:44:01 -07:00
Teknium	d5af593769	Merge pull request #1769 from sai-samarth/fix/whatsapp-send-message-support Clean merge — PR is current against main, tests pass, implementation matches existing gateway WhatsApp bridge pattern.	2026-03-17 09:42:01 -07:00
Teknium	df74f86955	Merge pull request #1767 from sai-samarth/fix/systemd-node-path-whatsapp Clean fix for nvm/non-standard Node.js paths in systemd units. Merges cleanly.	2026-03-17 09:41:39 -07:00
sai-samarth	a3de843fdb	test: replace real-looking WhatsApp jid in regression test	2026-03-17 15:38:37 +00:00
sai-samarth	dc15bc508f	fix(tools): add outbound WhatsApp send_message routing	2026-03-17 15:31:13 +00:00
sai-samarth	b8eb7c5fed	fix(gateway): include resolved node path in systemd unit	2026-03-17 15:11:28 +00:00
Teknium	548cedb869	fix(context_compressor): prevent consecutive same-role messages after compression (#1743 ) compress() checks both the head and tail neighbors when choosing the summary message role. When only the tail collides, the role is flipped. When BOTH roles would create consecutive same-role messages (e.g. head=assistant, tail=user), the summary is merged into the first tail message instead of inserting a standalone message that breaks role alternation and causes API 400 errors. The previous code handled head-side collision but left the tail-side uncovered — long conversations would crash mid-reply with no useful error, forcing the user to /reset and lose session history. Based on PR #1186 by @alireza78a, with improved double-collision handling (merge into tail instead of unconditional 'user' fallback). Co-authored-by: alireza78a <alireza78.crypto@gmail.com>	2026-03-17 05:18:52 -07:00
Teknium	702191049f	fix(session): skip corrupt lines in load_transcript instead of crashing (#1744 ) Wrap json.loads() in load_transcript() with try/except JSONDecodeError so that partial JSONL lines (from mid-write crashes like OOM/SIGKILL) are skipped with a warning instead of crashing the entire transcript load. The rest of the history loads fine. Adds a logger.warning with the session ID and truncated corrupt line content for debugging visibility. Salvaged from PR #1193 by alireza78a. Closes #1193	2026-03-17 05:18:12 -07:00
Teknium	aea39eeafb	Merge pull request #1736 from NousResearch/fix/gateway-platform-hardening fix(gateway): SMS session-per-send + Matrix bare media types break downstream processing	2026-03-17 04:46:25 -07:00
Teknium	23a3f01b2b	Merge pull request #1735 from NousResearch/fix/tool-handler-safety fix(tools): browser handlers TypeError on unexpected LLM params + fuzzy_match docstring	2026-03-17 04:46:22 -07:00
Teknium	af118501b9	Merge pull request #1733 from NousResearch/fix/defensive-hardening fix: defensive hardening — logging, dedup, locks, dead code	2026-03-17 04:46:20 -07:00
Teknium	d1d17f4f0a	feat(compression): add summary_base_url + move compression config to YAML-only - Add summary_base_url config option to compression block for custom OpenAI-compatible endpoints (e.g. zai, DeepSeek, Ollama) - Remove compression env var bridges from cli.py and gateway/run.py (CONTEXT_COMPRESSION_* env vars no longer set from config) - Switch run_agent.py to read compression config directly from config.yaml instead of env vars - Fix backwards-compat block in _resolve_task_provider_model to also fire when auxiliary.compression.provider is 'auto' (DEFAULT_CONFIG sets this, which was silently preventing the compression section's summary_* keys from being read) - Add test for summary_base_url config-to-client flow - Update docs to show compression as config.yaml-only Closes #1591 Based on PR #1702 by @uzaylisak	2026-03-17 04:46:15 -07:00
teknium1	6832d60bc0	fix(gateway): SMS persistent HTTP session + Matrix MIME media types 1. sms.py: Replace per-send aiohttp.ClientSession with a persistent session created in connect() and closed in disconnect(). Each outbound SMS no longer pays the TCP+TLS handshake cost. Falls back to a temporary session if the persistent one isn't available. 2. matrix.py: Use proper MIME types (image/png, audio/ogg, video/mp4) instead of bare category words (image, audio, video). The gateway's media processing checks startswith('image/') and startswith('audio/') so bare words caused Matrix images to skip vision enrichment and Matrix audio to skip transcription. Now extracts the actual MIME type from the nio event's content info when available.	2026-03-17 04:35:14 -07:00
teknium1	ea95462998	fix(tools): browser handler safety + fuzzy_match docstring accuracy 1. browser_tool.py: Replace args spread on browser_click, browser_type, and browser_scroll handlers with explicit parameter extraction. The args pattern passed all dict keys as keyword arguments, causing TypeError if the LLM sent unexpected parameters. Now extracts only the expected params (ref, text, direction) with safe defaults. 2. fuzzy_match.py: Update module docstring to match actual strategy order in code. Block anchor was listed as #3 but is actually #7. Multi-occurrence is not a separate strategy but a flag. Updated count from 9 to 8.	2026-03-17 04:32:39 -07:00

1 2 3 4 5 ...

2248 Commits