hermes-agent

Author	SHA1	Message	Date
teknium1	a9fdd8dc3c	Merge PR #752: feat(ux): improve /help formatting with command categories Authored by Bartok9. Organizes /help output into categories (Session, Configuration, Tools & Skills, Info, Exit) for better readability. Fixes #640.	2026-03-10 23:45:41 -07:00
Bartok Moltbot	8eb9eed074	feat(ux): improve /help formatting with command categories (#640) - Organize COMMANDS into COMMANDS_BY_CATEGORY dict - Group commands: Session, Configuration, Tools & Skills, Info, Exit - Add visual category headers with spacing - Maintain backwards compat via flat COMMANDS dict - Better visual hierarchy and scannability Before: /help - Show this help message /tools - List available tools ... (dense list) After: ── Session ── /new Start a new conversation /reset Reset conversation only ... ── Configuration ── /config Show current configuration ... Closes #640	2026-03-10 23:45:36 -07:00
teknium1	909e048ad4	fix: integration hardening for gateway token tracking Follow-up to `58dbd81` — ensures smooth transition for existing users: - Backward compat: old session files without last_prompt_tokens default to 0 via data.get('last_prompt_tokens', 0) - /compress, /undo, /retry: reset last_prompt_tokens to 0 after rewriting transcripts (stale token counts would under-report) - Auto-compression hygiene: reset last_prompt_tokens after rewriting - update_session: use None sentinel (not 0) as default so callers can explicitly reset to 0 while normal calls don't clobber - 6 new tests covering: default value, serialization roundtrip, old-format migration, set/reset/no-change semantics - /reset: new SessionEntry naturally gets last_prompt_tokens=0 2942 tests pass.	2026-03-10 23:40:24 -07:00
teyrebaz33	5eb62ef423	test(gateway): add regression test for /retry response fix Adds two tests for _handle_retry_command: verifies /retry returns the agent response (not None), and verifies graceful handling when no previous message exists. Cherry-picked from PR #731 by teyrebaz33. Regression coverage for the fix merged in PR #441. Co-authored-by: teyrebaz33 <teyrebaz33@users.noreply.github.com>	2026-03-10 23:34:52 -07:00
teknium1	58dbd81f03	fix: use actual API token counts for gateway compression pre-check Root cause of aggressive gateway compression vs CLI: - CLI: single AIAgent persists across conversation, uses real API-reported prompt_tokens for compression decisions — accurate - Gateway: each message creates fresh AIAgent, token count discarded after, next message pre-check falls back to rough str(msg)//4 estimate which overestimates 30-50% on tool-heavy conversations Fix: - Add last_prompt_tokens field to SessionEntry — stores the actual API-reported prompt token count from the most recent agent turn - After run_conversation(), extract context_compressor.last_prompt_tokens and persist it via update_session() - Gateway pre-check now uses stored actual tokens when available (exact same accuracy as CLI), falling back to rough estimate with 1.4x safety factor only for the first message of a session This makes gateway compression behave identically to CLI compression for all turns after the first. Reported by TigerHix.	2026-03-10 23:28:23 -07:00
Teknium	a35c37a2f9	Merge pull request #891 from NousResearch/hermes/hermes-b0162f8d fix: sort Nous Portal model list (opus first, sonnet lower)	2026-03-10 23:21:01 -07:00
teknium1	1518734e59	fix: sort Nous Portal model list (opus first, sonnet lower) fetch_nous_models() returned models in whatever order the API gave them, which put sonnet near the top. Add a priority sort so users see the best models first: opus > pro > other > sonnet.	2026-03-10 23:20:46 -07:00
teknium1	67b9470207	fix: reduce premature gateway compression on tool-heavy sessions The gateway's session hygiene pre-check uses a rough char-based token estimate (total_chars / 4) to decide whether to compress before the agent starts. This significantly overestimates for tool-heavy and code-heavy conversations because: 1. str(msg) on dicts includes Python repr overhead (keys, brackets, etc.) 2. Code/JSON tokenizes at 5-7+ chars/token, not the assumed 4 This caused users with 200k context to see compression trigger at ~100-113k actual tokens instead of the expected 170k (85% threshold). Reported by TigerHix on Twitter. Fix: apply a 1.4x safety factor to the gateway pre-check threshold. This pre-check is only meant to catch pathologically large transcripts — the agent's own compression uses actual API-reported token counts for precise threshold management.	2026-03-10 23:16:49 -07:00
teknium1	586fe5d62d	Merge PR #724: feat: --yolo flag to bypass all approval prompts Authored by dmahan93. Adds HERMES_YOLO_MODE env var and --yolo CLI flag to auto-approve all dangerous command prompts. Post-merge: renamed --fuck-it-ship-it to --yolo for brevity, resolved conflict with --checkpoints flag.	2026-03-10 20:56:30 -07:00
teknium1	2d80ef7872	fix: _init_agent returns bool, not agent — fix quiet mode crash	2026-03-10 20:49:03 -07:00
Teknium	b76cae94d4	Merge pull request #889 from NousResearch/hermes/hermes-b0162f8d fix: Docker backend fails when docker is not in PATH (macOS gateway)	2026-03-10 20:45:34 -07:00
teknium1	23270d41b9	feat: add --quiet/-Q flag for programmatic single-query mode Adds -Q/--quiet to `hermes chat` for use by external orchestrators (Paperclip, scripts, CI). When combined with -q, suppresses: - Banner and ASCII art - Spinner animations - Tool preview lines (┊ prefix) Only outputs: - The agent's final response text - A parseable 'session_id: <id>' line for session resumption Usage: hermes chat -q 'Do something' -Q Used by: Paperclip adapter (@nousresearch/paperclip-adapter-hermes)	2026-03-10 20:45:28 -07:00
teknium1	24479625a2	fix: Docker backend fails when docker is not in PATH (macOS gateway) On macOS, Docker Desktop installs the CLI to /usr/local/bin/docker, but when Hermes runs as a gateway service (launchd) or in other non-login contexts, /usr/local/bin is often not in PATH. This causes the Docker requirements check to fail with 'No such file or directory: docker' even though docker works fine from the user's terminal. Add find_docker() helper that uses shutil.which() first, then probes common Docker Desktop install paths on macOS (/usr/local/bin, /opt/homebrew/bin, Docker.app bundle). The resolved path is cached and passed to mini-swe-agent via its 'executable' parameter. - tools/environments/docker.py: add find_docker(), use it in _storage_opt_supported() and pass to _Docker(executable=...) - tools/terminal_tool.py: use find_docker() in requirements check - tests/tools/test_docker_find.py: 4 tests (PATH, fallback, not found, cache) 2877 tests pass.	2026-03-10 20:45:13 -07:00
vilkasdev	d502952bac	fix(cli): add loading indicators for slow slash commands Shows an immediate status message and braille spinner for slow slash commands (/skills search|browse|inspect|install, /reload-mcp). Makes input read-only while the command runs so the CLI doesn't appear frozen. Cherry-picked from PR #714 by vilkasdev, rebased onto current main with conflict resolution and bug fix (get_hint_text duplicate return). Fixes #636 Co-authored-by: vilkasdev <vilkasdev@users.noreply.github.com>	2026-03-10 17:31:00 -07:00
Teknium	ac53bf1d71	Merge pull request #881 from NousResearch/hermes/hermes-b0162f8d fix: provider selection not persisting when switching via hermes model	2026-03-10 17:13:26 -07:00
teknium1	145c57fc01	fix: provider selection not persisting when switching via hermes model Two related bugs prevented users from reliably switching providers: 1. OPENAI_BASE_URL poisoning OpenRouter resolution: When a user with a custom endpoint ran /model openrouter:model, _resolve_openrouter_runtime picked up OPENAI_BASE_URL instead of the OpenRouter URL, causing model validation to probe the wrong API and reject valid models. Fix: skip OPENAI_BASE_URL when requested_provider is explicitly 'openrouter'. 2. Provider never saved to config: _save_model_choice() could save config.model as a plain string. All five _model_flow_* functions then checked isinstance(model, dict) before writing the provider — which silently failed on strings. With no provider in config, auto-detection would pick up stale credentials (e.g. Codex desktop app) instead of the user's explicit choice. Fix: _save_model_choice() now always saves as dict format. All flow functions also normalize string->dict as a safety net before writing provider. Adds 4 regression tests. 2873 tests pass.	2026-03-10 17:12:34 -07:00
teknium1	2dddfce08c	fix: log prefill parse errors + clean up cron scheduler tests Follow-up to PR #716 (0xbyt4): - Log the third remaining silent except-pass in scheduler (prefill messages JSON parse failure) - Fix test mock: run → run_conversation (matches actual agent API) - Remove unused imports (asyncio, AsyncMock) - Add test for prefill_messages parse failure logging	2026-03-10 17:10:01 -07:00
teknium1	03a4f184e6	fix: call _stop_training_run on early-return failure paths The 4 early-return paths in _spawn_training_run (API exit, trainer exit, env not found, env exit) were doing manual process.terminate() or returning without cleanup, leaking open log file handles. Now all paths call _stop_training_run() which handles both process termination and file handle closure. Also adds 12 tests for _stop_training_run covering file handle cleanup, process termination, status transitions, and edge cases. Inspired by PR #715 (0xbyt4) which identified the early-return issue. Core file handle fix was already on main via `e28dc13` (memosr.eth).	2026-03-10 17:09:51 -07:00
teknium1	be2e259596	Merge PR #716: fix: log exceptions instead of silently swallowing in cron scheduler Authored by 0xbyt4. Replaces two except-Exception-pass blocks with logger.warning() calls and adds tests for both paths.	2026-03-10 17:05:59 -07:00
teknium1	05bc8b19fe	Merge PR #713: docs: clarify Telegram token regex constraint Authored by VolodymyrBg.	2026-03-10 16:59:54 -07:00
teknium1	cb6b70bbfb	Merge PR #709: fix: close log file handles to prevent resource leaks Authored by memosr. Fixes bare open() calls in browser_tool.py and unclosed log file handles in rl_training_tool.py.	2026-03-10 16:26:29 -07:00
teknium1	a458b535c9	fix: improve read-loop detection — consecutive-only, correct thresholds, fix bugs Follow-up to PR #705 (merged from 0xbyt4). Addresses several issues: 1. CONSECUTIVE-ONLY TRACKING: Redesigned the read/search tracker to only warn/block on truly consecutive identical calls. Any other tool call in between (write, patch, terminal, etc.) resets the counter via notify_other_tool_call(), called from handle_function_call() in model_tools.py. This prevents false blocks in read→edit→verify flows. 2. THRESHOLD ADJUSTMENT: Warn on 3rd consecutive (was 2nd), block on 4th+ consecutive (was 3rd+). Gives the model more room before intervening. 3. TUPLE UNPACKING BUG: Fixed get_read_files_summary() which crashed on search keys (5-tuple) when trying to unpack as 3-tuple. Now uses a separate read_history set that only tracks file reads. 4. WEB_EXTRACT DOCSTRING: Reverted incorrect removal of 'title' from web_extract return docs in code_execution_tool.py — the field IS returned by web_tools.py. 5. TESTS: Rewrote test_read_loop_detection.py (35 tests) to cover consecutive-only behavior, notify_other_tool_call, interleaved read/search, and summary-unaffected-by-searches.	2026-03-10 16:25:41 -07:00
teknium1	b53d5dad67	Merge PR #705: fix: detect, warn, and block file re-read/search loops after context compression Authored by 0xbyt4. Adds read/search loop detection, file history injection after compression, and todo filtering for active items only.	2026-03-10 16:17:03 -07:00
teknium1	ad7a16dca6	fix: remove left/right borders from response box for easier copy-paste Use rich_box.HORIZONTALS instead of the default ROUNDED box style for the agent response panel. This keeps the top/bottom horizontal rules (with title) but removes the vertical │ borders on left and right, making it much easier to copy-paste response text from the terminal.	2026-03-10 15:59:08 -07:00
teknium1	6e851a1f6a	Merge PR #873: fix: eliminate 3x SQLite message duplication in gateway sessions Fixes #860.	2026-03-10 15:29:24 -07:00
teknium1	c1171fe666	fix: eliminate 3x SQLite message duplication in gateway sessions (#860) Three separate code paths all wrote to the same SQLite state.db with no deduplication, inflating session transcripts by 3-4x: 1. _log_msg_to_db() — wrote each message individually after append 2. _flush_messages_to_session_db() — re-wrote ALL new messages at every _persist_session() call (~18 exit points), with no tracking of what was already written 3. gateway append_to_transcript() — wrote everything a third time after the agent returned Since load_transcript() prefers SQLite over JSONL, the inflated data was loaded on every session resume, causing proportional token waste. Fix: - Remove _log_msg_to_db() and all 16 call sites (redundant with flush) - Add _last_flushed_db_idx tracking in _flush_messages_to_session_db() so repeated _persist_session() calls only write truly new messages - Reset flush cursor on compression (new session ID) - Add skip_db parameter to SessionStore.append_to_transcript() so the gateway skips SQLite writes when the agent already persisted them - Gateway now passes skip_db=True for agent-managed messages, still writes to JSONL as backup Verified: a 12-message CLI session with tool calls produces exactly 12 SQLite rows with zero duplicates (previously would be 36-48). Tests: 9 new tests covering flush deduplication, skip_db behavior, compression reset, and initialization. Full suite passes (2869 tests).	2026-03-10 15:22:44 -07:00
teknium1	2210068f5b	Merge: fix(signal) align send() signature with base class	2026-03-10 15:18:31 -07:00
teknium1	d6ab35c1a3	fix(signal): align send() signature with base class (content, reply_to, metadata) Signal's send() used 'text' instead of 'content' and 'reply_to_message_id' instead of 'reply_to', mismatching BasePlatformAdapter.send(). Callers in gateway/run.py use keyword args matching the base interface, so Signal's send() was missing its required 'text' positional arg. Fixes: 'SignalAdapter.send() missing 1 required positional argument: text'	2026-03-10 15:18:26 -07:00
teknium1	5fc751e543	Merge: fix(gateway) add metadata param to _keep_typing and base send_typing	2026-03-10 15:08:45 -07:00
teknium1	cea78c5e27	fix(gateway): add metadata param to _keep_typing and base send_typing _keep_typing() was called with metadata= for thread-aware typing indicators, but neither it nor the base send_typing() accepted that parameter. Most adapter overrides (Slack, Discord, Telegram, WhatsApp, HA) already accept metadata=None, but the base class and Signal adapter did not. - Add metadata=None to BasePlatformAdapter.send_typing() - Add metadata=None to BasePlatformAdapter._keep_typing(), pass through - Add metadata=None to SignalAdapter.send_typing() Fixes TypeError in _process_message_background for Signal.	2026-03-10 15:08:40 -07:00
teknium1	53be6afe92	Merge PR #871: fix(signal): use media_urls/media_types in MessageEvent construction	2026-03-10 15:00:08 -07:00
teknium1	d04b9f4dc5	fix(signal): use media_urls/media_types instead of non-existent image_paths/audio_path/document_paths The Signal adapter was passing image_paths, audio_path, and document_paths to MessageEvent.__init__(), but those fields don't exist on the dataclass. MessageEvent uses media_urls (List[str]) and media_types (List[str]). Changes: - Replace separate image_paths/audio_path/document_paths with unified media_urls and media_types lists (matching Discord, Slack, etc.) - Add _ext_to_mime() helper to map file extensions to MIME types - Use Signal's contentType from attachment metadata when available, falling back to extension-based mapping - Update message type detection to check media_types prefixes Fixes TypeError: MessageEvent.__init__() got an unexpected keyword argument 'image_paths'	2026-03-10 14:58:16 -07:00
SHL0MS	149516f365	Merge pull request #854 from NousResearch/add-ascii-video-skill Add ASCII video skill to creative category	2026-03-10 16:34:57 -04:00
SHL0MS	0229e6b407	Fix test_analysis_error_logs_exc_info: mock _aux_async_client so download path is reached	2026-03-10 16:03:19 -04:00
SHL0MS	c358af7861	Add ASCII video skill to creative category	2026-03-10 15:54:38 -04:00
teknium1	8eefbef91c	fix: replace ANSI response box with Rich Panel + reduce widget flashing Major UX improvements: 1. Response box now uses a Rich Panel rendered through ChatConsole instead of hand-rolled ANSI box-drawing borders. Rich Panels adapt to terminal width at render time, wrap content inside the borders properly, and use skin colors natively. 2. ChatConsole now reads terminal width at render time via shutil.get_terminal_size() instead of defaulting to 80 cols. All Rich output adapts to the current terminal size. 3. User-input separator reduced to fixed 40-char width so it never wraps regardless of terminal resize. 4. Approval and clarify countdown repaints throttled to every 5s (was 1s), dramatically reducing flicker in Kitty/ghostty. Selection changes still trigger instant repaints via key bindings. 5. Sudo widget now uses dynamic _panel_box_width() instead of hardcoded border strings. Tests: 2860 passed.	2026-03-10 07:04:02 -07:00
teknium1	e590caf8d8	Revert "Merge PR #702: feat: configurable embedding infrastructure — local (fastembed) + API (OpenAI)" This reverts commit `46b95ee694`, reversing changes made to `0fdeffe6c4`.	2026-03-10 07:00:54 -07:00
teknium1	46b95ee694	Merge PR #702: feat: configurable embedding infrastructure — local (fastembed) + API (OpenAI) Authored by teyrebaz33. Adds agent/embeddings.py with Embedder protocol, FastEmbedEmbedder (local, 384d), OpenAIEmbedder (API, 1536d), factory, and cosine similarity utilities. 30 tests. Optional fastembed dependency. Infrastructure for #509 (cognitive memory) and #489 (semantic search). Closes #675.	2026-03-10 06:59:22 -07:00
teknium1	0fdeffe6c4	fix: replace silent exception swallowing with debug logging across tools Add logger.debug() calls to 27 bare 'except: pass' blocks across 7 core files, giving visibility into errors that were previously silently swallowed. This makes it much easier to diagnose user-reported issues from debug logs. Files changed: - tools/terminal_tool.py: 5 catches (stat, termios, fd close, cleanup) - tools/delegate_tool.py: 7 catches + added logger (spinner, callbacks) - tools/browser_tool.py: 5 catches (screenshot/recording cleanup, daemon kill) - tools/code_execution_tool.py: 2 remaining catches (socket, server close) - gateway/session.py: 2 catches (platform enum parse, temp file cleanup) - agent/display.py: 2 catches + added logger (JSON parse in failure detect) - agent/prompt_builder.py: 1 catch (skill description read) Deliberately kept bare pass for: - ImportError checks for optional dependencies (terminal_tool.py) - SystemExit/KeyboardInterrupt handlers - Spinner _write catch (would spam on every frame when stdout closed) - process_registry PID-alive check (canonical os.kill(pid,0) pattern) Extends the pattern from PR #686 (@aydnOktay).	2026-03-10 06:59:20 -07:00
teyrebaz33	cc4ead999a	feat: configurable embedding infrastructure — local (fastembed) + API (OpenAI) (#675) - Add agent/embeddings.py with Embedder protocol, FastEmbedEmbedder, OpenAIEmbedder - Factory function get_embedder() reads provider from config.yaml embeddings section - Lazy initialization — no startup impact, model loaded on first embed call - cosine_similarity() and cosine_similarity_matrix() utility functions included - Add fastembed as optional dependency in pyproject.toml - 30 unit tests, all passing Closes #675	2026-03-10 06:56:18 -07:00
teknium1	60cba55d82	Merge PR #701: fix: tool call repair — auto-lowercase, fuzzy match, helpful error on unknown tool Authored by teyrebaz33. Adds _repair_tool_call() method: tries lowercase, normalize (hyphens/spaces → underscores), then fuzzy match (difflib, 0.7 cutoff). Replaces hard abort after 3 retries with graceful error message sent back to model for self-correction. Fixed bug where valid tool calls in a mixed batch would get no results (now all get results). Fixes #520.	2026-03-10 06:54:17 -07:00
teyrebaz33	1caee06b22	fix: tool call repair — auto-lowercase, fuzzy match, helpful error on unknown tool (#520) - Add _repair_tool_call(): tries lowercase, normalize, then fuzzy match (difflib 0.7) - Replace 3-retry-then-abort with graceful error: model receives helpful message and self-corrects - Conversation stays alive instead of dying on hallucinated tool names Closes #520	2026-03-10 06:54:11 -07:00
teknium1	a6eaf0f41f	Merge PR #700: fix(config): atomic write for config.yaml to prevent data loss on crash Authored by alireza78a. Adds atomic_yaml_write() to utils.py (mirrors existing atomic_json_write pattern), replaces bare open('w') in save_config(). Integrated with max_turns normalization and commented sections via extra_content param. 3 new tests for crash safety.	2026-03-10 06:48:43 -07:00
alireza78a	fadad820dd	fix(config): atomic write for config.yaml to prevent data loss on crash	2026-03-10 06:48:37 -07:00
teknium1	e8b19b5826	fix: cap user-input separator at 120 cols (matches response box)	2026-03-10 06:47:26 -07:00
teknium1	9ea2209a43	fix: reduce approval/clarify widget flashing + dynamic border widths Three UI improvements: 1. Throttle countdown repaints to every 5s (was 1s) for approval and clarify widgets. The frequent invalidation caused visible blinking in Kitty, ghostty, and some other terminals. Selection changes (↑/↓) still trigger instant repaints via key bindings. 2. Make sudo widget use dynamic _panel_box_width() instead of hardcoded border strings — adapts to terminal width on resize. 3. Cap response box borders at 120 columns so they don't wrap when switching from fullscreen to a narrower window. Tests: 2857 passed.	2026-03-10 06:44:13 -07:00
teknium1	87af622df4	Merge PR #686: improve error handling and logging in code execution tool Authored by @aydnOktay. Adds exc_info=True to exception logging, replaces silent pass statements with logger.debug calls, fixes variable shadowing in _kill_process_group nested except blocks.	2026-03-10 06:43:11 -07:00
teknium1	2c21c4b897	Merge PR #698: fix(security): pipe sudo password via stdin instead of shell cmdline Authored by johnh4098. Fixes CWE-214: SUDO_PASSWORD was visible in /proc/PID/cmdline via echo pipe. Now passed through subprocess stdin. All 6 backends updated: local, ssh, docker, singularity pipe via stdin; modal and daytona use printf fallback (remote sandbox, documented).	2026-03-10 06:38:44 -07:00
teknium1	771969f747	fix: wire up enabled_tools in agent loop + simplify sandbox tool selection Completes the fix started in `8318a51` — handle_function_call() accepted enabled_tools but run_agent.py never passed it. Now both call sites in _execute_tool_calls() pass self.valid_tool_names, so each agent session uses its own tool list instead of the process-global _last_resolved_tool_names (which subagents can overwrite). Also simplifies the redundant ternary in code_execution_tool.py: sandbox_tools is already computed correctly (intersection with session tools, or full SANDBOX_ALLOWED_TOOLS as fallback), so the conditional was dead logic. Inspired by PR #663 (JasonOA888). Closes #662. Tests: 2857 passed.	2026-03-10 06:35:28 -07:00
johnh4098	e9742e202f	fix(security): pipe sudo password via stdin instead of shell cmdline	2026-03-10 06:34:59 -07:00
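
Commits 60cba55d82 and 1caee06b22 describe a tool call repair that tries lowercase, then normalization (hyphens/spaces to underscores), then a difflib fuzzy match with a 0.7 cutoff. The following is a minimal sketch of that sequence only. The function name repair_tool_name, its signature, and the sample tool names are hypothetical; this is not the project's _repair_tool_call() implementation.

```python
import difflib
from typing import Iterable, Optional


def repair_tool_name(
    name: str, valid_tools: Iterable[str], cutoff: float = 0.7
) -> Optional[str]:
    """Hypothetical helper: map a possibly malformed tool name to a valid one.

    Returns the repaired name, or None when nothing is close enough.
    """
    valid = list(valid_tools)
    if name in valid:
        return name

    # Step 1: lowercase.
    lowered = name.lower()
    if lowered in valid:
        return lowered

    # Step 2: normalize hyphens and spaces to underscores.
    normalized = lowered.replace("-", "_").replace(" ", "_")
    if normalized in valid:
        return normalized

    # Step 3: fuzzy match with difflib; cutoff 0.7 as stated in the commit.
    matches = difflib.get_close_matches(normalized, valid, n=1, cutoff=cutoff)
    return matches[0] if matches else None
```

When the helper returns None, a caller could send an error message listing the valid tool names back to the model rather than aborting, which is the behavior the commit message describes.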


1265 Commits
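
Commit 909e048ad4 mentions two compatibility details for last_prompt_tokens: old session files default to 0 via data.get('last_prompt_tokens', 0), and update_session uses a None sentinel so callers can explicitly reset to 0 without normal calls clobbering the stored value. A rough sketch of those semantics follows. The class layout, the from_dict method, and the update_session signature are assumptions made for illustration, not the repository's code.

```python
from dataclasses import dataclass
from typing import Any, Dict, Optional


@dataclass
class SessionEntry:
    # Assumed minimal shape; the real SessionEntry has more fields.
    session_id: str
    last_prompt_tokens: int = 0

    @classmethod
    def from_dict(cls, data: Dict[str, Any]) -> "SessionEntry":
        # Old session files lack the key, so fall back to 0.
        return cls(
            session_id=data["session_id"],
            last_prompt_tokens=data.get("last_prompt_tokens", 0),
        )


def update_session(
    entry: SessionEntry, last_prompt_tokens: Optional[int] = None
) -> SessionEntry:
    # None means "no change"; 0 is a valid explicit reset, so the check
    # must be `is not None` rather than a truthiness test.
    if last_prompt_tokens is not None:
        entry.last_prompt_tokens = last_prompt_tokens
    return entry
```

Using 0 as the default instead of None would make an ordinary update_session(entry) call indistinguishable from a deliberate reset, which is the clobbering the commit message warns about.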