hermes-agent

Author	SHA1	Message	Date
teknium1	b8067ac27e	feat: add /background command to gateway and CLI commands registry Add /background <prompt> to the gateway, allowing users on Telegram, Discord, Slack, etc. to fire off a prompt in a separate agent session. The result is delivered back to the same chat when done, without modifying the active conversation history. Implementation: - _handle_background_command: validates input, spawns asyncio task - _run_background_task: creates AIAgent in executor thread, delivers result (text, images, media files) back via the platform adapter - Inherits model, toolsets, provider routing from gateway config - Error handling with user-visible failure messages Also adds /background to hermes_cli/commands.py registry so it appears in /help and autocomplete. Tests: 15 new tests covering usage, task creation, uniqueness, multi-platform, error paths, and help/autocomplete integration.	2026-03-11 02:46:31 -07:00
teknium1	bd2606a576	fix: initialize self.config in HermesCLI to fix AttributeError on slash commands HermesCLI.__init__ never assigned self.config, causing an AttributeError ('HermesCLI' object has no attribute 'config') whenever an unrecognized slash command fell through to the quick_commands check on line 2838. This affected skill slash commands like /x-thread-creation since the quick_commands lookup runs before the skill command check. Set self.config = CLI_CONFIG in __init__ to match the pattern used by the gateway (run.py:199).	2026-03-11 02:33:31 -07:00
teknium1	f5324f9aa5	fix: initialize self.config in HermesCLI to fix AttributeError on slash commands HermesCLI.__init__ never assigned self.config, causing an AttributeError ('HermesCLI object has no attribute config') whenever an unrecognized slash command fell through to the quick_commands check (line 2832). This broke skill slash commands like /x-thread-creation since the quick_commands lookup runs before the skill command check. Set self.config = CLI_CONFIG in __init__, matching the pattern used by the gateway (run.py:199).	2026-03-11 02:33:25 -07:00
SPANISH FLU	de2b881886	test(cron): cover topic thread delivery metadata	2026-03-11 09:22:32 +01:00
SPANISH FLU	0d6b25274c	fix(gateway): isolate telegram forum topic sessions	2026-03-11 09:15:34 +01:00
teknium1	fbfdde496b	docs: update AGENTS.md with new files and test count - Add hermes_cli/ files: skills_config, tools_config, skills_hub, models, auth - Add acp_adapter/ directory - Update test count: ~2500 → ~3000 (~3 min runtime)	2026-03-11 00:54:49 -07:00
Bartok Moltbot	ae1c11c5a5	fix(cli): resolve duplicate 'skills' subparser crash on Python 3.11+ Fixes #898 — Python 3.11 changed argparse to raise an exception on duplicate subparser names (CPython #94331). The 'skills' name was registered twice: once for Skills Hub and once for skills config. Changes: - Remove duplicate 'skills' subparser registration - Add 'config' as a sub-action under the existing 'hermes skills' command - Route 'hermes skills config' to skills_config module - Add regression test to catch future duplicates Migration: 'hermes skills' (config) is now 'hermes skills config'	2026-03-11 00:50:39 -07:00
Teknium	5abee4fb23	Merge pull request #769 from 0xbyt4/fix/codex-models-visibility-mismatch Minor defensive fix — accept both 'hide' and 'hidden' visibility values in codex model filtering.	2026-03-11 00:49:59 -07:00
teknium1	331af8df23	fix: clean up tools --summary output and type annotations - Use Optional[List[str]] instead of List[str] \| None (consistency) - Add header, per-platform counts, and checkmark list format - Matches the visual style of the interactive configurator	2026-03-11 00:47:26 -07:00
teknium1	3a2fd1a5c9	Merge PR #767 : feat: add --summary flag to hermes tools Authored by luisv-1. Adds hermes tools --summary for a quick non-interactive view of enabled tools per platform.	2026-03-11 00:46:32 -07:00
teknium1	2e1aa1b424	docs: add iteration budget pressure section to configuration guide Documents the two-tier budget warning system from PR #762: - Explains caution (70%) and warning (90%) thresholds - Table showing what the model sees at each tier - Notes on how injection preserves prompt caching - Links to max_turns config	2026-03-11 00:40:44 -07:00
teknium1	aead9c8ead	chore: remove unnecessary pragma comments from Telegram adapter Strip 18 '# pragma: no cover - defensive logging' annotations — these are real code paths, not worth excluding from coverage.	2026-03-11 00:37:45 -07:00
teknium1	93230af7bd	Merge PR #763 : improve Telegram gateway error handling and logging Authored by aydnOktay. Replaces print() statements with structured logging calls (error/warning/info/debug) throughout the Telegram adapter. Adds exc_info=True for stack traces on failures.	2026-03-11 00:37:28 -07:00
teknium1	21ff0d39ad	feat: iteration budget pressure via tool result injection Two-tier warning system that nudges the LLM as it approaches max_iterations, injected into the last tool result JSON rather than as a separate system message: - Caution (70%): {"_budget_warning": "[BUDGET: 42/60...]"} - Warning (90%): {"_budget_warning": "[BUDGET WARNING: 54/60...]"} For JSON tool results, adds a _budget_warning field to the existing dict. For plain text results, appends the warning as text. Key properties: - No system messages injected mid-conversation - No changes to message structure - Prompt cache stays valid - Configurable thresholds (0.7 / 0.9) - Can be disabled: _budget_pressure_enabled = False Inspired by PR #421 (@Bartok9) and issue #414. 8 tests covering thresholds, edge cases, JSON and text injection.	2026-03-11 00:37:24 -07:00
teknium1	4b619c9672	Merge PR #761 : Improve Discord gateway error handling and logging Authored by aydnOktay. Replaces bare print statements with structured logger calls (error/warning/info) and adds exc_info=True for stack traces on failure paths.	2026-03-11 00:35:31 -07:00
teknium1	c5321298ce	docs: add quick commands documentation Documents the quick_commands config feature from PR #746: - configuration.md: full section with examples (server status, disk, gpu, update), behavior notes (timeout, priority, works everywhere) - cli.md: brief section with config example + link to config guide	2026-03-11 00:28:52 -07:00
teknium1	359352b947	Merge PR #755 : fix: head+tail truncation for execute_code stdout Replaces head-only stdout capture with 40/60 head/tail split so final print() output is never lost. 3 new tests.	2026-03-11 00:26:26 -07:00
teknium1	a9241f3e3e	fix: head+tail truncation for execute_code stdout Replaces head-only stdout capture with a two-buffer approach (40% head, 60% tail rolling window) so scripts that print() their final results at the end never lose them. Adds truncation notice between sections. Cherry-picked from PR #755, conflict resolved (test file additions). 3 new tests for short output, head+tail preservation, and notice format.	2026-03-11 00:26:13 -07:00
teknium1	ea0a263434	Merge PR #758 : feat(discord): add DISCORD_ALLOW_BOTS config for bot message filtering Adds configurable bot message filtering via DISCORD_ALLOW_BOTS env var: - 'none' (default): ignore all bot messages - 'mentions': accept bots only when they @mention us - 'all': accept all bot messages Includes 8 tests.	2026-03-11 00:25:51 -07:00
teknium1	3be6e8a5f2	Merge PR #746 : feat(cli,gateway): add user-defined quick commands that bypass agent loop Authored by teyrebaz33. Adds config-driven quick commands that execute shell commands without invoking the LLM — zero token usage, works from Telegram/Discord/Slack/etc. Closes #744.	2026-03-11 00:24:34 -07:00
teknium1	2b244762e1	feat: add missing commands to categorized /help Post-merge follow-up to PR #752 — adds 10 commands that were added since the PR was submitted: Session: /title, /compress, /rollback Configuration: /provider, /verbose, /skin Tools & Skills: /reload-mcp (+ full /skills description) Info: /usage, /insights, /paste Also preserved existing color formatting (_cprint, _GOLD, _BOLD, _DIM) and skill commands section from main.	2026-03-10 23:49:03 -07:00
teknium1	a169a656b4	Merge PR #743 : feat: hermes skills — enable/disable individual skills and categories Authored by teyrebaz33. Fixes #642.	2026-03-10 23:46:42 -07:00
teknium1	a9fdd8dc3c	Merge PR #752 : feat(ux): improve /help formatting with command categories Authored by Bartok9. Organizes /help output into categories (Session, Configuration, Tools & Skills, Info, Exit) for better readability. Fixes #640.	2026-03-10 23:45:41 -07:00
Bartok Moltbot	8eb9eed074	feat(ux): improve /help formatting with command categories (#640 ) - Organize COMMANDS into COMMANDS_BY_CATEGORY dict - Group commands: Session, Configuration, Tools & Skills, Info, Exit - Add visual category headers with spacing - Maintain backwards compat via flat COMMANDS dict - Better visual hierarchy and scannability Before: /help - Show this help message /tools - List available tools ... (dense list) After: ── Session ── /new Start a new conversation /reset Reset conversation only ... ── Configuration ── /config Show current configuration ... Closes #640	2026-03-10 23:45:36 -07:00
teknium1	909e048ad4	fix: integration hardening for gateway token tracking Follow-up to `58dbd81` — ensures smooth transition for existing users: - Backward compat: old session files without last_prompt_tokens default to 0 via data.get('last_prompt_tokens', 0) - /compress, /undo, /retry: reset last_prompt_tokens to 0 after rewriting transcripts (stale token counts would under-report) - Auto-compression hygiene: reset last_prompt_tokens after rewriting - update_session: use None sentinel (not 0) as default so callers can explicitly reset to 0 while normal calls don't clobber - 6 new tests covering: default value, serialization roundtrip, old-format migration, set/reset/no-change semantics - /reset: new SessionEntry naturally gets last_prompt_tokens=0 2942 tests pass.	2026-03-10 23:40:24 -07:00
teyrebaz33	5eb62ef423	test(gateway): add regression test for /retry response fix Adds two tests for _handle_retry_command: verifies /retry returns the agent response (not None), and verifies graceful handling when no previous message exists. Cherry-picked from PR #731 by teyrebaz33. Regression coverage for the fix merged in PR #441. Co-authored-by: teyrebaz33 <teyrebaz33@users.noreply.github.com>	2026-03-10 23:34:52 -07:00
teknium1	58dbd81f03	fix: use actual API token counts for gateway compression pre-check Root cause of aggressive gateway compression vs CLI: - CLI: single AIAgent persists across conversation, uses real API-reported prompt_tokens for compression decisions — accurate - Gateway: each message creates fresh AIAgent, token count discarded after, next message pre-check falls back to rough str(msg)//4 estimate which overestimates 30-50% on tool-heavy conversations Fix: - Add last_prompt_tokens field to SessionEntry — stores the actual API-reported prompt token count from the most recent agent turn - After run_conversation(), extract context_compressor.last_prompt_tokens and persist it via update_session() - Gateway pre-check now uses stored actual tokens when available (exact same accuracy as CLI), falling back to rough estimate with 1.4x safety factor only for the first message of a session This makes gateway compression behave identically to CLI compression for all turns after the first. Reported by TigerHix.	2026-03-10 23:28:23 -07:00
Teknium	a35c37a2f9	Merge pull request #891 from NousResearch/hermes/hermes-b0162f8d fix: sort Nous Portal model list (opus first, sonnet lower)	2026-03-10 23:21:01 -07:00
teknium1	1518734e59	fix: sort Nous Portal model list (opus first, sonnet lower) fetch_nous_models() returned models in whatever order the API gave them, which put sonnet near the top. Add a priority sort so users see the best models first: opus > pro > other > sonnet.	2026-03-10 23:20:46 -07:00
teknium1	67b9470207	fix: reduce premature gateway compression on tool-heavy sessions The gateway's session hygiene pre-check uses a rough char-based token estimate (total_chars / 4) to decide whether to compress before the agent starts. This significantly overestimates for tool-heavy and code-heavy conversations because: 1. str(msg) on dicts includes Python repr overhead (keys, brackets, etc.) 2. Code/JSON tokenizes at 5-7+ chars/token, not the assumed 4 This caused users with 200k context to see compression trigger at ~100-113k actual tokens instead of the expected 170k (85% threshold). Reported by TigerHix on Twitter. Fix: apply a 1.4x safety factor to the gateway pre-check threshold. This pre-check is only meant to catch pathologically large transcripts — the agent's own compression uses actual API-reported token counts for precise threshold management.	2026-03-10 23:16:49 -07:00
teknium1	586fe5d62d	Merge PR #724 : feat: --yolo flag to bypass all approval prompts Authored by dmahan93. Adds HERMES_YOLO_MODE env var and --yolo CLI flag to auto-approve all dangerous command prompts. Post-merge: renamed --fuck-it-ship-it to --yolo for brevity, resolved conflict with --checkpoints flag.	2026-03-10 20:56:30 -07:00
teknium1	2d80ef7872	fix: _init_agent returns bool, not agent — fix quiet mode crash	2026-03-10 20:49:03 -07:00
Teknium	b76cae94d4	Merge pull request #889 from NousResearch/hermes/hermes-b0162f8d fix: Docker backend fails when docker is not in PATH (macOS gateway)	2026-03-10 20:45:34 -07:00
teknium1	23270d41b9	feat: add --quiet/-Q flag for programmatic single-query mode Adds -Q/--quiet to `hermes chat` for use by external orchestrators (Paperclip, scripts, CI). When combined with -q, suppresses: - Banner and ASCII art - Spinner animations - Tool preview lines (┊ prefix) Only outputs: - The agent's final response text - A parseable 'session_id: <id>' line for session resumption Usage: hermes chat -q 'Do something' -Q Used by: Paperclip adapter (@nousresearch/paperclip-adapter-hermes)	2026-03-10 20:45:28 -07:00
teknium1	24479625a2	fix: Docker backend fails when docker is not in PATH (macOS gateway) On macOS, Docker Desktop installs the CLI to /usr/local/bin/docker, but when Hermes runs as a gateway service (launchd) or in other non-login contexts, /usr/local/bin is often not in PATH. This causes the Docker requirements check to fail with 'No such file or directory: docker' even though docker works fine from the user's terminal. Add find_docker() helper that uses shutil.which() first, then probes common Docker Desktop install paths on macOS (/usr/local/bin, /opt/homebrew/bin, Docker.app bundle). The resolved path is cached and passed to mini-swe-agent via its 'executable' parameter. - tools/environments/docker.py: add find_docker(), use it in _storage_opt_supported() and pass to _Docker(executable=...) - tools/terminal_tool.py: use find_docker() in requirements check - tests/tools/test_docker_find.py: 4 tests (PATH, fallback, not found, cache) 2877 tests pass.	2026-03-10 20:45:13 -07:00
arceus77-7	d41a214c1a	feat(skills): add official optional 1password skill	2026-03-10 20:45:29 -04:00
vilkasdev	d502952bac	fix(cli): add loading indicators for slow slash commands Shows an immediate status message and braille spinner for slow slash commands (/skills search\|browse\|inspect\|install, /reload-mcp). Makes input read-only while the command runs so the CLI doesn't appear frozen. Cherry-picked from PR #714 by vilkasdev, rebased onto current main with conflict resolution and bug fix (get_hint_text duplicate return). Fixes #636 Co-authored-by: vilkasdev <vilkasdev@users.noreply.github.com>	2026-03-10 17:31:00 -07:00
Teknium	ac53bf1d71	Merge pull request #881 from NousResearch/hermes/hermes-b0162f8d fix: provider selection not persisting when switching via hermes model	2026-03-10 17:13:26 -07:00
teknium1	145c57fc01	fix: provider selection not persisting when switching via hermes model Two related bugs prevented users from reliably switching providers: 1. OPENAI_BASE_URL poisoning OpenRouter resolution: When a user with a custom endpoint ran /model openrouter:model, _resolve_openrouter_runtime picked up OPENAI_BASE_URL instead of the OpenRouter URL, causing model validation to probe the wrong API and reject valid models. Fix: skip OPENAI_BASE_URL when requested_provider is explicitly 'openrouter'. 2. Provider never saved to config: _save_model_choice() could save config.model as a plain string. All five _model_flow_* functions then checked isinstance(model, dict) before writing the provider — which silently failed on strings. With no provider in config, auto-detection would pick up stale credentials (e.g. Codex desktop app) instead of the user's explicit choice. Fix: _save_model_choice() now always saves as dict format. All flow functions also normalize string->dict as a safety net before writing provider. Adds 4 regression tests. 2873 tests pass.	2026-03-10 17:12:34 -07:00
teknium1	2dddfce08c	fix: log prefill parse errors + clean up cron scheduler tests Follow-up to PR #716 (0xbyt4): - Log the third remaining silent except-pass in scheduler (prefill messages JSON parse failure) - Fix test mock: run → run_conversation (matches actual agent API) - Remove unused imports (asyncio, AsyncMock) - Add test for prefill_messages parse failure logging	2026-03-10 17:10:01 -07:00
teknium1	03a4f184e6	fix: call _stop_training_run on early-return failure paths The 4 early-return paths in _spawn_training_run (API exit, trainer exit, env not found, env exit) were doing manual process.terminate() or returning without cleanup, leaking open log file handles. Now all paths call _stop_training_run() which handles both process termination and file handle closure. Also adds 12 tests for _stop_training_run covering file handle cleanup, process termination, status transitions, and edge cases. Inspired by PR #715 (0xbyt4) which identified the early-return issue. Core file handle fix was already on main via `e28dc13` (memosr.eth).	2026-03-10 17:09:51 -07:00
teknium1	be2e259596	Merge PR #716 : fix: log exceptions instead of silently swallowing in cron scheduler Authored by 0xbyt4. Replaces two except-Exception-pass blocks with logger.warning() calls and adds tests for both paths.	2026-03-10 17:05:59 -07:00
teknium1	05bc8b19fe	Merge PR #713 : docs: clarify Telegram token regex constraint Authored by VolodymyrBg.	2026-03-10 16:59:54 -07:00
teknium1	cb6b70bbfb	Merge PR #709 : fix: close log file handles to prevent resource leaks Authored by memosr. Fixes bare open() calls in browser_tool.py and unclosed log file handles in rl_training_tool.py.	2026-03-10 16:26:29 -07:00
teknium1	a458b535c9	fix: improve read-loop detection — consecutive-only, correct thresholds, fix bugs Follow-up to PR #705 (merged from 0xbyt4). Addresses several issues: 1. CONSECUTIVE-ONLY TRACKING: Redesigned the read/search tracker to only warn/block on truly consecutive identical calls. Any other tool call in between (write, patch, terminal, etc.) resets the counter via notify_other_tool_call(), called from handle_function_call() in model_tools.py. This prevents false blocks in read→edit→verify flows. 2. THRESHOLD ADJUSTMENT: Warn on 3rd consecutive (was 2nd), block on 4th+ consecutive (was 3rd+). Gives the model more room before intervening. 3. TUPLE UNPACKING BUG: Fixed get_read_files_summary() which crashed on search keys (5-tuple) when trying to unpack as 3-tuple. Now uses a separate read_history set that only tracks file reads. 4. WEB_EXTRACT DOCSTRING: Reverted incorrect removal of 'title' from web_extract return docs in code_execution_tool.py — the field IS returned by web_tools.py. 5. TESTS: Rewrote test_read_loop_detection.py (35 tests) to cover consecutive-only behavior, notify_other_tool_call, interleaved read/search, and summary-unaffected-by-searches.	2026-03-10 16:25:41 -07:00
teknium1	b53d5dad67	Merge PR #705 : fix: detect, warn, and block file re-read/search loops after context compression Authored by 0xbyt4. Adds read/search loop detection, file history injection after compression, and todo filtering for active items only.	2026-03-10 16:17:03 -07:00
teknium1	ad7a16dca6	fix: remove left/right borders from response box for easier copy-paste Use rich_box.HORIZONTALS instead of the default ROUNDED box style for the agent response panel. This keeps the top/bottom horizontal rules (with title) but removes the vertical │ borders on left and right, making it much easier to copy-paste response text from the terminal.	2026-03-10 15:59:08 -07:00
teknium1	6e851a1f6a	Merge PR #873 : fix: eliminate 3x SQLite message duplication in gateway sessions Fixes #860.	2026-03-10 15:29:24 -07:00
teknium1	c1171fe666	fix: eliminate 3x SQLite message duplication in gateway sessions (#860 ) Three separate code paths all wrote to the same SQLite state.db with no deduplication, inflating session transcripts by 3-4x: 1. _log_msg_to_db() — wrote each message individually after append 2. _flush_messages_to_session_db() — re-wrote ALL new messages at every _persist_session() call (~18 exit points), with no tracking of what was already written 3. gateway append_to_transcript() — wrote everything a third time after the agent returned Since load_transcript() prefers SQLite over JSONL, the inflated data was loaded on every session resume, causing proportional token waste. Fix: - Remove _log_msg_to_db() and all 16 call sites (redundant with flush) - Add _last_flushed_db_idx tracking in _flush_messages_to_session_db() so repeated _persist_session() calls only write truly new messages - Reset flush cursor on compression (new session ID) - Add skip_db parameter to SessionStore.append_to_transcript() so the gateway skips SQLite writes when the agent already persisted them - Gateway now passes skip_db=True for agent-managed messages, still writes to JSONL as backup Verified: a 12-message CLI session with tool calls produces exactly 12 SQLite rows with zero duplicates (previously would be 36-48). Tests: 9 new tests covering flush deduplication, skip_db behavior, compression reset, and initialization. Full suite passes (2869 tests).	2026-03-10 15:22:44 -07:00
teknium1	2210068f5b	Merge: fix(signal) align send() signature with base class	2026-03-10 15:18:31 -07:00

... 3 4 5 6 7 ...

1520 Commits