hermes-agent

Author	SHA1	Message	Date
Teknium	5a79e423fe	Merge branch 'main' into codex/align-codex-provider-conventions-mainrepo	2026-02-28 18:13:38 -08:00
teknium1	7f7643cf63	feat(hooks): introduce event hooks system for lifecycle management Add a new hooks system allowing users to run custom code at key lifecycle points in the agent's operation. This includes support for events such as `gateway:startup`, `session:start`, `agent:step`, and more. Documentation for creating hooks and available events has been added to `README.md` and a new `hooks.md` file. Additionally, integrate step callbacks in the agent to facilitate hook execution during tool-calling iterations.	2026-02-28 17:09:26 -08:00
Teknium	31a5cd185a	Merge pull request #174 from Bartok9/fix-think-block-leakage fix: strip <think> blocks from final response to users	2026-02-28 16:43:47 -08:00
teknium1	2205b22409	fix(headers): update X-OpenRouter-Categories to include 'productivity'	2026-02-28 10:38:49 -08:00
teknium1	6366177118	refactor: update context compression configuration to use config.yaml and improve model handling	2026-02-28 04:46:38 -08:00
Bartok9	1e463a8e39	fix: strip <think> blocks from final response to users Fixes #149 The _strip_think_blocks() method existed but was not applied to the final_response in the normal completion path. This caused <think>...</think> XML tags to leak into user-facing responses on all platforms (CLI, Telegram, Discord, Slack, WhatsApp). Changes: - Strip think blocks from final_response before returning in normal path (line ~2600) - Strip think blocks from fallback content when salvaging from prior tool_calls turn Notes: - The raw content with think blocks is preserved in messages[] for trajectory export - this only affects the user-facing final_response - The _has_content_after_think_block() check still uses raw content before stripping, which is correct for detecting think-only responses	2026-02-28 03:06:20 -05:00
Teknium	4a9086b848	Merge branch 'main' into feat/honcho-integration	2026-02-27 23:32:49 -08:00
teknium1	50cb4d5fc7	fix(agent): update error message for unsupported Anthropic API endpoints to clarify usage of OpenRouter	2026-02-27 23:23:31 -08:00
Teknium	2bc9508b7c	Merge pull request #173 from adavyas/fix/anthropic-base-url-guard fix(agent): fail fast on Anthropic native base URLs	2026-02-27 23:22:01 -08:00
teknium1	19f28a633a	fix(agent): enhance 413 error handling and improve conversation history management in tests	2026-02-27 23:04:32 -08:00
Teknium	2c817ce4a5	Merge pull request #153 from tekelala/main fix(agent): handle 413 payload-too-large via compression instead of aborting	2026-02-27 22:57:55 -08:00
adavyas	0c0a2eb0a2	fix(agent): fail fast on Anthropic native base URLs	2026-02-27 21:19:29 -08:00
teknium1	de0829cec3	fix(cli): increase max iterations for child agents and extend API call timeout for improved reliability	2026-02-27 17:35:29 -08:00
tekelala	79bd65034c	fix(agent): handle 413 payload-too-large via compression instead of aborting The 413 "Request Entity Too Large" error from the LLM API was caught by the generic 4xx handler which aborts immediately. This is wrong for 413 — it's a payload-size issue that can be resolved by compressing conversation history. - Intercept 413 before the generic 4xx block and route to _compress_context - Exclude 413 from generic is_client_error detection - Add 'request entity too large' to context-length phrases as safety net - Add tests for 413 compression behavior Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-27 12:21:27 -05:00
teknium1	c77f3da0ce	Cherry-pick 6 bug fixes from PR #76 and update documentation Code fixes (run_agent.py): - Fix off-by-one in _flush_messages_to_session_db skipping one message per flush - Add clear_interrupt() to 3 early-return paths preventing stale interrupt state - Wrap handle_function_call in try/except so tool crashes don't kill the conversation - Replace fragile `is` identity check with _flush_sentinel marker for memory flush cleanup - Fix retry loop off-by-one (6 attempts not 7) - Remove redundant inline `import re`	2026-02-27 03:21:49 -08:00
Bartok Moltbot	8aa531c7fa	fix(gateway): Pass session_db to AIAgent, fixing session_search error When running via the gateway (e.g. Telegram), the session_search tool returned: {"error": "session_search must be handled by the agent loop"} Root cause: - gateway/run.py creates AIAgent without passing session_db= - self._session_db is None in the agent instance - The dispatch condition "elif function_name == 'session_search' and self._session_db" skips when _session_db is None, falling through to the generic error This fix: 1. Initializes self._session_db in GatewayRunner.__init__() 2. Passes session_db to all AIAgent instantiations in gateway/run.py 3. Adds defensive fallback in run_agent.py to return a clear error when session_db is unavailable, instead of falling through Fixes #105	2026-02-27 00:32:17 -05:00
teknium1	58fce0a37b	feat(api): implement dynamic max tokens handling for various providers - Added _max_tokens_param method in AIAgent to return appropriate max tokens parameter based on the provider (OpenAI vs. others). - Updated API calls in AIAgent to utilize the new max tokens handling. - Introduced auxiliary_max_tokens_param function in auxiliary_client for consistent max tokens management across auxiliary clients. - Refactored multiple tools to use auxiliary_max_tokens_param for improved compatibility with different models and providers.	2026-02-26 20:23:56 -08:00
Erosika	70d1abf81b	refactor: run Honcho and USER.md in tandem USER.md stays in system prompt when Honcho is active -- prefetch is additive context, not a replacement. Memory tool user observations write to both USER.md (local) and Honcho (cross-session) simultaneously.	2026-02-26 18:07:33 -05:00
Erosika	1fd0fcddb2	feat: integrate Honcho with USER.md memory system When Honcho is active: - System prompt uses Honcho prefetch instead of USER.md - memory tool target=user add routes to Honcho - MEMORY.md untouched in all cases When disabled, everything works as before. Also wires up contextTokens config to cap prefetch size.	2026-02-26 18:07:17 -05:00
Erosika	ab4bbf2fb2	feat: add Honcho AI-native memory integration Opt-in persistent cross-session user modeling via Honcho. Reads ~/.honcho/config.json as single source of truth (shared with Claude Code, Cursor, and other Honcho-enabled tools). Zero impact when disabled or unconfigured. - honcho_integration/ package (client, session manager, peer resolution) - Host-based config resolution matching claude-honcho/cursor-honcho pattern - Prefetch user context into system prompt per conversation turn - Sync user/assistant messages to Honcho after each exchange - query_user_context tool for mid-conversation dialectic reasoning - Gated activation: requires ~/.honcho/config.json with enabled=true	2026-02-26 18:07:17 -05:00
George Pickett	32070e6bc0	Merge remote-tracking branch 'origin/main' into codex/align-codex-provider-conventions-mainrepo # Conflicts: # cron/scheduler.py # gateway/run.py # tools/delegate_tool.py	2026-02-26 10:56:29 -08:00
Dean Kerr	5a569eb1b6	fix: resolve .env and config paths from HERMES_HOME, not PROJECT_ROOT The `hermes` CLI entry point (hermes_cli/main.py) and the agent runner (run_agent.py) only loaded .env from the project installation directory. After the standard installer, code lives at ~/.hermes/hermes-agent/ but config lives at ~/.hermes/ — so the .env was never found. Aligns these entry points with the pattern already used by gateway/run.py and rl_cli.py: load ~/.hermes/.env first, fall back to project root .env for dev-mode compatibility. Also fixes: - status.py checking .env existence and API keys at PROJECT_ROOT - doctor.py KeyError on tool availability (missing_vars vs env_vars) - doctor.py checking logs/ and Skills Hub at PROJECT_ROOT instead of HERMES_HOME - doctor.py redundant logs/ check (already covered by subdirectory loop) - mini-swe-agent loading config from platformdirs default instead of ~/.hermes/ Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 16:49:14 +11:00
George Pickett	74c662b63a	Harden Codex auth refresh and responses compatibility	2026-02-25 19:27:54 -08:00
George Pickett	91bdb9eb2d	Fix Codex stream fallback for Responses completion gaps	2026-02-25 19:08:11 -08:00
George Pickett	47f16505d2	Omit optional function_call id in Responses replay input	2026-02-25 19:00:11 -08:00
George Pickett	e63986b534	Harden Codex stream handling and ack continuation	2026-02-25 18:56:06 -08:00
George Pickett	ce175d7372	Fix Codex Responses continuation and schema parity	2026-02-25 18:20:41 -08:00
George Pickett	609b19b630	Add OpenAI Codex provider runtime and responses integration (without .agent/PLANS.md)	2026-02-25 18:20:38 -08:00
teknium1	e3cb957a10	refactor: streamline reasoning configuration checks in AIAgent - Simplified the logic for determining support for reasoning based on the base URL by introducing clearer variable names. - Added product attribution for the Nous Portal to the extra body of requests when applicable, enhancing tagging for better tracking.	2026-02-25 16:49:41 -08:00
teknium1	9a858b8d67	add identifier for openrouter calls	2026-02-25 16:34:47 -08:00
teknium1	d72b9eadec	More fixes for windoze	2026-02-25 15:20:42 -08:00
teknium1	f64a87209d	refactor: enhance session content handling in AIAgent and update TTS output path - Introduced a new static method `_clean_session_content` in the `AIAgent` class to convert REASONING_SCRATCHPAD tags to <think> blocks and clean up whitespace in session logs. - Updated the `_save_session_log` method to utilize the cleaned content for assistant messages, ensuring consistency in session logs. - Changed the default output directory for TTS audio files from `~/voice-memos` to `~/.hermes/audio_cache`, reflecting a more appropriate storage location.	2026-02-25 04:22:03 -08:00
teknium1	41df8ee4f5	refactor: enhance interrupt handling in AIAgent class - Updated the `clear_interrupt` method to also reset the global tool interrupt signal, improving the clarity of interrupt management within the agent. - This change ensures that all interrupt states are properly cleared, enhancing the reliability of the agent's operation.	2026-02-25 03:45:47 -08:00
teknium1	681141a526	fix: ansi escapes causing broken terminal cli output	2026-02-24 03:42:12 -08:00
teknium1	e049441d93	feat: add reasoning effort configuration for agent - Introduced a new configuration option for reasoning effort in the CLI, allowing users to specify the level of reasoning the agent should perform before responding. - Updated the CLI and agent initialization to incorporate the reasoning configuration, enhancing the agent's responsiveness and adaptability. - Implemented logic to load reasoning effort from environment variables and configuration files, providing flexibility in agent behavior. - Enhanced the documentation in the example configuration file to clarify the new reasoning effort options available.	2026-02-24 03:30:19 -08:00
teknium1	2bf96ad244	feat: add ephemeral prefill messages and system prompt loading - Implemented functionality to load ephemeral prefill messages from a JSON file, enhancing few-shot priming capabilities for the agent. - Introduced a mechanism to load an ephemeral system prompt from environment variables or configuration files, ensuring dynamic prompt adjustments at API-call time. - Updated the CLI and agent initialization to utilize the new prefill messages and system prompt, improving the overall interaction experience. - Enhanced configuration options with new environment variables for prefill messages and system prompts, allowing for greater customization without persistence.	2026-02-23 23:55:42 -08:00
teknium1	d18c753b3c	refactor: streamline scratchpad handling in AIAgent - Removed static methods for converting and checking <REASONING_SCRATCHPAD> tags, simplifying the codebase. - Replaced calls to the removed methods with direct function calls for better clarity and maintainability. - Updated trajectory saving logic to utilize a dedicated function for improved organization and readability.	2026-02-23 09:55:09 -08:00
teknium1	90af34bc83	feat: enhance interrupt handling and container resource configuration - Introduced a shared interrupt signaling mechanism to allow tools to check for user interrupts during long-running operations. - Updated the AIAgent to handle interrupts more effectively, ensuring in-progress tool calls are canceled and multiple interrupt messages are combined into one prompt. - Enhanced the CLI configuration to include container resource limits (CPU, memory, disk) and persistence options for Docker, Singularity, and Modal environments. - Improved documentation to clarify interrupt behaviors and container resource settings, providing users with better guidance on configuration and usage.	2026-02-23 02:11:33 -08:00
teknium1	c7857dc1d4	feat: enhance AIAgent's tool usage nudges and content handling - Introduced a method to strip <think> blocks from content, improving text visibility. - Implemented counters to reset nudge intervals when memory and skill tools are used, enhancing user guidance. - Captured content from turns with tool calls to provide fallback responses, ensuring continuity in conversation. - Updated nudge logic to remind users about saving memories and creating skills based on interaction patterns.	2026-02-22 21:33:28 -08:00
teknium1	6037b6a5ab	Fix session saving to DB with full conversation history (not just user/assistant messages without tool calls)	2026-02-22 17:10:24 -08:00
teknium1	db23f51bc6	feat: introduce skills management features in AIAgent and CLI - Added skills configuration options in cli-config.yaml.example, including a nudge interval for skill creation reminders. - Implemented skills guidance in AIAgent to prompt users to save reusable workflows after complex tasks. - Enhanced skills indexing in the prompt builder to include descriptions from SKILL.md files for better context. - Updated the agent's behavior to periodically remind users about potential skills during tool-calling iterations.	2026-02-22 13:28:13 -08:00
teknium1	3c6750f37b	feat: enhance memory management features in AIAgent and CLI - Added configuration options for memory nudge interval and flush minimum turns in cli-config.yaml.example. - Implemented memory flushing before conversation reset, clearing, and exit in the CLI to ensure memories are saved. - Introduced a flush_memories method in AIAgent to handle memory persistence before context loss. - Added periodic nudges to remind the agent to consider saving memories based on user interactions.	2026-02-22 10:15:17 -08:00
teknium1	e223b4ac09	Enhance agent guidance with memory and session search tools - Introduced MEMORY_GUIDANCE and SESSION_SEARCH_GUIDANCE to improve agent's contextual awareness and proactive assistance. - Updated AIAgent to conditionally include tool-aware guidance in prompts based on available tools. - Enhanced descriptions in memory and session search schemas for clearer user instructions on when to utilize these features.	2026-02-22 02:31:52 -08:00
teknium1	f072801f38	refactor: remove unused compression model variable in AIAgent - Eliminated the `compression_model` variable from the AIAgent class, as it was not being utilized. - Cleaned up the context compressor initialization for improved clarity and maintainability.	2026-02-22 02:17:33 -08:00
teknium1	ededaaa874	Hermes Agent UX Improvements	2026-02-22 02:16:11 -08:00
teknium1	51b95236f9	refactor: move model metadata functions to agent/model_metadata.py - Relocated functions related to model metadata, including fetch_model_metadata, get_model_context_length, estimate_tokens_rough, and estimate_messages_tokens_rough, to agent/model_metadata.py for better organization and maintainability. - Updated imports in run_agent.py to reflect the new location of these functions.	2026-02-21 22:34:18 -08:00
teknium1	9123cfb5dd	Refactor Terminal and AIAgent cleanup	2026-02-21 22:31:43 -08:00
teknium1	8f6788474b	feat: enhance logging in AIAgent for quiet mode - Added functionality to suppress logging noise from specific modules when in quiet mode, improving user experience in CLI. - Updated terminal_tool.py to change the log level for fallback directory usage from warning to debug, providing clearer context without cluttering logs.	2026-02-21 12:41:05 -08:00
teknium1	c98ee98525	feat: implement interactive prompts for sudo password and command approval in CLI - Added methods for handling sudo password and dangerous command approval prompts using a callback mechanism in cli.py. - Integrated these prompts with the prompt_toolkit UI for improved user experience. - Updated terminal_tool.py to support callback registration for interactive prompts, enhancing the CLI's interactivity. - Introduced a background thread for API calls in run_agent.py to allow for interrupt handling during long-running operations. - Enhanced error handling for interrupted API calls, ensuring graceful degradation of user experience.	2026-02-21 12:15:40 -08:00
teknium1	ecb430effe	refactor: enhance API interaction and message handling in AIAgent - Introduced new methods in run_agent.py for building API keyword arguments and normalizing assistant messages from API responses. - Added functionality for compressing conversation context and managing session state in SQLite. - Improved tool call execution handling, including enhanced logging and error management. - Updated path handling in multiple platform files to utilize pathlib for better compatibility and readability.	2026-02-21 04:17:27 -08:00

1 2 3

120 Commits