hermes-agent

Author	SHA1	Message	Date
teknium1	84718d183a	Add platform-specific formatting hints and identity for AIAgent - Introduced a default agent identity prompt to ensure consistent behavior across platforms. - Added platform-specific formatting hints for CLI, WhatsApp, Telegram, and Discord to guide the agent's output style. - Updated the AIAgent initialization to accept a platform parameter, enhancing adaptability to different interfaces.	2026-02-12 16:11:16 -08:00
teknium1	3099a2f53c	Add timestamp to active system prompt in AIAgent - Appended the current local date and time to the active system prompt to provide context for the model, addressing potential misinterpretations due to training cutoffs.	2026-02-12 15:59:31 -08:00
teknium1	ed010752dd	Update .env.example to use new Docker, Singularity, and Modal images for Python 3.11 with Node.js 20 support	2026-02-12 10:07:03 -08:00
teknium1	f5be6177b2	Add Text-to-Speech (TTS) functionality with multiple providers Add tool previews Add AGENTS and SOUL.md support Add Exec Approval	2026-02-12 10:05:08 -08:00
teknium	89c6f24d48	Merge branch 'main' of github.com:nousresearch/hermes-agent	2026-02-12 05:38:15 +00:00
teknium	f23856df8e	Add kill_modal script to manage Modal applications and better handling of file and terminal tools - Introduced a new script, `kill_modal.sh`, to facilitate stopping running Modal apps, including the ability to stop all apps or specific swe-rex sandboxes. - Enhanced user experience with clear usage instructions and feedback during the stopping process. - Improved error handling to ensure smooth execution even if some apps fail to stop.	2026-02-12 05:37:14 +00:00
teknium	1b7bc299f3	Enhance TerminalBench2 environment with task filtering due to incompat with modal and logging improvements - Updated task filter descriptions for clarity and added a new skip task feature to exclude incompatible tasks. - Introduced a set of modal incompatible tasks to prevent execution errors in cloud environments. - Implemented streaming JSONL logging for task results, preserving data even on interruptions. - Refactored task evaluation logic to include skipped task reporting and improved error handling.	2026-02-12 05:36:45 +00:00
teknium	a291cc99cf	more extra kwarg support for provider selection etc on openrouter in agent rl envs and evals	2026-02-12 05:36:25 +00:00
teknium	389ac5e017	pass extrabody for agentloop to ban and allowlist providers on openrouter, control thinking, etc	2026-02-12 05:35:48 +00:00
nightwing	fc792a4be9	Update Project_notes.md: grailed-embedding-search status and TODOs (June 2025)	2026-02-11 17:54:47 -07:00
nightwing	07501bef14	Add Project_notes.md — centralized status tracker for all side projects	2026-02-11 17:36:18 -07:00
teknium1	137ce05324	Add image generation tool to toolsets for messaging platforms - Included "image_generate" in the toolsets for web, vision, and skills categories, expanding functionality for image-related tasks. - Updated comments for clarity on the new tool's purpose, ensuring users understand its integration within the existing framework.	2026-02-10 21:04:24 -08:00
teknium1	ada0b4f131	Enhance image handling in platform adapters - Updated the image generation function description to clarify usage with markdown. - Added `send_image` method to `BasePlatformAdapter` for native image sending across platforms. - Implemented `send_image` in `DiscordAdapter` and `TelegramAdapter` to handle image attachments directly. - Introduced `extract_images` method to extract image URLs from markdown and HTML, improving content processing. - Enhanced message handling to support sending images as attachments while maintaining text content.	2026-02-10 21:02:40 -08:00
teknium	abe925e212	Update hermes-discord toolset to enable full terminal access with safety checks - Revised the description to reflect full access capabilities, including terminal usage with a dangerous command approval system. - Added terminal and file manipulation tools to the toolset, enhancing functionality for users. - Updated comments for clarity on tool purposes, ensuring better understanding of available features.	2026-02-11 04:44:30 +00:00
teknium1	8fb44608bf	Update SKILL.md and related references to implement container binding for labeled shapes and arrows in Excalidraw - Revised the labeled shape and arrow sections to utilize container binding instead of the deprecated "label" property, ensuring proper text rendering. - Added warnings about the invalidity of the "label" property and emphasized the use of `boundElements` for text elements. - Updated examples in dark-mode and general references to reflect the new binding approach, enhancing clarity and usability for users creating diagrams.	2026-02-10 20:05:23 -08:00
teknium1	153cd5bb44	Refactor skills tool integration and enhance system prompt - Removed the skills_categories tool from the skills toolset, streamlining the skills functionality to focus on skills_list and skill_view. - Updated the system prompt to dynamically build a compact skills index, allowing the model to quickly reference available skills without additional tool calls. - Cleaned up related code and documentation to reflect the removal of skills_categories, ensuring clarity and consistency across the codebase.	2026-02-10 19:48:38 -08:00
teknium1	669545f551	Add diagramming skills for Excalidraw - Introduced a new DESCRIPTION.md file outlining diagram creation skills for visual diagrams and flowcharts using Excalidraw. - Added SKILL.md for the Excalidraw skill, detailing its functionality, usage, and workflow for creating hand-drawn style diagrams. - Created references for color palettes, dark mode diagrams, and example diagrams to assist users in utilizing the Excalidraw skill effectively. - Implemented an upload script for sharing diagrams via Excalidraw.com, ensuring user-friendly access to generated diagrams.	2026-02-10 19:30:46 -08:00
teknium1	cfe2f3fe15	Implement interrupt handling for long-running tool executions in AIAgent - Added functionality to signal and terminate long-running terminal commands when a new user message is received, allowing for immediate agent response. - Introduced a global interrupt event in the terminal tool to facilitate early termination of subprocesses. - Updated the AIAgent class to handle interrupts gracefully, ensuring that remaining tool calls are skipped and appropriate messages are returned to maintain valid message sequences.	2026-02-10 16:34:27 -08:00
teknium1	140d609e0c	Refine agent history conversion logic in GatewayRunner - Enhanced the conversion of message history to agent format by distinguishing between normal and rich agent messages. - Implemented logic to preserve full message structure for tool-related messages, ensuring valid assistant-to-tool sequences. - Simplified handling of simple text messages by stripping unnecessary fields while retaining essential role and content information.	2026-02-10 16:16:30 -08:00
teknium	a32ad1a656	Fix infinite interrupt loop in gateway by consuming pending messages with .pop() and clearing interrupt events before recursion - Added logic to clear the adapter's interrupt event to prevent infinite loops during message processing. - Updated the get_pending_message method to pop messages from the pending queue, ensuring proper message handling.	2026-02-11 00:05:30 +00:00
teknium1	62ba69a29d	Fix gateway exit code to enable systemd auto-restart on connection failure - Updated the start_gateway function to return a boolean indicating success or failure, allowing for better control over exit codes. - Modified the main function to handle gateway startup failures, ensuring systemd can automatically restart on transient errors. - Enhanced error handling in the hermes_cli gateway to exit with code 1 if the gateway fails to connect to any platform.	2026-02-10 16:01:00 -08:00
teknium1	9b0f2a16ca	Enhance CLI functionality with retry and undo commands - Added /retry command to resend the last user message, improving user experience by allowing message re-sending without retyping. - Introduced /undo command to remove the last user/assistant exchange from conversation history, providing better control over conversation flow. - Updated save_config_value function to respect user and project config precedence, enhancing configuration management. - Improved prompt handling and visual output for user input, adapting to terminal width for better readability.	2026-02-10 15:59:46 -08:00
teknium	85e629e915	Add cleanup functionality for orphaned sandboxes in TerminalBench2EvalEnv - Implemented a cleanup process to terminate any remaining sandboxes after evaluation, addressing issues with orphaned thread pool workers. - Enhanced logging to inform users about the cleanup process, ensuring better resource management and user awareness.	2026-02-10 23:48:49 +00:00
teknium	999a28062d	Implement graceful exit cleanup for terminal tool - Added a new `_atexit_cleanup` function to handle cleanup of active environments and stop the cleanup thread upon program exit. - Enhanced logging to inform users about the number of remaining sandboxes being shut down during cleanup.	2026-02-10 22:53:44 +00:00
teknium	ba3fea24f1	Enhance TerminalBench 2 configuration and evaluation handling - Added task_timeout parameter to enforce a maximum wall-clock time for each task, automatically scoring as FAIL if exceeded. - Introduced terminal_timeout and tool_pool_size parameters to improve command execution and concurrency management. - Updated logging to provide detailed task execution times and timeout handling, enhancing overall monitoring. - Removed outdated evaluate_config.yaml file to streamline configuration management.	2026-02-10 22:53:24 +00:00
teknium	6b4a8d0b17	Add terminal configuration options and enhance environment setup - Introduced terminal_timeout and terminal_lifetime parameters to control command execution and sandbox inactivity. - Updated environment variable handling to allow configuration overrides for terminal settings. - Enhanced logging to provide detailed information about terminal settings during initialization. - Added tool_pool_size parameter to dynamically resize the thread pool for tool execution, improving concurrency management.	2026-02-10 22:51:50 +00:00
teknium	5ec75e38b9	Enhance tool execution and logging in HermesAgentLoop - Increased thread pool size for tool execution from 8 to 128 to improve concurrency and prevent starvation. - Added a function to resize the tool executor dynamically based on configuration. - Enhanced logging to track API call durations and tool execution times, including warnings for slow tools. - Improved overall performance monitoring by logging detailed information for each turn in the agent loop.	2026-02-10 22:51:18 +00:00
teknium	ad042fdd68	Update terminalbench_2 configuration for enhanced performance and evaluation - Increased max_token_length from 16000 to 32000 to allow for longer inputs. - Adjusted agent_temperature from 0.6 to 0.8 for more varied responses. - Extended test_timeout from 180 to 600 seconds to accommodate longer evaluations. - Updated data directory path for saving evaluations to ensure proper organization.	2026-02-10 19:48:41 +00:00
teknium	35ad3146a8	Add new environments and enhance tool context functionality - Introduced new environments: Terminal Test Environment and SWE Environment, each with default configurations for testing and software engineering tasks. - Added TerminalBench 2.0 evaluation environment with comprehensive setup for agentic LLMs, including task execution and verification. - Enhanced ToolContext with methods for uploading and downloading files, ensuring binary-safe operations. - Updated documentation across environments to reflect new features and usage instructions. - Refactored existing environment configurations for consistency and clarity.	2026-02-10 19:39:05 +00:00
teknium	e8343f2d87	Refactor Singularity environment for persistent container management - Updated the _SingularityEnvironment class to utilize a persistent Apptainer instance, allowing state (files, installs, environment changes) to persist across commands. - Enhanced the initialization process to start a background instance with full isolation and writable filesystem. - Modified the execute method to connect to the running instance, ensuring commands run within the same container context. - Implemented cleanup functionality to stop the persistent instance on cleanup or destruction, improving resource management. - Updated class documentation to reflect new features and usage of the persistent environment.	2026-02-10 06:49:58 +00:00
teknium	1b1307d0d1	Implement Anthropic prompt caching for Claude models via OpenRouter - Introduced a caching strategy that reduces input token costs by ~75% on multi-turn conversations by caching the conversation prefix. - Added functions to apply cache control markers to messages, enhancing efficiency in token usage. - Updated AIAgent to auto-enable prompt caching for Claude models, with configurable cache TTL. - Enhanced logging to track cache hit statistics when caching is active, improving monitoring of token usage.	2026-02-10 06:49:41 +00:00
teknium	7a11be9f3f	Enhance browser tool functionality and cleanup process - Added checks for local installation of the agent-browser CLI in the `_find_agent_browser` function, improving installation guidance. - Implemented per-task socket directory management in `_run_browser_command` to prevent concurrency issues. - Updated `cleanup_browser` to remove per-task socket directories, ensuring proper resource cleanup after task completion. - Refactored comments for clarity and improved documentation throughout the browser tool code.	2026-02-09 04:36:37 +00:00
teknium1	192ce958c3	Enhance CLI command handling and introduce resource cleanup features - Added imports for resource cleanup during safe shutdown, including terminal and browser session cleanup. - Refactored command handling to preserve original case for model names and prompt text, improving user experience. - Introduced a dedicated interrupt queue to manage user input while the agent is running, preventing race conditions. - Updated comments and documentation for clarity on command processing and input handling.	2026-02-08 13:31:45 -08:00
teknium1	c441681dc2	Update default model to 'anthropic/claude-opus-4.6' and refine terminal working directory settings - Changed the default LLM model in the setup wizard and example environment file to 'anthropic/claude-opus-4.6'. - Updated terminal working directory settings in CLI and related files to use the current directory ('.') instead of '/tmp'. - Enhanced documentation comments for clarity on terminal configuration and working directory behavior.	2026-02-08 12:56:40 -08:00
teknium	dd70d57b9b	Refactor BatchRunner and AIAgent for enhanced reasoning and tool management, improved tool definitions for fileops - Updated `ALL_POSSIBLE_TOOLS` to auto-derive from `TOOL_TO_TOOLSET_MAP` for consistent schema. - Introduced `_extract_reasoning_stats` function to track reasoning coverage in assistant turns. - Enhanced `_process_batch_worker` to discard prompts with no reasoning and aggregate reasoning statistics. - Updated documentation and comments for clarity on new features and changes.	2026-02-08 20:19:14 +00:00
teknium	f12ea1bc02	Enhance BatchRunner and AIAgent with new configuration options, default model now opus 4.6, default summarizer gemini flash 3 - Added `max_tokens`, `reasoning_config`, and `prefill_messages` parameters to `BatchRunner` and `AIAgent` for improved model response control. - Updated CLI to support new options for reasoning effort and prefill messages from a JSON file. - Modified example configuration files to reflect changes in default model and summary model. - Improved error handling for loading prefill messages and reasoning configurations in the CLI. - Updated documentation to include new parameters and usage examples.	2026-02-08 10:49:24 +00:00
Teknium	fa76a331b0	Merge pull request #19 from NousResearch/atropos-hermes-agent Enhance async tool execution and error handling in Hermes agent for A…	2026-02-07 21:01:06 -08:00
teknium	d999d9876d	Enhance async tool execution and error handling in Hermes agent for Atropos integration - Updated `.gitignore` to exclude `testlogs` directory. - Refactored `handle_web_function_call` in `model_tools.py` to support running async functions in existing event loops, improving compatibility with Atropos. - Introduced a thread pool executor in `agent_loop.py` for running synchronous tool calls that internally use `asyncio.run()`, preventing deadlocks. - Added `ToolError` class to track tool execution errors, enhancing error reporting during agent loops. - Updated `wandb_log` method in `hermes_base_env.py` to log tool error statistics for better monitoring. - Implemented patches in `patches.py` to ensure async-safe operation of tools within Atropos's event loop. - Enhanced `ToolContext` and `terminal_tool.py` to utilize the new async handling, improving overall tool execution reliability.	2026-02-08 05:00:47 +00:00
Teknium	578a5fb6a9	Merge pull request #18 from NousResearch/atropos-hermes-agent Upgrade installers to use uv	2026-02-07 16:00:05 -08:00
teknium	a8809bbd3e	Transition installation to uv for py version and speed to be easier to streamline - Integrated `uv` as a fast Python package manager for automatic Python provisioning and dependency management. - Updated installation scripts (`setup-hermes.sh`, `install.sh`, `install.ps1`) to utilize `uv` for installing Python and packages, streamlining the setup process. - Revised `README.md` to reflect changes in installation steps, including symlinking `hermes` for global access and clarifying Python version requirements. - Adjusted commands in `doctor.py` and other scripts to recommend `uv` for package installations, ensuring consistency across the project.	2026-02-07 23:54:53 +00:00
teknium	a478e44585	Increase max_token_length in TerminalTestEnv to 16000 for enhanced processing capacity	2026-02-07 21:11:07 +00:00
teknium	c0494b3558	Update pyproject.toml to refine dependency management - Reorganized the 'all' dependencies to include specific optional groups for better modularity. - Added support for 'hermes-agent' with distinct categories: modal, messaging, cron, cli, and dev.	2026-02-07 21:11:01 +00:00
Teknium	7f1cd014f2	Merge pull request #17 from NousResearch/atropos-hermes-agent Add support for Atropos Agentic RL environments (requires branch tool…	2026-02-07 09:12:10 -08:00
teknium	07b615e96e	Add support for Atropos Agentic RL environments (requires branch tool_call_support in Atropos atm) - Added new environments for reinforcement learning, including `HermesSweEnv` for software engineering tasks and `TerminalTestEnv` for inline testing. - Introduced `ToolContext` for unrestricted access to tools during reward computation. - Updated `.gitignore` to exclude `wandb/` directory. - Enhanced `README.md` with detailed architecture and usage instructions for Atropos environments. - Added configuration files for SWE and terminal test environments to streamline setup. - Removed unnecessary compiled Python files from `__pycache__`.	2026-02-07 09:17:16 +00:00
Teknium	ab387a6120	Merge pull request #16 from NousResearch/atropos-hermes-agent Update dependencies and enhance installation scripts	2026-02-06 16:05:50 -08:00
teknium	ac79725923	Update dependencies and enhance installation scripts - Added `prompt_toolkit` as a direct dependency for interactive CLI support. - Updated `modal` optional dependency to require `swe-rex[modal]>=1.4.0` for improved cloud execution capabilities. - Enhanced `messaging` optional dependencies to include `aiohttp>=3.9.0` for WhatsApp bridge communication. - Refined installation scripts to check for Python version requirements, emphasizing the need for Python 3.11+ for RL training tools. - Improved setup scripts to ensure proper installation of submodules and dependencies, enhancing user experience during setup.	2026-02-07 00:05:04 +00:00
Teknium	8dd38318fc	Merge pull request #15 from NousResearch/rl-capabilities Rl capabilities && File Operator Tools	2026-02-05 03:50:42 -08:00
teknium1	533c064269	Add file manipulation tools and enhance setup scripts - Introduced file manipulation capabilities in `model_tools.py`, including functions for reading, writing, patching, and searching files. - Added a new `file` toolset in `toolsets.py` and updated distributions to include file tools. - Enhanced `setup-hermes.sh` and `install.sh` scripts to check for and optionally install `ripgrep` for faster file searching. - Implemented a new `file_operations.py` module to encapsulate file operations using shell commands. - Updated `doctor.py` and `install.ps1` to check for `ripgrep` and provide installation guidance if not found. - Added fuzzy matching and patch parsing capabilities to improve file manipulation accuracy and flexibility.	2026-02-05 03:49:46 -08:00
teknium1	5c3105b437	Enhance RL test inference with WandB integration and real-time output streaming - Added unique run ID generation for WandB tracking during test inference. - Enabled WandB usage for test tracking and updated command-line arguments accordingly. - Implemented real-time output streaming for process execution, improving log visibility and debugging. - Enhanced error handling to display last few lines of stderr for better troubleshooting.	2026-02-04 21:07:07 -08:00
teknium1	3c0d0dba49	Update RL tools and enhance configuration management - Modified `model_tools.py` to update default model IDs and add new RL function `rl_test_inference`. - Enhanced `README.md` with installation instructions for submodules and updated API key usage. - Improved `rl_cli.py` to load configuration from `~/.hermes/config.yaml` and set terminal working directory for RL tools. - Updated `run_agent.py` to handle empty string arguments as empty objects for better JSON validation. - Refined installation scripts to ensure submodules are cloned and installed correctly, enhancing setup experience.	2026-02-04 13:57:59 -08:00

1 2 3 4

184 Commits