hermes-agent

Author	SHA1	Message	Date
teknium1	f55f625277	chore: reorder terminal backends in setup wizard Local, Docker, Modal, SSH, Daytona, Singularity (Linux-only, last).	2026-03-06 22:21:57 -08:00
teknium1	9dac85b069	fix: uv pip install fails outside venv in setup wizard uv pip install requires a virtual environment by default. When hermes is installed system-wide or via pipx, the setup wizard's SDK installs (daytona, swe-rex[modal], tinker-atropos) fail with 'No virtual environment found'. Fix by passing --python sys.executable to uv, which targets the correct Python regardless of venv state. Also show the actual error message on install failure so users can debug.	2026-03-06 21:55:33 -08:00
teknium1	99bd69baa8	Merge feat/modular-setup-wizard: modular setup wizard with section subcommands and tool-first UX - 5 standalone sections: hermes setup [model\|terminal\|gateway\|tools\|agent] - Returning user menu with section shortcuts - Tool-first UX: category -> provider -> API key flow - Unified hermes tools / hermes setup tools - Fixed dict-format model config display bug Closes #567	2026-03-06 21:12:30 -08:00
teknium1	a62a137a4f	fix: handle dict-format model config in setup wizard display config['model'] can be a dict (old format: {default, base_url, provider}) or a string (new format). The setup wizard was showing the raw dict in 'Keep current' and 'Model set to' messages. Now extracts the model name from either format.	2026-03-06 21:11:40 -08:00
teknium1	82b18e8ac2	feat: unify hermes tools and hermes setup tools into single flow Both 'hermes tools' and 'hermes setup tools' now use the same unified flow in tools_config.py: 1. Select platform (CLI, Telegram, Discord, etc.) 2. Toggle all 18 toolsets on/off in checklist 3. Newly enabled tools that need API keys → provider-aware config (e.g., TTS shows Edge/OpenAI/ElevenLabs picker) 4. Already-configured tools that stay enabled → silent, no prompts 5. Menu option: 'Reconfigure an existing tool' for updating providers or API keys on tools that are already set up Key changes: - Move TOOL_CATEGORIES, provider config, and post-setup hooks from setup.py to tools_config.py - Replace flat _check_and_prompt_requirements() with provider-aware _configure_toolset() that uses TOOL_CATEGORIES - Add _reconfigure_tool() flow for updating existing configs - setup.py's setup_tools() now delegates to tools_command() - tools_command() menu adds 'Reconfigure' option alongside platforms - Only prompt for API keys on tools that are NEWLY toggled on AND don't already have keys configured No breaking changes. All 2013 tests pass.	2026-03-06 21:02:00 -08:00
teknium1	0111c9848d	fix: remove ANSI codes and em dashes from menu labels simple_term_menu miscalculates string widths when labels contain ANSI escape codes (from color()) or em dashes, causing duplicated and garbled lines on arrow key navigation. Replace color() status indicators with plain text [configured]/[active] and em dashes with regular dashes in all prompt_choice/prompt_checklist labels.	2026-03-06 21:02:00 -08:00
teknium1	ab9cadfeee	feat: modular setup wizard with section subcommands and tool-first UX Restructure the monolithic hermes setup wizard into independently-runnable sections with a category-first tool configuration experience. Changes: - Break setup into 5 sections: model, terminal, gateway, tools, agent - Each section is a standalone function, runnable individually via 'hermes setup model', 'hermes setup terminal', etc. - Returning users get a menu: Quick Setup / Full Setup / individual sections - First-time users get a guided walkthrough of all sections Tool Configuration UX overhaul: - Replace flat API key checklist with category-first approach - Show tool types (TTS, Web Search, Image Gen, etc.) as top-level items - Within each category, let users pick a provider: - TTS: Microsoft Edge (Free), OpenAI, ElevenLabs - Web: Firecrawl Cloud, Firecrawl Self-Hosted - Image Gen: FAL.ai - Browser: Browserbase - Smart Home: Home Assistant - RL Training: Tinker/Atropos - GitHub: Personal Access Token - Shows configured status on each tool and provider - Only prompts for API keys after provider selection Also: - Add section argument to setup argparse parser in main.py - Update summary to show new section commands - Add self-hosted Firecrawl and Home Assistant to tool setup - All 2013 tests pass	2026-03-06 21:02:00 -08:00
teknium1	ce28f847ce	fix: update OpenRouter model names for yc-bench config Use anthropic/claude-sonnet-4.6 (OpenRouter format) instead of anthropic/claude-sonnet-4-20250514 (direct API format).	2026-03-06 19:58:56 -08:00
teknium1	b4fbb6fe10	feat: add YC-Bench long-horizon agent benchmark environment Adds eval-only benchmark for YC-Bench (collinear-ai/yc-bench), a deterministic long-horizon benchmark where the agent acts as CEO of an AI startup over a simulated 1-3 year run. Key design decisions verified against the official yc-bench repo: - Uses 'sim init' (NOT 'yc-bench run') to avoid starting a competing built-in agent loop - Correct DB table names: 'companies' and 'sim_events' - Correct 4 domains: research, inference, data_environment, training - Penalty values are preset-dependent (not hardcoded in system prompt) - Sequential evaluation (each run is 100-500 turns) - Follows TerminalBench2 patterns: KeyboardInterrupt handling, cleanup_all_environments(), tqdm logging handler, streaming JSONL yc-bench added as optional dependency: pip install hermes-agent[yc-bench] Closes #340	2026-03-06 19:25:56 -08:00
teknium1	82d7e9429e	chore: add GLM/Kimi/MiniMax models to insights pricing (zero cost) These direct providers don't return cost in API responses and their per-token pricing isn't readily available externally. Treat as local models with zero cost so they appear in /insights without fake estimates.	2026-03-06 19:12:14 -08:00
teknium1	e2821effb5	feat: add direct API-key providers as auxiliary client fallbacks When the user only has a z.ai/Kimi/MiniMax API key (no OpenRouter key), auxiliary tasks (context compression, web summarization, session search) now fall back to the configured direct provider instead of returning None. Resolution chain: OpenRouter -> Nous -> Custom endpoint -> Codex OAuth -> direct API-key providers -> None. Uses cheap/fast models for auxiliary tasks: - zai: glm-4.5-flash - kimi-coding: kimi-k2-turbo-preview - minimax/minimax-cn: MiniMax-M2.5-highspeed Vision auxiliary intentionally NOT modified — vision needs multimodal models (Gemini) that these providers don't serve.	2026-03-06 19:08:54 -08:00
teknium1	9742f11fda	chore: add context lengths for Kimi and MiniMax models Adds DEFAULT_CONTEXT_LENGTHS entries for kimi-k2.5 (262144), kimi-k2-thinking (262144), kimi-k2-turbo-preview (262144), kimi-k2-0905-preview (131072), MiniMax-M2.5/M2.5-highspeed/M2.1 (204800), and glm-4.5/4.5-flash (131072). Avoids unnecessary 2M-token probe on first use with direct providers.	2026-03-06 19:01:38 -08:00
teknium1	388dd4789c	feat: add z.ai/GLM, Kimi/Moonshot, MiniMax as first-class providers Adds 4 new direct API-key providers (zai, kimi-coding, minimax, minimax-cn) to the inference provider system. All use standard OpenAI-compatible chat/completions endpoints with Bearer token auth. Core changes: - auth.py: Extended ProviderConfig with api_key_env_vars and base_url_env_var fields. Added providers to PROVIDER_REGISTRY. Added provider aliases (glm, z-ai, zhipu, kimi, moonshot). Added auto-detection of API-key providers in resolve_provider(). Added resolve_api_key_provider_credentials() and get_api_key_provider_status() helpers. - runtime_provider.py: Added generic API-key provider branch in resolve_runtime_provider() — any provider with auth_type='api_key' is automatically handled. - main.py: Added providers to hermes model menu with generic _model_flow_api_key_provider() flow. Updated _has_any_provider_configured() to check all provider env vars. Updated argparse --provider choices. - setup.py: Added providers to setup wizard with API key prompts and curated model lists. - config.py: Added env vars (GLM_API_KEY, KIMI_API_KEY, MINIMAX_API_KEY, etc.) to OPTIONAL_ENV_VARS. - status.py: Added API key display and provider status section. - doctor.py: Added connectivity checks for each provider endpoint. - cli.py: Updated provider docstrings. Docs: Updated README.md, .env.example, cli-config.yaml.example, cli-commands.md, environment-variables.md, configuration.md. Tests: 50 new tests covering registry, aliases, resolution, auto-detection, credential resolution, and runtime provider dispatch. Inspired by PR #33 (numman-ali) which proposed a provider registry approach. Credit to tars90percent (PR #473) and manuelschipper (PR #420) for related provider improvements merged earlier in this changeset.	2026-03-06 18:55:18 -08:00
Teknium	fdebca4573	Merge pull request #571 from NousResearch/rewbs/nous-key-remint-attempt-on-401 fix: implement Nous credential refresh on 401 error for retry logic	2026-03-06 18:52:01 -08:00
teknium1	479dfc096a	Merge PR #473 : Update model id in OpenRouter from minimax-m2.1 to minimax-m2.5 Authored by tars90percent. Updates remaining minimax-m2.1 references to minimax-m2.5 in rl_training_tool.py and docs.	2026-03-06 18:43:18 -08:00
teknium1	3c6c11b7c9	Merge PR #420 : fix: respect OPENAI_BASE_URL when resolving API key priority Authored by manuelschipper. Adds GLM-4.7 and GLM-5 context lengths (202752) to model_metadata.py. The key priority fix (prefer OPENAI_API_KEY for non-OpenRouter endpoints) was already applied in PR #295; merged the Z.ai mention into the comment.	2026-03-06 18:43:13 -08:00
Robin Fernandes	bc091eb7ef	fix: implement Nous credential refresh on 401 error for retry logic	2026-03-07 13:34:23 +11:00
teknium1	f75b1d21b4	fix: execute_code and delegate_task now respect disabled toolsets When a user disables the web toolset via 'hermes tools', the execute_code schema description still hardcoded web_search/web_extract as available, causing the model to keep trying to use them. Similarly, delegate_task always defaulted to ['terminal', 'file', 'web'] for subagents regardless of the parent's config. Changes: - execute_code schema is now built dynamically via build_execute_code_schema() based on which sandbox tools are actually enabled - model_tools.py rebuilds the execute_code schema at definition time using the intersection of sandbox-allowed and session-enabled tools - delegate_task now inherits the parent agent's enabled_toolsets instead of hardcoding DEFAULT_TOOLSETS when no explicit toolsets are specified - delegate_task description updated to say 'inherits your enabled toolsets' Reported by kotyKD on Discord.	2026-03-06 17:36:14 -08:00
teknium1	94053d75a6	fix: custom endpoint no longer leaks OPENROUTER_API_KEY (#560 ) API key selection is now base_url-aware: when the resolved base_url targets OpenRouter, OPENROUTER_API_KEY takes priority (preserving the #289 fix). When hitting any other endpoint (Z.ai, vLLM, custom, etc.), OPENAI_API_KEY takes priority so the OpenRouter key doesn't leak. Applied in both the runtime provider resolver (the real code path) and the CLI initial default (for consistency). Fixes #560.	2026-03-06 17:16:14 -08:00
teknium1	2a68099675	fix(tests): isolate tests from user ~/.hermes/ config and SOUL.md _make_cli() now patches CLI_CONFIG with clean defaults so test_cli_init tests don't depend on the developer's local config.yaml. test_empty_dir_returns_empty now mocks Path.home() so it doesn't pick up a global SOUL.md. Credit to teyrebaz33 for identifying and fixing these in PR #557. Fixes #555.	2026-03-06 17:10:35 -08:00
teknium1	6cd3bc6640	Merge PR #563 : fix: prevent data loss in skills sync on copy/update failure Authored by 0xbyt4. Two bugs fixed: 1. Failed copytree no longer poisons the manifest (skill gets retried) 2. Failed update no longer destroys user's copy (backup + restore)	2026-03-06 17:01:30 -08:00
0xbyt4	211b55815e	fix: prevent data loss in skills sync on copy/update failure Two bugs in sync_skills(): 1. Failed copytree poisons manifest: when shutil.copytree fails (disk full, permission error), the skill is still recorded in the manifest. On the next sync, the skill appears as "in manifest but not on disk" which is interpreted as "user deliberately deleted it" — the skill is never retried. Fix: only write to manifest on successful copy. 2. Failed update destroys user copy: rmtree deletes the existing skill directory before copytree runs. If copytree then fails, the user's skill is gone with no way to recover. Fix: move to .bak before copying, restore from backup if copytree fails. Both bugs are proven by new regression tests that fail on the old code and pass on the fix.	2026-03-07 03:58:32 +03:00
teknium1	8ae4a6f824	fix: improve handling of empty responses after tool calls - Added fallback mechanism to utilize previous content when the model generates an empty response after tool calls, reducing unnecessary API retries. - Enhanced logging to indicate when prior content is used as a final response. - Updated logic to ensure that genuine empty responses are retried appropriately, maintaining user experience.	2026-03-06 16:54:31 -08:00
teknium1	b98301677a	docs: add /insights to all help menus and documentation - website/docs/reference/cli-commands.md: Added 'hermes insights' terminal command section with --days and --source flags, plus /insights slash command in the Conversation section - website/docs/user-guide/cli.md: Added /insights to slash commands table - website/docs/user-guide/messaging/index.md: Added /insights to gateway chat commands table - website/docs/user-guide/sessions.md: Added cross-reference to hermes insights from the sessions stats section	2026-03-06 16:48:58 -08:00
teknium1	f2fdde5ba4	fix: show user-modified skills count in hermes update output	2026-03-06 16:14:43 -08:00
teknium1	4f56e31dc7	fix: track origin hashes in skills manifest to preserve user modifications Upgrade skills_sync manifest to v2 format (name:origin_hash). The origin hash records the MD5 of the bundled skill at the time it was last synced. On update, the user's copy is compared against the origin hash: - User copy == origin hash → unmodified → safe to update from bundled - User copy != origin hash → user customized → skip (preserve changes) v1 manifests (plain names) are auto-migrated: the user's current hash becomes the baseline, so future syncs can detect modifications. Output now shows user-modified skills: ~ whisper (user-modified, skipping) 27 tests covering all scenarios including v1→v2 migration, user modification detection, update after migration, and origin hash tracking. 2009 tests pass.	2026-03-06 16:13:58 -08:00
Teknium	6d3804770c	Merge pull request #552 from NousResearch/feat/insights feat: /insights command — usage analytics, cost estimation & activity patterns	2026-03-06 16:00:28 -08:00
teknium1	ab0f4126cf	fix: restore all removed bundled skills + fix skills sync system - Restored 21 skills removed in commits `757d012` and `740dd92`: accelerate, audiocraft, code-review, faiss, flash-attention, gguf, grpo-rl-training, guidance, llava, nemo-curator, obliteratus, peft, pytorch-fsdp, pytorch-lightning, simpo, slime, stable-diffusion, tensorrt-llm, torchtitan, trl-fine-tuning, whisper - Rewrote sync_skills() with proper update semantics: * New skills (not in manifest): copied to user dir * Existing skills (in manifest + on disk): updated via hash comparison * User-deleted skills (in manifest, not on disk): respected, not re-added * Stale manifest entries (removed from bundled): cleaned from manifest - Added sync_skills() to CLI startup (cmd_chat) and gateway startup (start_gateway) — previously only ran during 'hermes update' - Updated cmd_update output to show new/updated/cleaned counts - Rewrote tests: 20 tests covering manifest CRUD, dir hashing, fresh install, user deletion respect, update detection, stale cleanup, and name collision handling 75 bundled skills total. 2002 tests pass.	2026-03-06 15:57:30 -08:00
teknium1	585f8528b2	fix: deep review — prefix matching, tool_calls extraction, query perf, serialization Issues found and fixed during deep code path review: 1. CRITICAL: Prefix matching returned wrong prices for dated model names - 'gpt-4o-mini-2024-07-18' matched gpt-4o ($2.50) instead of gpt-4o-mini ($0.15) - Same for o3-mini→o3 (9x), gpt-4.1-mini→gpt-4.1 (5x), gpt-4.1-nano→gpt-4.1 (20x) - Fix: use longest-match-wins strategy instead of first-match - Removed dangerous key.startswith(bare) reverse matching 2. CRITICAL: Top Tools section was empty for CLI sessions - run_agent.py doesn't set tool_name on tool response messages (pre-existing) - Insights now also extracts tool names from tool_calls JSON on assistant messages, which IS populated for all sessions - Uses max() merge strategy to avoid double-counting between sources 3. SELECT * replaced with explicit column list - Skips system_prompt and model_config blobs (can be thousands of chars) - Reduces memory and I/O for large session counts 4. Sets in overview dict converted to sorted lists - models_with_pricing / models_without_pricing were Python sets - Sets aren't JSON-serializable — would crash json.dumps() 5. Negative duration guard - end > start check prevents negative durations from clock drift 6. Model breakdown sort fallback - When all tokens are 0, now sorts by session count instead of arbitrary order 7. Removed unused timedelta import Added 6 new tests: dated model pricing (4), tool_calls JSON extraction, JSON serialization safety. Total: 69 tests.	2026-03-06 14:50:57 -08:00
teknium1	75f523f5c0	fix: unknown/custom models get zero cost instead of fake estimates Custom OAI endpoints, self-hosted models, and local inference should NOT show fabricated cost estimates. Changed default pricing from $3/$12 per million tokens to $0/$0 for unrecognized models. - Added _has_known_pricing() to distinguish commercial vs custom models - Models with known pricing show $ amounts; unknown models show 'N/A' - Overview shows asterisk + note when some models lack pricing data - Gateway format adds '(excludes custom/self-hosted models)' note - Added 7 new tests for custom model cost handling	2026-03-06 14:18:19 -08:00
teknium1	68fbae5692	docs: add Custom & Self-Hosted LLM Providers guide Comprehensive guide for using Hermes Agent with alternative LLM backends: - Ollama (local models, zero config) - vLLM (high-performance GPU inference) - SGLang (RadixAttention, prefix caching) - llama.cpp / llama-server (CPU & Metal inference) - LiteLLM Proxy (multi-provider gateway) - ClawRouter (cost-optimized routing with complexity scoring) - 10+ other compatible providers table (Together, Groq, DeepSeek, etc.) - Choosing the Right Setup decision table - General custom endpoint setup instructions All of these work via the existing OPENAI_BASE_URL + OPENAI_API_KEY custom endpoint support — no code changes needed.	2026-03-06 14:16:06 -08:00
teknium1	80f1dd8d37	docs: add Custom & Self-Hosted LLM Providers guide Comprehensive guide for using Hermes Agent with alternative LLM backends: - Ollama (local models, zero config) - vLLM (high-performance GPU inference) - SGLang (RadixAttention, prefix caching) - llama.cpp / llama-server (CPU & Metal inference) - LiteLLM Proxy (multi-provider gateway) - ClawRouter (cost-optimized routing with complexity scoring) - 10+ other compatible providers table (Together, Groq, DeepSeek, etc.) - Choosing the Right Setup decision table - General custom endpoint setup instructions All of these work via the existing OPENAI_BASE_URL + OPENAI_API_KEY custom endpoint support — no code changes needed.	2026-03-06 14:15:57 -08:00
teknium1	b52b37ae64	feat: add /insights command with usage analytics and cost estimation Inspired by Claude Code's /insights, adapted for Hermes Agent's multi-platform architecture. Analyzes session history from state.db to produce comprehensive usage insights. Features: - Overview stats: sessions, messages, tokens, estimated cost, active time - Model breakdown: per-model sessions, tokens, and cost estimation - Platform breakdown: CLI vs Telegram vs Discord etc. (unique to Hermes) - Tool usage ranking: most-used tools with percentages - Activity patterns: day-of-week chart, peak hours, streaks - Notable sessions: longest, most messages, most tokens, most tool calls - Cost estimation: real pricing data for 25+ models (OpenAI, Anthropic, DeepSeek, Google, Meta) with fuzzy model name matching - Configurable time window: --days flag (default 30) - Source filtering: --source flag to filter by platform Three entry points: - /insights slash command in CLI (supports --days and --source flags) - /insights slash command in gateway (compact markdown format) - hermes insights CLI subcommand (standalone) Includes 56 tests covering pricing helpers, format helpers, empty DB, populated DB with multi-platform data, filtering, formatting, and edge cases.	2026-03-06 14:04:59 -08:00
teknium1	d63b363cde	refactor: extract atomic_json_write helper, add 24 checkpoint tests Extract the duplicated temp-file + fsync + os.replace pattern from batch_runner.py (1 instance) and process_registry.py (2 instances) into a shared utils.atomic_json_write() function. Add 12 tests for atomic_json_write covering: valid JSON, parent dir creation, overwrite, crash safety (original preserved on error), no temp file leaks, string paths, unicode, custom indent, concurrent writes. Add 12 tests for batch_runner checkpoint behavior covering: _save_checkpoint (valid JSON, last_updated, overwrite, lock/no-lock, parent dirs, no temp leaks), _load_checkpoint (missing file, existing data, corrupt JSON), and resume logic (preserves prior progress, different run_name starts fresh).	2026-03-06 05:50:12 -08:00
teknium1	c05c60665e	Merge PR #298 : Make process_registry checkpoint writes atomic Authored by aydnOktay. Companion to PR #297 (batch_runner). Applies the same atomic write pattern (temp file + fsync + os.replace) to both _write_checkpoint() and recover_from_checkpoint() in process_registry.py. Prevents checkpoint corruption on gateway crashes. Also improves error handling: bare 'pass' replaced with logger.debug(..., exc_info=True) for better debugging.	2026-03-06 05:32:35 -08:00
teknium1	b4873a5de7	fix(setup): Escape skips instead of exiting, add control hints to all prompts Previously pressing Escape in any setup wizard menu called sys.exit(1), killing the entire wizard with no way to recover. Now: - prompt_choice: Escape keeps the current default and moves on (prints 'Skipped (keeping current)'). Shows '↑/↓ Navigate Enter Select Esc Skip Ctrl+C Exit' hint. - prompt_checklist: Escape returns pre-selected items instead of empty list. Shows 'SPACE Toggle ENTER Confirm ESC Skip Ctrl+C Exit'. - prompt_yes_no: now catches KeyboardInterrupt/EOFError properly. - Fallback number prompts also show control hints. Ctrl+C still exits the wizard cleanly.	2026-03-06 05:27:11 -08:00
teknium1	913f8ce0a5	Merge PR #297 : Make batch_runner checkpoint incremental and atomic Authored by aydnOktay. Three improvements to batch_runner fault tolerance: 1) Atomic checkpoint writes (temp file + fsync + os.replace) to prevent corruption on crashes — same pattern as auth.py's _save_auth_store(). 2) Incremental checkpoints after each batch result instead of only at end, so interrupted runs can resume with minimal progress loss. 3) Resume loads existing checkpoint state instead of initializing empty, preventing clobber of prior progress. Conflict resolved: kept both the incremental checkpoint logic (PR) and the batch worker error handling (HEAD) in the imap_unordered loop.	2026-03-06 05:16:31 -08:00
teknium1	4a63737227	Merge PR #433 : fix(whatsapp): replace Linux-only fuser with cross-platform port cleanup Authored by Farukest. Fixes #432. Extracts _kill_port_process() helper that uses netstat+taskkill on Windows and fuser on Linux. Previously, fuser calls were inline with bare except-pass, so on Windows orphaned bridge processes were never cleaned up — causing 'address already in use' errors on reconnect. Includes 5 tests covering both platforms, port matching edge cases, and exception suppression.	2026-03-06 04:52:25 -08:00
teknium1	3e93db16bd	Merge PR #436 : fix: use _max_tokens_param in max-iterations retry path Authored by Farukest. Fixes #435. The retry summary in _handle_max_iterations() hardcoded max_tokens instead of using _max_tokens_param(), which returns max_completion_tokens for direct OpenAI API (required by gpt-4o, o-series). The first attempt already used _max_tokens_param correctly — only the retry path was wrong. Includes 4 tests for _max_tokens_param provider detection.	2026-03-06 04:46:24 -08:00
teknium1	f863a42351	Merge PR #441 : fix(gateway): return response from /retry handler instead of discarding it Authored by PercyDikec. Fixes #440. _handle_retry_command called _handle_message(retry_event) but discarded the return value, returning None instead. Since only _process_message_background sends the response via adapter.send(), this meant the agent would run (tool progress was visible) but the final answer was silently dropped on all platforms.	2026-03-06 04:42:54 -08:00
teknium1	dc55f493be	fix: add missing re.DOTALL to DeepSeek V3.1 parser (same bug as V3) The V3.1 parser had the same issue — .*? without re.DOTALL fails to match multi-line JSON arguments. Found during review of PR #444.	2026-03-06 04:41:47 -08:00
teknium1	936fda3f9e	Merge PR #444 : fix: add missing re.DOTALL flag to DeepSeek V3 tool call parser Authored by PercyDikec. Fixes #443. Without re.DOTALL, the regex .* doesn't match newlines, so multi-line JSON arguments (the normal case) silently fail to parse. Every other parser in the codebase that matches across lines already uses re.DOTALL.	2026-03-06 04:39:53 -08:00
teknium1	ecb8148a9f	Merge PR #448 : fix(cli): use correct dict key for codex auth file path in status output Authored by PercyDikec. Fixes #447. The status display used codex_status.get('auth_file') but get_codex_auth_status() in auth.py returns the path under 'auth_store' (line 1220). This one-char key mismatch silently dropped the auth file path from 'hermes status'.	2026-03-06 04:34:46 -08:00
teknium1	2dbbedc05a	docs: rebrand messaging — 'the self-improving AI agent' - Lead with the learning loop: autonomous skill creation, skill self-improvement, memory nudges, FTS5 session search, Honcho dialectic user modeling - 'Runs anywhere' angle: 6 backends, serverless persistence with Daytona/Modal, not tied to your laptop - 'Built by model trainers' replaces 'model-agnostic' - Updated README tagline, feature table, subtitle - Updated docs landing page hero, description, key features - Updated docusaurus tagline and pyproject.toml description	2026-03-06 04:34:06 -08:00
teknium1	c30967806c	test: add 26 tests for set_config_value secret routing Verifies explicit allowlist keys, catch-all _API_KEY/_TOKEN patterns, case insensitivity, TERMINAL_SSH prefix, and config.yaml routing for non-secret keys. Covers the fix from PR #469.	2026-03-06 04:26:18 -08:00
teknium1	145f719d30	Merge PR #469 : fix(config): route API keys and tokens to .env instead of config.yaml Authored by ygd58. Fixes #465. Adds missing keys to allowlist and catch-all patterns (_API_KEY, _TOKEN suffixes) for future-proofing.	2026-03-06 04:23:49 -08:00
teknium1	b89eb29174	fix: correct mock tool name 'search' → 'search_files' in test_code_execution The mock handler checked for function_name == 'search' but the RPC sends 'search_files'. Any test exercising search_files through the mock would get 'Unknown tool' instead of the canned response.	2026-03-06 03:53:43 -08:00
teknium1	3670089a42	docs: add Daytona to batch_runner, process_registry, agent_loop, tool_context Add daytona_image to batch_runner per-prompt container image overrides so batch processing works with the Daytona backend. Update inline comments in RL environment files (agent_loop, tool_context) and process_registry docstrings to include Daytona in backend lists.	2026-03-06 03:49:59 -08:00
teknium1	3982fcf095	fix: sync execute_code sandbox stubs with real tool schemas The _TOOL_STUBS dict in code_execution_tool.py was out of sync with the actual tool schemas, causing TypeErrors when the LLM used parameters it sees in its system prompt but the sandbox stubs didn't accept: search_files: - Added missing params: context, offset, output_mode - Fixed target default: 'grep' → 'content' (old value was obsolete) patch: - Added missing params: mode, patch (V4A multi-file patch support) Also added 4 drift-detection tests (TestStubSchemaDrift) that will catch future divergence between stubs and real schemas: - test_stubs_cover_all_schema_params: every schema param in stub - test_stubs_pass_all_params_to_rpc: every stub param sent over RPC - test_search_files_target_uses_current_values: no obsolete values - test_generated_module_accepts_all_params: generated code compiles All 28 tests pass.	2026-03-06 03:40:06 -08:00
teknium1	8481fdcf08	docs: complete Daytona backend documentation coverage Update all remaining files that enumerate terminal backends to include Daytona. Covers security docs (bypass info, backend comparison table), environment variables reference (DAYTONA_API_KEY, TERMINAL_DAYTONA_IMAGE, container resources header), AGENTS.md (architecture tree, config keys), environments/README.md, hermes_base_env.py field description, and various module docstrings. Follow-up to PR #451 merge.	2026-03-06 03:37:05 -08:00

1 2 3 4 5 ...

893 Commits