hermes-agent

Teknium b2b4a9ee7d fix(gateway): hygiene compression ignores config context_length and 1.4x exceeds model limit

Three bugs in gateway session hygiene pre-compression caused 'Session too
large' errors for ~200K context models like GLM-5-turbo on z.ai:

1. Gateway hygiene called get_model_context_length(model) without passing
   config_context_length, provider, or base_url — so user overrides like
   model.context_length: 180000 were ignored, and provider-aware detection
   (models.dev, z.ai endpoint) couldn't fire. The agent's own compressor
   correctly passed all three (run_agent.py line 1038).

2. The 1.4x safety factor on rough token estimates pushed the compression
   threshold above the model's actual context limit:
     200K * 0.85 * 1.4 = 238K > 200K (model limit)
   So hygiene never compressed, sessions grew past the limit, and the API
   rejected the request.

3. Same issue for the warn threshold: 200K * 0.95 * 1.4 = 266K.

Fix:
- Read model.context_length, provider, and base_url from config.yaml
  (same as run_agent.py does) and pass them to get_model_context_length()
- Resolve provider/base_url from runtime when not in config
- Cap the 1.4x-adjusted compress threshold at 95% of context_length
- Cap the 1.4x-adjusted warn threshold at context_length

Affects: z.ai GLM-5/GLM-5-turbo, any ~200K or smaller context model
where the 1.4x factor would push 85% above 100%.
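
The capped thresholds can be sketched as below. This is a minimal illustration of the arithmetic in this message, not the gateway code itself: the function name and constant names are hypothetical, and only the ratios (0.85, 0.95, 1.4) and the two caps come from the description above.

```python
# Hypothetical names; the ratios and caps are the ones quoted in the commit message.
SAFETY_FACTOR = 1.4        # inflates thresholds because token estimates are rough
COMPRESS_RATIO = 0.85      # compress at 85% of context, before the safety factor
WARN_RATIO = 0.95          # warn at 95% of context, before the safety factor
COMPRESS_CAP_RATIO = 0.95  # the fix: compress threshold may not exceed 95% of context


def hygiene_thresholds(context_length: int) -> tuple[int, int]:
    """Return (compress_threshold, warn_threshold) in estimated tokens."""
    raw_compress = context_length * COMPRESS_RATIO * SAFETY_FACTOR
    raw_warn = context_length * WARN_RATIO * SAFETY_FACTOR
    # Without these caps a 200K model gets 238K and 266K, both above its limit.
    compress = min(raw_compress, context_length * COMPRESS_CAP_RATIO)
    warn = min(raw_warn, context_length)
    # round() rather than int() so float noise cannot shave off a token.
    return round(compress), round(warn)


if __name__ == "__main__":
    print(hygiene_thresholds(200_000))
```

For a 200K model this yields a compress threshold of 190K and a warn threshold of 200K, so compression can trigger before the API rejects the request.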

Ref: Discord report from Ddox — glm-5-turbo on z.ai coding plan

2026-03-22 15:15:37 -07:00

acp

fix(acp): preserve session provider when switching models

2026-03-21 15:54:10 -07:00

agent

fix(tests): resolve all consistently failing tests

2026-03-22 05:58:26 -07:00

cron

fix(cron): support Telegram topic delivery via platform:chat_id:thread_id format (#2455)

2026-03-22 04:18:28 -07:00

fakes

fix: streaming tool call parsing, error handling, and fake HA state mutation

2026-03-14 14:27:20 +03:00

gateway

fix(gateway): hygiene compression ignores config context_length and 1.4x exceeds model limit

2026-03-22 15:15:37 -07:00

hermes_cli

Merge pull request #2465 from NousResearch/hermes/hermes-31d7db3b

2026-03-22 04:56:48 -07:00

honcho_integration

feat(honcho): instance-local config via HERMES_HOME, default session strategy to per-directory

2026-03-21 09:34:00 -07:00

integration

feat(web): add Parallel as alternative web search/extract backend (#1696)

2026-03-17 04:02:02 -07:00

skills

fix: persist google oauth pkce for headless auth

2026-03-14 22:11:34 -07:00

tools

fix(mcp-oauth): port mismatch, path traversal, and shared handler state (salvage #2521) (#2552)

2026-03-22 15:02:26 -07:00

__init__.py

…

conftest.py

fix(approval): show full command in dangerous command approval (#1553)

2026-03-17 02:02:33 -07:00

run_interrupt_test.py

fix: thread safety for concurrent subagent delegation (#1672)

2026-03-17 02:53:33 -07:00

test_413_compression.py

feat: improve context compaction handoff summaries (#1273)

2026-03-14 02:33:31 -07:00

test_860_dedup.py

fix: eliminate 3x SQLite message duplication in gateway sessions (#860)

2026-03-10 15:22:44 -07:00

test_1630_context_overflow_loop.py

fix: prevent infinite 400 loop on context overflow + block prompt injection via cache files (#1630, #1558)

2026-03-17 01:50:59 -07:00

test_agent_guardrails.py

feat: pre-call sanitization and post-call tool guardrails (#1732)

2026-03-17 04:24:27 -07:00

test_agent_loop_tool_calling.py

fix: skip hanging tests + add global test timeout

2026-03-12 01:23:28 -07:00

test_agent_loop_vllm.py

test: restore vllm integration coverage and add dict-args regression

2026-03-15 08:02:29 -07:00

test_agent_loop.py

fix: salvage gateway dedup and executor cleanup from PR #993

2026-03-14 11:03:20 -07:00

test_anthropic_adapter.py

fix(prompt-caching): skip top-level cache_control on role:tool for OpenRouter

2026-03-21 16:54:43 -07:00

test_anthropic_error_handling.py

fix(anthropic): retry 429/529 errors and surface error details to users

2026-03-17 01:07:11 +03:00

test_anthropic_oauth_flow.py

fix: preflight Anthropic auth and prefer Claude store

2026-03-14 19:38:55 -07:00

test_anthropic_provider_persistence.py

fix: preflight Anthropic auth and prefer Claude store

2026-03-14 19:38:55 -07:00

test_api_key_providers.py

fix: resolve MiniMax 401 auth error by defaulting to anthropic_messages (#2103)

2026-03-19 17:47:05 -07:00

test_atomic_json_write.py

test: cover atomic temp cleanup on interrupts

2026-03-14 22:31:51 -07:00

test_atomic_yaml_write.py

test: cover atomic temp cleanup on interrupts

2026-03-14 22:31:51 -07:00

test_auth_codex_provider.py

…

test_auth_nous_provider.py

…

test_auxiliary_config_bridge.py

feat(compression): add summary_base_url + move compression config to YAML-only

2026-03-17 04:46:15 -07:00

test_batch_runner_checkpoint.py

fix: sanitize chat payloads and provider precedence

2026-03-13 23:59:12 -07:00

test_cli_approval_ui.py

fix(cli): repair dangerous command approval UI

2026-03-14 11:57:44 -07:00

test_cli_extension_hooks.py

refactor(cli): add protected TUI extension hooks for wrapper CLIs

2026-03-21 09:42:07 -07:00

test_cli_init.py

fix: skip model auto-detection for custom/local providers

2026-03-20 04:35:17 -07:00

test_cli_interrupt_subagent.py

fix: thread safety for concurrent subagent delegation (#1672)

2026-03-17 02:53:33 -07:00

test_cli_loading_indicator.py

fix(cli): add loading indicators for slow slash commands

2026-03-10 17:31:00 -07:00

test_cli_mcp_config_watch.py

fix: auto-reload MCP tools when mcp_servers config changes without restart (#1474)

2026-03-15 19:03:34 -07:00

test_cli_model_command.py

feat: auto-detect provider when switching models via /model (#1506)

2026-03-16 04:34:45 -07:00

test_cli_new_session.py

fix: complete session reset — missing compressor counters + test

2026-03-20 04:35:17 -07:00

test_cli_plan_command.py

fix: save /plan output in workspace (#1381)

2026-03-14 21:28:51 -07:00

test_cli_prefix_matching.py

feat: add /tools disable/enable/list slash commands with session reset (#1652)

2026-03-17 02:05:26 -07:00

test_cli_preloaded_skills.py

feat: preload CLI skills on launch (#1359)

2026-03-14 19:33:59 -07:00

test_cli_provider_resolution.py

feat: overhaul context length detection with models.dev and provider-aware resolution (#2158)

2026-03-20 06:04:33 -07:00

test_cli_retry.py

test: lock retry replacement semantics

2026-03-14 21:19:22 -07:00

test_cli_secret_capture.py

feat: secure skill env setup on load (core #688)

2026-03-13 03:14:04 -07:00

test_cli_skin_integration.py

fix(test): add missing voice state attrs to CLI stub in skin tests

2026-03-14 15:00:45 +03:00

test_cli_status_bar.py

feat: add route-aware pricing estimates (#1695)

2026-03-17 03:44:44 -07:00

test_cli_tools_command.py

feat: add /tools disable/enable/list slash commands with session reset (#1652)

2026-03-17 02:05:26 -07:00

test_codex_execution_paths.py

…

test_codex_models.py

fix: add codex forward-compat model listing

2026-03-13 21:34:01 -07:00

test_compression_boundary.py

fix(agent): prevent silent tool result loss during context compression (#1993)

2026-03-18 15:22:51 -07:00

test_context_pressure.py

feat: context pressure warnings for CLI and gateway (#2159)

2026-03-20 08:37:36 -07:00

test_context_references.py

feat: @ context references — inline file, folder, diff, git, and URL injection

2026-03-21 15:57:13 -07:00

test_context_token_tracking.py

fix(tests): resolve all consistently failing tests

2026-03-22 05:58:26 -07:00

test_dict_tool_call_args.py

test: restore vllm integration coverage and add dict-args regression

2026-03-15 08:02:29 -07:00

test_display.py

fix: add upstream guard for non-dict function_args + tests for build_tool_preview

2026-03-09 21:01:40 -07:00

test_evidence_store.py

feat: add OSS Security Forensics skill (Skills Hub) (#1482)

2026-03-15 21:59:53 -07:00

test_external_credential_detection.py

…

test_fallback_model.py

feat: upgrade MiniMax default to M2.7 + add new OpenRouter models

2026-03-18 02:42:58 -07:00

test_file_permissions.py

…

test_flush_memories_codex.py

fix: update all test mocks for call_llm migration

2026-03-11 21:06:54 -07:00

test_hermes_state.py

fix: search all sources by default in session_search (#1892)

2026-03-18 02:21:29 -07:00

test_honcho_client_config.py

…

test_insights.py

feat: add route-aware pricing estimates (#1695)

2026-03-17 03:44:44 -07:00

test_interactive_interrupt.py

fix: thread safety for concurrent subagent delegation (#1672)

2026-03-17 02:53:33 -07:00

test_interrupt_propagation.py

fix: thread safety for concurrent subagent delegation (#1672)

2026-03-17 02:53:33 -07:00

test_managed_server_tool_support.py

test: fix stale CI assumptions in parser and quick-command coverage (#1236)

2026-03-13 21:56:12 -07:00

test_minisweagent_path.py

fix: worktree-aware minisweagent path discovery + clean up requirements check (#1248)

2026-03-13 23:39:51 -07:00

test_model_metadata_local_ctx.py

fix: prefer loaded instance context size over max for LM Studio

2026-03-19 21:24:53 +01:00

test_model_provider_persistence.py

feat: integrate GitHub Copilot providers across Hermes

2026-03-17 23:40:22 -07:00

test_model_tools_async_bridge.py

fix: use per-thread persistent event loops in worker threads

2026-03-20 15:41:06 -04:00

test_model_tools.py

…

test_openai_client_lifecycle.py

fix: audit fixes — 5 bugs found and resolved

2026-03-16 06:35:46 -07:00

test_personality_none.py

…

test_plugins_cmd.py

feat(cli): add hermes plugins install/remove/list command

2026-03-21 09:47:33 -07:00

test_plugins.py

fix(tests): resolve all consistently failing tests

2026-03-22 05:58:26 -07:00

test_provider_parity.py

feat: add Vercel AI Gateway provider (#1628)

2026-03-17 00:12:16 -07:00

test_quick_commands.py

fix: thread safety for concurrent subagent delegation (#1672)

2026-03-17 02:53:33 -07:00

test_real_interrupt_subagent.py

fix: thread safety for concurrent subagent delegation (#1672)

2026-03-17 02:53:33 -07:00

test_reasoning_command.py

fix: /reasoning command — add gateway support, fix display, persist settings (#1031 )

2026-03-12 05:38:19 -07:00

test_redirect_stdout_issue.py

fix: use session_key instead of chat_id for adapter interrupt lookups

2026-03-12 08:35:45 -07:00

test_resume_display.py

…

test_run_agent_codex_responses.py

fix(codex): handle reasoning-only responses and replay path (#2070)

2026-03-19 10:34:44 -07:00

test_run_agent.py

fix: prevent Anthropic token leaking to third-party anthropic_messages providers (salvage #2383) (#2389)

2026-03-21 16:42:46 -07:00

test_runtime_provider_resolution.py

fix: respect DashScope v1 runtime mode for alibaba (#2459)

2026-03-22 04:24:43 -07:00

test_setup_model_selection.py

fix(setup): remove dead code causing is_coding_plan NameError crash

2026-03-13 04:42:26 +03:00

test_sql_injection.py

fix(security): eliminate SQL string formatting in execute() calls

2026-03-19 15:16:35 +01:00

test_streaming.py

fix: always fall back to non-streaming on ANY streaming error

2026-03-16 06:15:09 -07:00

test_timezone.py

fix: skip stale cron jobs on gateway restart instead of firing immediately

2026-03-16 23:48:14 -07:00

test_tool_call_parsers.py

fix(mistral-parser): handle nested JSON in fallback extraction

2026-03-21 09:41:17 -07:00

test_toolset_distributions.py

…

test_toolsets.py

fix: add missing Platform.SIGNAL to toolset mappings, update test + config docs

2026-03-09 23:27:19 -07:00

test_trajectory_compressor.py

fix: harden trajectory compressor summary content handling

2026-03-14 11:03:25 -07:00

test_worktree_security.py

fix: harden salvaged worktree include checks

2026-03-14 21:51:27 -07:00

test_worktree.py

fix: harden salvaged worktree include checks

2026-03-14 21:51:27 -07:00