hermes-agent/tests at 85993fbb5a77d577f027bccecf84054906238aee - hermes-agent - Hermes Gitea

Timmy_Foundation/hermes-agent

Teknium 85993fbb5a feat: pre-call sanitization and post-call tool guardrails (#1732)

Salvage of PR #1321 by @alireza78a (cherry-picked concept, reimplemented
against current main).

Phase 1 — Pre-call message sanitization:
  _sanitize_api_messages() now runs unconditionally before every LLM call.
  Previously gated on context_compressor being present, so sessions loaded
  from disk or running without compression could accumulate dangling
  tool_call/tool_result pairs causing API errors.
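
The pairing rule in Phase 1 can be illustrated with a minimal sketch. It is not the repository's _sanitize_api_messages(); it assumes OpenAI-style chat messages (assistant messages carrying a "tool_calls" list with "id" fields, tool messages carrying "tool_call_id"), and the function name sanitize_messages_sketch is hypothetical. The real method may repair dangling entries differently, for example by inserting stub results instead of dropping calls.

```python
from typing import Any

Message = dict[str, Any]


def sanitize_messages_sketch(messages: list[Message]) -> list[Message]:
    """Hypothetical helper: keep only tool calls and tool results that pair up."""
    # IDs announced by assistant messages (assumed OpenAI-style shape).
    call_ids = {
        call["id"]
        for msg in messages
        if msg.get("role") == "assistant"
        for call in msg.get("tool_calls") or []
    }
    # IDs answered by tool messages.
    result_ids = {
        msg["tool_call_id"]
        for msg in messages
        if msg.get("role") == "tool" and "tool_call_id" in msg
    }
    paired = call_ids & result_ids

    cleaned: list[Message] = []
    for msg in messages:
        if msg.get("role") == "tool":
            # Drop results whose call no longer exists.
            if msg.get("tool_call_id") in paired:
                cleaned.append(msg)
            continue
        if msg.get("role") == "assistant" and msg.get("tool_calls"):
            # Drop calls that never received a result; copy instead of mutating.
            kept = [c for c in msg["tool_calls"] if c["id"] in paired]
            msg = {k: v for k, v in msg.items() if k != "tool_calls"}
            if kept:
                msg["tool_calls"] = kept
        cleaned.append(msg)
    return cleaned
```

Running a pass like this before every request, rather than only when a compressor is configured, is what keeps sessions restored from disk from sending unmatched entries to the API.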

Phase 2a — Delegate task cap:
  _cap_delegate_task_calls() truncates excess delegate_task calls per turn
  to MAX_CONCURRENT_CHILDREN. The existing cap in delegate_tool.py only
  limits the task array within a single call; this catches multiple
  separate delegate_task tool_calls in one turn.
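
A rough sketch of the per-turn cap described in Phase 2a follows. It assumes OpenAI-style tool calls with a "function" object holding "name"; the function name cap_delegate_calls_sketch and the max_children parameter are hypothetical stand-ins for _cap_delegate_task_calls() and MAX_CONCURRENT_CHILDREN, whose actual signature and value are not shown here.

```python
from typing import Any

ToolCall = dict[str, Any]


def cap_delegate_calls_sketch(
    tool_calls: list[ToolCall], max_children: int
) -> list[ToolCall]:
    """Hypothetical helper: keep at most max_children delegate_task calls."""
    kept: list[ToolCall] = []
    delegate_seen = 0
    for call in tool_calls:
        if call["function"]["name"] == "delegate_task":
            if delegate_seen >= max_children:
                # Excess delegate_task call in this turn: drop it.
                continue
            delegate_seen += 1
        # Non-delegate calls always pass through, in original order.
        kept.append(call)
    return kept
```

Only delegate_task calls count toward the limit in this sketch, so other tool calls in the same turn are unaffected.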

Phase 2b — Tool call deduplication:
  _deduplicate_tool_calls() drops duplicate (tool_name, arguments) pairs
  within a single turn when models stutter.
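
The deduplication in Phase 2b can be sketched as an order-preserving filter keyed on (tool_name, arguments). This is an illustration, not the repository's _deduplicate_tool_calls(): the name dedupe_tool_calls_sketch is hypothetical, the OpenAI-style call shape is assumed, and normalizing dict arguments with json.dumps is an assumption made so that the key is hashable.

```python
import json
from typing import Any

ToolCall = dict[str, Any]


def dedupe_tool_calls_sketch(tool_calls: list[ToolCall]) -> list[ToolCall]:
    """Hypothetical helper: drop repeated (name, arguments) pairs, keep the first."""
    seen: set[tuple[str, str]] = set()
    unique: list[ToolCall] = []
    for call in tool_calls:
        fn = call["function"]
        args = fn.get("arguments")
        if not isinstance(args, str):
            # Assumption: dict arguments are compared by canonical JSON form.
            args = json.dumps(args, sort_keys=True)
        key = (fn["name"], args)
        if key in seen:
            continue
        seen.add(key)
        unique.append(call)
    return unique
```

Keeping the first occurrence preserves the order the model emitted, which matters if later tool results are matched back to calls by position or ID.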

All three are static methods on AIAgent, independently testable.
29 tests covering happy paths and edge cases.

2026-03-17 04:24:27 -07:00

feat(acp): support slash commands in ACP adapter (#1532)

2026-03-16 05:19:36 -07:00

test: align Hermes setup and full-suite expectations (#1710)

2026-03-17 04:01:37 -07:00

fix: skip stale cron jobs on gateway restart instead of firing immediately

2026-03-16 23:48:14 -07:00

…

fix(gateway): persist watcher metadata in checkpoint for crash recovery (#1706)

2026-03-17 03:52:15 -07:00

fix(update): use .[all] extras with fallback in hermes update (#1728)

2026-03-17 04:22:37 -07:00

honcho_integration

test: align Hermes setup and full-suite expectations (#1710)

2026-03-17 04:01:37 -07:00

feat(web): add Parallel as alternative web search/extract backend (#1696)

2026-03-17 04:02:02 -07:00

fix: persist google oauth pkce for headless auth

2026-03-14 22:11:34 -07:00

feat(web): add Parallel as alternative web search/extract backend (#1696)

2026-03-17 04:02:02 -07:00

__init__.py

…

conftest.py

fix(approval): show full command in dangerous command approval (#1553)

2026-03-17 02:02:33 -07:00

run_interrupt_test.py

fix: thread safety for concurrent subagent delegation (#1672)

2026-03-17 02:53:33 -07:00

test_413_compression.py

…

test_860_dedup.py

…

test_1630_context_overflow_loop.py

fix: prevent infinite 400 loop on context overflow + block prompt injection via cache files (#1630, #1558)

2026-03-17 01:50:59 -07:00

test_agent_guardrails.py

feat: pre-call sanitization and post-call tool guardrails (#1732)

2026-03-17 04:24:27 -07:00

test_agent_loop_tool_calling.py

…

test_agent_loop_vllm.py

test: restore vllm integration coverage and add dict-args regression

2026-03-15 08:02:29 -07:00

test_agent_loop.py

…

test_anthropic_adapter.py

fix: isolate test_anthropic_adapter from local credentials

2026-03-16 22:53:32 -07:00

test_anthropic_error_handling.py

fix(anthropic): retry 429/529 errors and surface error details to users

2026-03-17 01:07:11 +03:00

test_anthropic_oauth_flow.py

fix: preflight Anthropic auth and prefer Claude store

2026-03-14 19:38:55 -07:00

test_anthropic_provider_persistence.py

fix: preflight Anthropic auth and prefer Claude store

2026-03-14 19:38:55 -07:00

test_api_key_providers.py

test: align Hermes setup and full-suite expectations (#1710)

2026-03-17 04:01:37 -07:00

test_atomic_json_write.py

test: cover atomic temp cleanup on interrupts

2026-03-14 22:31:51 -07:00

test_atomic_yaml_write.py

test: cover atomic temp cleanup on interrupts

2026-03-14 22:31:51 -07:00

test_auth_codex_provider.py

…

test_auth_nous_provider.py

…

test_auxiliary_config_bridge.py

feat: add direct endpoint overrides for auxiliary and delegation

2026-03-14 21:11:37 -07:00

test_batch_runner_checkpoint.py

…

test_cli_approval_ui.py

fix(cli): repair dangerous command approval UI

2026-03-14 11:57:44 -07:00

test_cli_init.py

…

test_cli_interrupt_subagent.py

fix: thread safety for concurrent subagent delegation (#1672)

2026-03-17 02:53:33 -07:00

test_cli_loading_indicator.py

…

test_cli_mcp_config_watch.py

fix: auto-reload MCP tools when mcp_servers config changes without restart (#1474)

2026-03-15 19:03:34 -07:00

test_cli_model_command.py

feat: auto-detect provider when switching models via /model (#1506)

2026-03-16 04:34:45 -07:00

test_cli_new_session.py

…

test_cli_plan_command.py

fix: save /plan output in workspace (#1381)

2026-03-14 21:28:51 -07:00

test_cli_prefix_matching.py

feat: add /tools disable/enable/list slash commands with session reset (#1652)

2026-03-17 02:05:26 -07:00

test_cli_preloaded_skills.py

feat: preload CLI skills on launch (#1359)

2026-03-14 19:33:59 -07:00

test_cli_provider_resolution.py

fix: hermes update causes dual gateways on macOS (launchd) (#1567)

2026-03-16 12:36:29 -07:00

test_cli_retry.py

test: lock retry replacement semantics

2026-03-14 21:19:22 -07:00

test_cli_secret_capture.py

…

test_cli_skin_integration.py

…

test_cli_status_bar.py

feat: add route-aware pricing estimates (#1695)

2026-03-17 03:44:44 -07:00

test_cli_tools_command.py

feat: add /tools disable/enable/list slash commands with session reset (#1652)

2026-03-17 02:05:26 -07:00

test_codex_execution_paths.py

…

test_codex_models.py

…

test_context_token_tracking.py

fix: context counter shows cached token count in status bar

2026-03-17 05:06:11 +03:00

test_dict_tool_call_args.py

test: restore vllm integration coverage and add dict-args regression

2026-03-15 08:02:29 -07:00

test_display.py

…

test_evidence_store.py

feat: add OSS Security Forensics skill (Skills Hub) (#1482)

2026-03-15 21:59:53 -07:00

test_external_credential_detection.py

…

test_fallback_model.py

…

test_file_permissions.py

…

test_flush_memories_codex.py

…

test_hermes_state.py

feat: add route-aware pricing estimates (#1695)

2026-03-17 03:44:44 -07:00

test_honcho_client_config.py

…

test_insights.py

feat: add route-aware pricing estimates (#1695)

2026-03-17 03:44:44 -07:00

test_interactive_interrupt.py

fix: thread safety for concurrent subagent delegation (#1672)

2026-03-17 02:53:33 -07:00

test_interrupt_propagation.py

fix: thread safety for concurrent subagent delegation (#1672)

2026-03-17 02:53:33 -07:00

test_managed_server_tool_support.py

…

test_minisweagent_path.py

…

test_model_provider_persistence.py

…

test_model_tools.py

…

test_openai_client_lifecycle.py

fix: audit fixes — 5 bugs found and resolved

2026-03-16 06:35:46 -07:00

test_personality_none.py

…

test_plugins.py

feat: first-class plugin architecture (#1555)

2026-03-16 07:17:36 -07:00

test_provider_parity.py

feat: add Vercel AI Gateway provider (#1628)

2026-03-17 00:12:16 -07:00

test_quick_commands.py

fix: thread safety for concurrent subagent delegation (#1672)

2026-03-17 02:53:33 -07:00

test_real_interrupt_subagent.py

fix: thread safety for concurrent subagent delegation (#1672)

2026-03-17 02:53:33 -07:00

test_reasoning_command.py

…

test_redirect_stdout_issue.py

…

test_resume_display.py

…

test_run_agent_codex_responses.py

feat: allow custom endpoints to use responses API via api_mode override (#1651)

2026-03-17 02:04:36 -07:00

test_run_agent.py

fix: audit fixes — 5 bugs found and resolved

2026-03-16 06:35:46 -07:00

test_runtime_provider_resolution.py

refactor: tie api_mode to provider config instead of env var (#1656)

2026-03-17 02:13:26 -07:00

test_setup_model_selection.py

…

test_streaming.py

fix: always fall back to non-streaming on ANY streaming error

2026-03-16 06:15:09 -07:00

test_timezone.py

fix: skip stale cron jobs on gateway restart instead of firing immediately

2026-03-16 23:48:14 -07:00

test_tool_call_parsers.py

…

test_toolset_distributions.py

…

test_toolsets.py

…

test_trajectory_compressor.py

…

test_worktree_security.py

fix: harden salvaged worktree include checks

2026-03-14 21:51:27 -07:00

test_worktree.py

fix: harden salvaged worktree include checks

2026-03-14 21:51:27 -07:00