hermes-agent/tests at f32105b3b9be366eee8d918e049b9c2ff41b72dd - hermes-agent - Hermes Gitea

Timmy_Foundation/hermes-agent

Files

History

Alexander Whitestone f32105b3b9

Contributor Attribution Check / check-attribution (pull_request) Failing after 34s

Details

Docker Build and Publish / build-and-push (pull_request) Has been skipped

Details

Docs Site Checks / docs-site-checks (pull_request) Failing after 2m44s

Details

Nix / nix (ubuntu-latest) (pull_request) Failing after 7s

Details

Supply Chain Audit / Scan PR for supply chain risks (pull_request) Successful in 31s

Details

Tests / e2e (pull_request) Successful in 3m1s

Details

Tests / test (pull_request) Failing after 39m39s

Details

Nix / nix (macos-latest) (pull_request) Has been cancelled

Details

feat: adaptive context compression thresholds (Phase 1 of Context vs RAG decision framework)

Instead of compressing at a hardcoded 50% of context length,
the threshold now adapts to model capacity:

- 500K+ context → 75% threshold (large-context models breathe)
- 200K-499K   → 65%
- 128K-199K   → 55%
- < 128K      → 50% (unchanged default, backward compatible)

Impact: Claude Opus (1M context) gets 750K working tokens
instead of 500K. MiMo v2 Pro same. Small models unaffected.

Explicit threshold_percent parameter still works (overrides
adaptive). update_model() also recomputes adaptive threshold.

Research: See ~/.timmy/research-backlog.md item #4.3 (Ratio: 4.0)
Paper refs: KIVI (2402.02750), SnapKV (2404.14469),
  Self-RAG (2310.11511), Long Context vs RAG survey (2407.16833)

2026-04-15 08:26:50 -04:00

..

fix(acp): declare session load and resume capabilities in initialize response (#6985 )

2026-04-10 03:45:36 -07:00

feat: adaptive context compression thresholds (Phase 1 of Context vs RAG decision framework)

2026-04-15 08:26:50 -04:00

fix: resolve CI test failures — add missing functions, fix stale tests (#9483 )

2026-04-14 01:43:45 -07:00

feat(cron): support Discord thread_id in deliver targets

2026-04-10 03:20:05 -07:00

refactor: extract shared helpers to deduplicate repeated code patterns (#7917 )

2026-04-11 13:59:52 -07:00

environments/benchmarks

fix(security): consolidated security hardening — SSRF, timing attack, tar traversal, credential leakage (#5944 )

2026-04-07 17:28:37 -07:00

…

fix: gateway reconnect drops active cron job notifications (#744 )

2026-04-15 00:04:13 -04:00

fix: hermes gateway restart waits for service to come back up (#8260 )

2026-04-14 17:12:58 -07:00

feat(honcho): add opt-in initOnSessionStart for tools mode and respect explicit peerName (#6995 )

2026-04-11 00:43:27 -07:00

refactor: remove dead code — 1,784 lines across 77 files (#9180 )

2026-04-13 16:32:04 -07:00

feat: sort tool search results by score and add corresponding unit test

2026-04-14 10:49:35 -07:00

fix: clamp 'minimal' reasoning effort to 'low' on Responses API (#9429 )

2026-04-13 23:11:13 -07:00

fix(migration): don't auto-archive OpenClaw source directory

2026-04-12 00:33:54 -07:00

fix: deploy Qwen2.5-7B for local crisis support (closes #668 )

2026-04-14 23:04:15 -04:00

__init__.py

…

conftest.py

fix(tests): fix several failing/flaky tests on main (#6777 )

2026-04-09 13:17:06 -07:00

run_interrupt_test.py

fix: thread safety for concurrent subagent delegation (#1672 )

2026-03-17 02:53:33 -07:00

test_batch_runner_checkpoint.py

…

test_cli_file_drop.py

fix(gateway): reject file paths in get_command() + file-drop tests (#7356 )

2026-04-10 13:06:02 -07:00

test_cli_skin_integration.py

fix: CLI/UX batch — ChatConsole errors, curses scroll, skin-aware banner, git state banner (#5974 )

2026-04-07 17:59:42 -07:00

test_ctx_halving_fix.py

fix(tests): fix 78 CI test failures and remove dead test (#9036 )

2026-04-13 10:50:24 -07:00

test_empty_model_fallback.py

fix: fall back to provider's default model when model config is empty (#8303 )

2026-04-12 03:53:30 -07:00

test_evidence_store.py

feat: add OSS Security Forensics skill (Skills Hub) (#1482 )

2026-03-15 21:59:53 -07:00

test_hermes_constants.py

fix(gateway): harden Docker/container gateway pathway

2026-04-12 16:36:11 -07:00

test_hermes_logging.py

fix(tests): fix 78 CI test failures and remove dead test (#9036 )

2026-04-13 10:50:24 -07:00

test_hermes_state.py

fix(state): orphan children instead of cascade-deleting in prune/delete (#6513 )

2026-04-09 02:41:56 -07:00

test_honcho_client_config.py

feat(memory): pluggable memory provider interface with profile isolation, review fixes, and honcho CLI restoration (#4623 )

2026-04-02 15:33:51 -07:00

test_ipv4_preference.py

feat: add network.force_ipv4 config to fix IPv6 timeout issues (#8196 )

2026-04-11 23:12:11 -07:00

test_mcp_serve.py

feat: add MCP server mode — hermes mcp serve (#3795 )

2026-03-29 15:47:19 -07:00

test_minisweagent_path.py

chore: remove all remaining mini-swe-agent references

2026-03-24 08:19:23 -07:00

test_model_picker_scroll.py

fix: CLI/UX batch — ChatConsole errors, curses scroll, skin-aware banner, git state banner (#5974 )

2026-04-07 17:59:42 -07:00

test_model_tools_async_bridge.py

fix: use per-thread persistent event loops in worker threads

2026-03-20 15:41:06 -04:00

test_model_tools.py

feat(plugins): let pre_tool_call hooks block tool execution

2026-04-13 22:01:49 -07:00

test_ollama_num_ctx.py

fix: provider/model resolution — salvage 4 PRs + MiniMax aux URL fix (#5983 )

2026-04-07 22:23:28 -07:00

test_packaging_metadata.py

chore: prepare Hermes for Homebrew packaging (#4099 )

2026-03-30 17:34:43 -07:00

test_plugin_skills.py

feat(plugins): namespaced skill registration for plugin skill bundles

2026-04-14 10:42:58 -07:00

test_project_metadata.py

refactor(matrix): swap matrix-nio for mautrix-python dependency

2026-04-10 21:15:59 -07:00

test_retry_utils.py

feat(agent): add jittered retry backoff

2026-04-08 00:41:36 -07:00

test_sql_injection.py

fix(security): eliminate SQL string formatting in execute() calls

2026-03-19 15:16:35 +01:00

test_subprocess_home_isolation.py

fix: per-profile subprocess HOME isolation (#4426 ) (#7357 )

2026-04-10 13:37:45 -07:00

test_timezone.py

fix: remove 115 verified dead code symbols across 46 production files

2026-04-10 03:44:43 -07:00

test_toolset_distributions.py

…

test_toolsets.py

fix(mcp): make server aliases explicit

2026-04-14 17:19:20 -07:00

test_trajectory_compressor_async.py

fix(tests): fix 78 CI test failures and remove dead test (#9036 )

2026-04-13 10:50:24 -07:00

test_trajectory_compressor.py

fix: load credentials from HERMES_HOME .env in trajectory_compressor

2026-04-14 10:24:19 -07:00

test_utils_truthy_values.py

Gate tool-gateway behind an env var, so it's not in users' faces until we're ready. Even if users enable it, it'll be blocked server-side for now, until we unlock for non-admin users on tool-gateway.

2026-03-30 13:28:10 +09:00