hermes-agent/tests at b73ebfee302a59a5892b0e521fd7eb8b6ae1d7ba - hermes-agent - Hermes Gitea

Timmy_Foundation/hermes-agent

Files

History

kshitij c14b3b5880 fix(kimi): force fixed temperature on kimi-k2.* models (k2.5, thinking, turbo) (#12144 )

* fix(kimi): force fixed temperature on kimi-k2.* models (k2.5, thinking, turbo)

The prior override only matched the literal model name "kimi-for-coding",
but Moonshot's coding endpoint is hit with real model IDs such as
`kimi-k2.5`, `kimi-k2-turbo-preview`, `kimi-k2-thinking`, etc.  Those
requests bypassed the override and kept the caller's temperature, so
Moonshot returns HTTP 400 "invalid temperature: only 0.6 is allowed for
this model" (or 1.0 for thinking variants).

Match the whole kimi-k2.* family:
  * kimi-k2-thinking / kimi-k2-thinking-turbo -> 1.0 (thinking mode)
  * all other kimi-k2.* -> 0.6 (non-thinking / instant mode)

Also accept an optional vendor prefix (e.g. `moonshotai/kimi-k2.5`) so
aggregator routings are covered.

* refactor(kimi): whitelist-match kimi coding models instead of prefix

Addresses review feedback on PR #12144.

- Replace `startswith("kimi-k2")` with explicit frozensets sourced from
  Moonshot's kimi-for-coding model list.  The prefix match would have also
  clamped `kimi-k2-instruct` / `kimi-k2-instruct-0905`, which are the
  separate non-coding K2 family with variable temperature (recommended 0.6
  but not enforced — see huggingface.co/moonshotai/Kimi-K2-Instruct).
- Confirmed via platform.kimi.ai docs that all five coding models
  (k2.5, k2-turbo-preview, k2-0905-preview, k2-thinking, k2-thinking-turbo)
  share the fixed-temperature lock, so the preview-model mapping is no
  longer an assumption.
- Drop the fragile `"thinking" in bare` substring test for a set lookup.
- Log a debug line on each override so operators can see when Hermes
  silently rewrites temperature.
- Update class docstring.  Extend the negative test to parametrize over
  kimi-k2-instruct, Kimi-K2-Instruct-0905, and a hypothetical future
  kimi-k2-experimental name — all must keep the caller's temperature.

2026-04-18 09:35:51 -07:00

..

fix(acp): improve zed integration

2026-04-17 13:29:26 -07:00

fix(kimi): force fixed temperature on kimi-k2.* models (k2.5, thinking, turbo) (#12144 )

2026-04-18 09:35:51 -07:00

test: update stale tests to match current code (#11963 )

2026-04-17 21:35:30 -07:00

feat(cron+tests): extend origin fallback to email/dingtalk/qqbot + fix Weixin test mocks

2026-04-17 06:26:43 -07:00

refactor: extract shared helpers to deduplicate repeated code patterns (#7917 )

2026-04-11 13:59:52 -07:00

environments/benchmarks

…

…

feat(steer): /steer <prompt> injects a mid-run note after the next tool call (#12116 )

2026-04-18 04:17:18 -07:00

feat(execute_code): add project/strict execution modes, default to project (#11971 )

2026-04-18 01:46:25 -07:00

fix(honcho): strip whitespace from conclusion and delete_id inputs

2026-04-16 09:50:10 -07:00

fix(discord): strip RTP padding before DAVE/Opus decode (#11267 )

2026-04-16 16:50:15 -07:00

test: speed up slow tests (backoff + subprocess + IMDS network) (#11797 )

2026-04-17 14:21:22 -07:00

feat(steer): /steer <prompt> injects a mid-run note after the next tool call (#12116 )

2026-04-18 04:17:18 -07:00

fix(google-workspace): normalize authorized user token writes

2026-04-16 04:22:16 -07:00

feat(execute_code): add project/strict execution modes, default to project (#11971 )

2026-04-18 01:46:25 -07:00

fix(tui): review follow-up — /retry, /plan, ANSI truncation, caching

2026-04-18 09:30:48 -07:00

__init__.py

…

conftest.py

Support browser CDP URL from config

2026-04-17 16:05:04 -07:00

run_interrupt_test.py

…

test_batch_runner_checkpoint.py

…

test_cli_file_drop.py

fix(gateway): reject file paths in get_command() + file-drop tests (#7356 )

2026-04-10 13:06:02 -07:00

test_cli_skin_integration.py

…

test_ctx_halving_fix.py

fix(tests): fix 78 CI test failures and remove dead test (#9036 )

2026-04-13 10:50:24 -07:00

test_empty_model_fallback.py

fix: fall back to provider's default model when model config is empty (#8303 )

2026-04-12 03:53:30 -07:00

test_evidence_store.py

…

test_hermes_constants.py

fix(gateway): harden Docker/container gateway pathway

2026-04-12 16:36:11 -07:00

test_hermes_logging.py

fix(tests): fix 78 CI test failures and remove dead test (#9036 )

2026-04-13 10:50:24 -07:00

test_hermes_state.py

test(session-search): regression coverage for CJK LIKE fallback

2026-04-18 01:57:57 -07:00

test_honcho_client_config.py

…

test_ipv4_preference.py

feat: add network.force_ipv4 config to fix IPv6 timeout issues (#8196 )

2026-04-11 23:12:11 -07:00

test_mcp_serve.py

…

test_mini_swe_runner.py

fix(kimi): cover remaining fixed-temperature bypasses

2026-04-17 20:25:42 -07:00

test_minisweagent_path.py

…

test_model_picker_scroll.py

…

test_model_tools_async_bridge.py

…

test_model_tools.py

feat(plugins): let pre_tool_call hooks block tool execution

2026-04-13 22:01:49 -07:00

test_ollama_num_ctx.py

…

test_packaging_metadata.py

…

test_plugin_skills.py

fix(tests): attach caplog to specific logger in 3 order-dependent tests (#11453 )

2026-04-17 00:20:40 -07:00

test_project_metadata.py

build(deps): add qrcode to dingtalk + feishu extras (parity with messaging) (#11627 )

2026-04-17 13:31:53 -07:00

test_retry_utils.py

…

test_sql_injection.py

…

test_subprocess_home_isolation.py

fix: per-profile subprocess HOME isolation (#4426 ) (#7357 )

2026-04-10 13:37:45 -07:00

test_timezone.py

test: speed up slow tests (backoff + subprocess + IMDS network) (#11797 )

2026-04-17 14:21:22 -07:00

test_toolset_distributions.py

…

test_toolsets.py

fix(mcp): make server aliases explicit

2026-04-14 17:19:20 -07:00

test_trajectory_compressor_async.py

fix(kimi): cover remaining fixed-temperature bypasses

2026-04-17 20:25:42 -07:00

test_trajectory_compressor.py

fix(kimi): cover remaining fixed-temperature bypasses

2026-04-17 20:25:42 -07:00

test_tui_gateway_server.py

feat(steer): /steer <prompt> injects a mid-run note after the next tool call (#12116 )

2026-04-18 04:17:18 -07:00

test_utils_truthy_values.py

…