hermes-agent/agent/model_metadata.py at bdccdd67a1c3f16aa4d15f700a9615bbc4d141f7

Files

Teknium 43af094ae3 fix(agent): include tool tokens in preflight estimate, guard context probe persistence (#3164 )

Two improvements salvaged from PR #2600 (paraddox):

1. Preflight compression now counts tool schema tokens alongside system
   prompt and messages.  With 50+ tools enabled, schemas can add 20-30K
   tokens that were previously invisible to the estimator, delaying
   compression until the API rejected the request.

2. Context probe persistence guard: when the agent steps down context
   tiers after a context-length error, only provider-confirmed numeric
   limits (parsed from the error message) are cached to disk.  Guessed
   fallback tiers from get_next_probe_tier() stay in-memory only,
   preventing wrong values from polluting the persistent cache.

Co-authored-by: paraddox <paraddox@users.noreply.github.com>

2026-03-26 02:00:50 -07:00

35 KiB

Raw Blame History

View Raw

35 KiB Raw Blame History

35 KiB

Raw Blame History