hermes-agent/agent/model_metadata.py at 900e848522091bcdc82bdc33ebd6055be04bc1e2


Test 900e848522 fix: infer provider from base URL for models.dev context length lookup

Custom endpoint users (DashScope/Alibaba, Z.AI, Kimi, DeepSeek, etc.)
get wrong context lengths because their provider resolves as "openrouter"
or "custom", skipping the models.dev lookup entirely. For example,
qwen3.5-plus on DashScope falls back to the generic "qwen" hardcoded
default (131K) instead of the correct 1M.

Add _infer_provider_from_url() that maps known API hostnames to their
models.dev provider IDs. When the explicit provider is generic
(openrouter/custom/empty), infer from the base URL before the models.dev
lookup. This resolves context lengths correctly for DashScope, Z.AI,
Kimi, MiniMax, DeepSeek, and Nous endpoints without requiring users to
manually set context_length in config.

Also refactor _is_known_provider_base_url() to use the same URL mapping,
removing the duplicated hostname list.

2026-03-20 11:57:24 -07:00

34 KiB
