- Introduced a prompt-caching strategy that cuts input token costs by ~75% on multi-turn conversations by caching the conversation prefix.
- Added functions that apply cache control markers to messages so the cached prefix is reused across turns.
- Updated AIAgent to auto-enable prompt caching for Claude models, with a configurable cache TTL.
- Enhanced logging to report cache hit statistics when caching is active, making token usage easier to monitor.
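As a rough illustration of the marker-application step, the sketch below shows one way to tag the final content block of a message list with Anthropic's `cache_control` field so everything up to that point becomes a cacheable prefix. The helper name `apply_cache_control` and the in-place-copy behavior are assumptions for this example, not the PR's actual API:

```python
def apply_cache_control(messages):
    """Mark the last content block of the final message as a cache breakpoint.

    Hypothetical helper: the real function names in this PR may differ.
    Returns a deep-enough copy; the input list is left unmodified.
    """
    if not messages:
        return messages
    marked = [dict(m) for m in messages]
    last = marked[-1]
    content = last["content"]
    # The API accepts either a plain string or a list of content blocks;
    # normalize to block form so we can attach cache_control.
    if isinstance(content, str):
        content = [{"type": "text", "text": content}]
    else:
        content = [dict(block) for block in content]
    content[-1] = {**content[-1], "cache_control": {"type": "ephemeral"}}
    last["content"] = content
    return marked


if __name__ == "__main__":
    history = [
        {"role": "user", "content": "What is prompt caching?"},
        {"role": "assistant", "content": "It reuses a stored prefix."},
    ]
    cached = apply_cache_control(history)
    print(cached[-1]["content"][-1]["cache_control"])
```

On a cache hit, the API response's usage block reports `cache_read_input_tokens`, which is the figure the new logging would surface; cache TTL configuration would map onto the provider's cache lifetime options.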