Images pasted in the CLI were embedded as raw base64 image_url content parts in the conversation history, which only works with vision-capable models. If the main model (e.g., the Nous API) doesn't support vision, this breaks the request and poisons all subsequent messages. The CLI now uses the same approach as the messaging gateway: images are pre-processed through the auxiliary vision model (Gemini Flash via OpenRouter or Nous Portal) and converted to text descriptions. The local file path is included so the agent can re-examine the image via vision_analyze if needed. This works with any model. Fixes #638.
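A minimal sketch of the new flow, assuming a hypothetical describeImage helper and an injected auxiliary-vision client (the names and shapes here are illustrative, not the actual CLI internals):

```typescript
// Sketch only: describeImage and the auxVision client are hypothetical
// stand-ins for the CLI's actual image pre-processing path.

interface ContentPart {
  type: "text" | "image_url";
  text?: string;
  image_url?: { url: string };
}

// Minimal interface for the auxiliary vision model (e.g. Gemini Flash
// behind OpenRouter or Nous Portal): takes a data URL, returns a caption.
interface AuxVisionClient {
  describe(dataUrl: string): Promise<string>;
}

// Instead of embedding a base64 image_url part (which breaks requests to
// non-vision models), convert the pasted image to a plain-text part.
async function describeImage(
  filePath: string,
  base64Png: string,
  auxVision: AuxVisionClient,
): Promise<ContentPart> {
  const description = await auxVision.describe(
    `data:image/png;base64,${base64Png}`,
  );
  // Keep the local file path in the text so the agent can re-examine the
  // image later (e.g. via a vision_analyze tool) if it needs more detail.
  return {
    type: "text",
    text: `[Image: ${filePath}]\n${description}`,
  };
}
```

Because the resulting content part is ordinary text, it is safe to leave in the conversation history regardless of which model handles later turns.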