hermes-agent

Timmy_Foundation/hermes-agent

Fork 0

Commit Graph

Author	SHA1	Message	Date
Alexander Whitestone	95bb842a21	feat: Wire Gemma 4 vision into browser_tool for screenshot analysis All checks were successful Lint / lint (pull_request) Successful in 8s Details Default browser_vision screenshots to google/gemma-4-27b-it (Gemma 4 native multimodal) for reduced latency and unified text+vision model. Resolution order for _get_vision_model(): 1. BROWSER_VISION_MODEL env var (new, browser-specific override) 2. auxiliary.browser_vision.model in config.yaml (new config key) 3. AUXILIARY_VISION_MODEL env var (existing global vision override) 4. Default: google/gemma-4-27b-it Backward compatibility: existing AUXILIARY_VISION_MODEL users are unaffected — their override still flows through to browser_vision. Also documents the new auxiliary.browser_vision config section in cli-config.yaml.example and adds 14 unit tests covering the full priority chain. Fixes #816 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-21 17:14:32 -04:00
Alexander Whitestone	b6398b8b0d	feat: wire Gemma 4 vision into browser_tool for screenshot analysis All checks were successful Lint / lint (pull_request) Successful in 19s Details Default browser screenshot analysis now uses Gemma 4 27B (google/gemma-4-27b-it) instead of deferring to the auxiliary router's auto-detection. Gemma 4 is natively multimodal — the same model family already in use for text tasks — which avoids cold-start model-switching overhead and improves context continuity. Resolution order for _get_vision_model(): 1. BROWSER_VISION_MODEL env var (browser-specific override) 2. auxiliary.browser_vision.model in config.yaml 3. AUXILIARY_VISION_MODEL env var (shared/legacy override) 4. google/gemma-4-27b-it (new default) - Add _BROWSER_VISION_DEFAULT_MODEL constant to browser_tool.py - Document auxiliary.browser_vision config key in cli-config.yaml.example - Add 10 unit tests covering all resolution steps Fixes #816 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-21 12:49:46 -04:00

Author

SHA1

Message

Date

Alexander Whitestone

95bb842a21

feat: Wire Gemma 4 vision into browser_tool for screenshot analysis

Lint / lint (pull_request) Successful in 8s

Details

Default browser_vision screenshots to google/gemma-4-27b-it (Gemma 4
native multimodal) for reduced latency and unified text+vision model.

Resolution order for _get_vision_model():
1. BROWSER_VISION_MODEL env var (new, browser-specific override)
2. auxiliary.browser_vision.model in config.yaml (new config key)
3. AUXILIARY_VISION_MODEL env var (existing global vision override)
4. Default: google/gemma-4-27b-it

Backward compatibility: existing AUXILIARY_VISION_MODEL users are
unaffected — their override still flows through to browser_vision.

Also documents the new auxiliary.browser_vision config section in
cli-config.yaml.example and adds 14 unit tests covering the full
priority chain.

Fixes #816

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

2026-04-21 17:14:32 -04:00

Alexander Whitestone

b6398b8b0d

feat: wire Gemma 4 vision into browser_tool for screenshot analysis

Lint / lint (pull_request) Successful in 19s

Details

Default browser screenshot analysis now uses Gemma 4 27B
(google/gemma-4-27b-it) instead of deferring to the auxiliary router's
auto-detection.  Gemma 4 is natively multimodal — the same model family
already in use for text tasks — which avoids cold-start model-switching
overhead and improves context continuity.

Resolution order for _get_vision_model():
  1. BROWSER_VISION_MODEL env var (browser-specific override)
  2. auxiliary.browser_vision.model in config.yaml
  3. AUXILIARY_VISION_MODEL env var (shared/legacy override)
  4. google/gemma-4-27b-it (new default)

- Add _BROWSER_VISION_DEFAULT_MODEL constant to browser_tool.py
- Document auxiliary.browser_vision config key in cli-config.yaml.example
- Add 10 unit tests covering all resolution steps

Fixes #816

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

2026-04-21 12:49:46 -04:00

2 Commits