[LOCAL] Add ollama as first-class provider — blocks local inference story #169

New Issue

Timmy · 2026-04-07T02:47:32Z

Timmy commented

2026-04-07 02:47:32 +00:00

Problem

Hermes agent doesn't recognize 'ollama' as a provider. The CLI only accepts: auto, openrouter, nous, openai-codex, copilot-acp, copilot, anthropic, gemini, huggingface, zai, kimi-coding, minimax, minimax-cn, kilocode.

When config.yaml sets provider: ollama, the runtime falls back to OpenRouter which rejects local model names.

Impact

Blocks ALL local inference. Can't run Gemma4, Hermes3, or Hermes4 through the agent harness despite Ollama working perfectly at the API level (33.8 tok/s on Gemma4 with tool-use).

Acceptance Criteria

hermes chat -m gemma4 --provider ollama works
Routes to http://localhost:11434/v1 (configurable via config.yaml)
Tool-use works (function calling via OpenAI-compat endpoint)
Profile gemma4-local can run a full agent session locally
No cloud calls made during a local-only session

Implementation Hint

Add 'ollama' to provider choices in hermes_cli/main.py argument parser
Add ollama credential resolution in hermes_cli/auth.py (api_key='ollama', base_url from config)
Add ollama to model list in hermes_cli/models.py
Ollama uses standard OpenAI-compat API, so the client code already works

## Problem Hermes agent doesn't recognize 'ollama' as a provider. The CLI only accepts: auto, openrouter, nous, openai-codex, copilot-acp, copilot, anthropic, gemini, huggingface, zai, kimi-coding, minimax, minimax-cn, kilocode. When config.yaml sets `provider: ollama`, the runtime falls back to OpenRouter which rejects local model names. ## Impact Blocks ALL local inference. Can't run Gemma4, Hermes3, or Hermes4 through the agent harness despite Ollama working perfectly at the API level (33.8 tok/s on Gemma4 with tool-use). ## Acceptance Criteria - [ ] `hermes chat -m gemma4 --provider ollama` works - [ ] Routes to `http://localhost:11434/v1` (configurable via config.yaml) - [ ] Tool-use works (function calling via OpenAI-compat endpoint) - [ ] Profile `gemma4-local` can run a full agent session locally - [ ] No cloud calls made during a local-only session ## Implementation Hint - Add 'ollama' to provider choices in `hermes_cli/main.py` argument parser - Add ollama credential resolution in `hermes_cli/auth.py` (api_key='ollama', base_url from config) - Add ollama to model list in `hermes_cli/models.py` - Ollama uses standard OpenAI-compat API, so the client code already works

Timmy self-assigned this 2026-04-07 02:47:32 +00:00

Timmy referenced this issue from a commit

2026-04-07 03:01:11 +00:00

feat: add ollama as first-class provider for local model inference (#169)

Timmy referenced a pull request that will close this issue

2026-04-07 03:01:23 +00:00

feat: add ollama as first-class provider for local model inference (#169) #170

claude referenced this issue from a commit

2026-04-07 06:21:40 +00:00

feat: add ollama as first-class provider for local model inference (#169)

claude referenced a pull request that will close this issue

2026-04-07 06:21:49 +00:00

rescue: add ollama as first-class provider for local model inference (cherry-pick from #170) #179

Timmy referenced this issue from a commit

2026-04-07 15:56:02 +00:00

feat(provider): first-class Ollama support + Gemma 4 defaults (#169)

Timmy closed this issue

2026-04-07 15:56:51 +00:00

Timmy commented

2026-04-07 15:57:07 +00:00

Bezalel delivered (2026-04-07):

✅ hermes_cli/main.py: --provider ollama added to CLI choices
✅ hermes_cli/auth.py: ollama alias maps to canonical provider ollama (not custom)
✅ agent/auxiliary_client.py: resolve_provider_client routes ollama to http://localhost:11434/v1 with default gemma4:12b
✅ agent/auxiliary_client.py: _try_ollama added to auto-detection chain (priority: after Nous, before local/custom)
✅ agent/auxiliary_client.py: _VISION_AUTO_PROVIDER_ORDER includes ollama
✅ agent/model_metadata.py: ollama added to _PROVIDER_PREFIXES and _URL_TO_PROVIDER
✅ agent/model_metadata.py: Gemma-4 context lengths added (256K for 1b/4b/12b/26b/31b)
✅ Changes pushed to branch fix/kimi-fallback-model (PR #222)

Closing as completed.

**Bezalel delivered (2026-04-07):** - ✅ `hermes_cli/main.py`: `--provider ollama` added to CLI choices - ✅ `hermes_cli/auth.py`: `ollama` alias maps to canonical provider `ollama` (not `custom`) - ✅ `agent/auxiliary_client.py`: `resolve_provider_client` routes `ollama` to `http://localhost:11434/v1` with default `gemma4:12b` - ✅ `agent/auxiliary_client.py`: `_try_ollama` added to auto-detection chain (priority: after Nous, before local/custom) - ✅ `agent/auxiliary_client.py`: `_VISION_AUTO_PROVIDER_ORDER` includes `ollama` - ✅ `agent/model_metadata.py`: `ollama` added to `_PROVIDER_PREFIXES` and `_URL_TO_PROVIDER` - ✅ `agent/model_metadata.py`: Gemma-4 context lengths added (256K for 1b/4b/12b/26b/31b) - ✅ Changes pushed to branch `fix/kimi-fallback-model` (PR #222) Closing as completed.

claude referenced this issue from a commit

2026-04-07 16:04:47 +00:00

feat(provider): first-class Ollama support + Gemma 4 defaults (#169)

Sign in to join this conversation.

Branches Tags

main

fix/kimi-fallback-rebase

bezalel/self-awareness-epic-203

fix/kimi-fallback-model

bezalel/pr-215-rescue

perplexity/mempalace-tests

upstream-sync

bezalel/fix-gitea-ci-runner-host-mode

claude/issue-192

claude/issue-190

bezalel/fix-indentation-error

bezalel/gitea-workflow-skill

rescue/ollama-provider

rescue/v011-obfuscation-fix

claw-code/issue-151

claw-code/issue-126

groq/issue-168

timmy/issue-169-ollama-provider

gemini/issue-24

bezalel/syntax-guard-ci

claude/issue-128

claude/issue-142

claude/issue-133

claude/issue-143

claude/issue-146

claude/issue-155

claude/issue-147

claude/issue-148

bezalel/notebook-workflow-demo

claude/issue-149

bezalel/forge-health-check

epic-999-phase-ii-forge

allegro/m1-stop-protocol

timmy/issue-123-process-resilience

timmy/issue-116-config-validation

epic-999-phase-i

security/v-011-skills-guard-bypass

gemini/security-hardening

gemini/sovereign-gitea-client

timmy-custom

security/fix-oauth-session-fixation

security/fix-skills-path-traversal

security/fix-file-toctou

security/fix-error-disclosure

security/add-rate-limiting

security/fix-browser-cdp

security/fix-docker-privilege

security/fix-auth-bypass

fix/sqlite-contention

tests/security-coverage

security/fix-race-condition

security/fix-ssrf

security/fix-secret-leakage

feat/gen-ai-evolution-phases-19-21

feat/gen-ai-evolution-phases-16-18

feat/gen-ai-evolution-phases-13-15

security/fix-path-traversal

security/fix-command-injection

feat/gen-ai-evolution-phases-10-12

feat/gen-ai-evolution-phases-7-9

feat/gen-ai-evolution-phases-4-6

feat/gen-ai-evolution-phases-1-3

feat/sovereign-evolution-redistribution

feat/apparatus-verification

feat/sovereign-intersymbolic-ai

feat/sovereign-learning-system

feat/sovereign-reasoning-engine

1 Participants

Notifications

Due Date

No due date set.

Dependencies

No dependencies set.

Reference: Timmy_Foundation/hermes-agent#169