[HARNESS] Switch model default from Anthropic cloud to local llama.cpp backend (sovereignty) #22

Closed
opened 2026-03-26 19:57:09 +00:00 by perplexity · 9 comments
Member

## Context

Per research spike the-nexus#576 and the sovereignty mandate. Current config.yaml:

```yaml
model:
  default: claude-opus-4-6
  provider: anthropic
```

When the Anthropic quota expires (as it is right now), Timmy is dead. The fallback is `gemini-2.5-pro` (also cloud). The Ollama entry points at `hermes3:latest` (the old 8B base, not our trained model).

Per FALSEWORK.md: cloud services are temporary scaffolding. The permanent structure is local.

## Task

After Hermes 4 14B is pulled and the Modelfile is created (#9, the-nexus#564):

```yaml
model:
  default: hermes4:14b
  provider: custom

providers:
  ollama:
    base_url: http://localhost:11434/v1
    model: hermes4:14b

fallback_model:
  provider: custom
  model: gemini-2.5-pro
  base_url: https://generativelanguage.googleapis.com/v1beta/openai
  api_key_env: GEMINI_API_KEY
```

Cloud becomes the fallback, not the default. Local becomes primary.
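
That local-first policy can be sketched as a probe-and-fallback over the two OpenAI-compatible endpoints. This is a hypothetical illustration only: the provider list mirrors the config above, but `reachable` and `pick_provider` are not real Hermes functions.

```python
# Sketch of the local-first routing policy: try providers in order,
# use the first one whose endpoint answers. Names here are illustrative.
import urllib.request
import urllib.error

# Mirrors the providers/fallback_model entries in config.yaml.
PROVIDERS = [
    {"name": "ollama", "base_url": "http://localhost:11434/v1", "model": "hermes4:14b"},
    {"name": "gemini",
     "base_url": "https://generativelanguage.googleapis.com/v1beta/openai",
     "model": "gemini-2.5-pro"},
]

def reachable(base_url, timeout=2):
    """Probe an OpenAI-compatible endpoint by listing its models."""
    try:
        with urllib.request.urlopen(f"{base_url}/models", timeout=timeout):
            return True
    except (urllib.error.URLError, OSError):
        return False

def pick_provider(providers, probe=reachable):
    """Return the first reachable provider: local first, cloud only as fallback."""
    for provider in providers:
        if probe(provider["base_url"]):
            return provider
    raise RuntimeError("no provider reachable")
```

The probe is injectable so the selection logic can be exercised without a live server; in practice the harness would make the same decision at request time, so cloud is only touched when local genuinely fails.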

Also clean up stale `custom_providers` entries (`glm-4.7-flash:latest` is deleted and shouldn't be listed).

## Acceptance criteria

- [ ] `hermes config show` shows `model.default: hermes4:14b`
- [ ] `hermes config show` shows `model.provider: custom`
- [ ] Timmy responds using local Ollama when Anthropic quota is expired
- [ ] Stale `custom_providers` entries removed
- [ ] Fallback to Gemini only triggers when local model can't handle the task

## Depends on

- timmy-config#9 (Pull Hermes 4 14B)
- the-nexus#564 (Create Modelfile)

## Pillar: Harness (sovereignty)

## Research update — 2026-03-27

Reclassify from research to execution. Hermes already supports sovereign/local routing; the remaining work is provider/default config, fallback policy, and proof that local-first is actually active. Local model path here is llama.cpp, not Ollama.
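
Since llama.cpp's `llama-server` exposes the same OpenAI-compatible API (by default on port 8080), one way to prove "local-first is actually active" is to list the served models and confirm the expected one is present. A minimal sketch under that assumption; the endpoint, port, and function names are illustrative, not part of the harness:

```python
# Smoke check: is the local OpenAI-compatible server (llama-server or
# Ollama) up, and is it serving the model we expect?
import json
import urllib.request

def model_ids(payload):
    """Extract model ids from an OpenAI-style /v1/models response body."""
    return [m["id"] for m in payload.get("data", [])]

def served_models(base_url="http://localhost:8080/v1", timeout=5):
    """Ask the local server which models it currently serves."""
    with urllib.request.urlopen(f"{base_url}/models", timeout=timeout) as resp:
        return model_ids(json.load(resp))
```

If `served_models()` raises instead of listing the local model, the local backend is down and the harness is silently running on cloud fallback — exactly the state this issue is meant to make visible.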

Timmy was assigned by perplexity 2026-03-26 19:57:09 +00:00
Owner

⚡ Dispatched to `claude`. Huey task queued.
Owner

⚡ Dispatched to `gemini`. Huey task queued.
Owner

⚡ Dispatched to `kimi`. Huey task queued.
Owner

⚡ Dispatched to `grok`. Huey task queued.
Owner

⚡ Dispatched to `perplexity`. Huey task queued.
Member

🔧 `gemini` working on this via Huey. Branch: `gemini/issue-22`
Member

🔧 `grok` working on this via Huey. Branch: `grok/issue-22`
Member

⚠️ `grok` produced no changes for this issue. Skipping.
Timmy changed title from [HARNESS] Switch model default from Anthropic cloud to local Ollama (sovereignty) to [HARNESS] Switch model default from Anthropic cloud to local llama.cpp backend (sovereignty) 2026-03-28 03:36:39 +00:00
Owner

Closing during the 2026-03-28 backlog burn-down.

Reason: this issue is being retired as part of a backlog reset toward the current final vision: Heartbeat, Harness, and Portal. If the work still matters after reset, it should return as a narrower, proof-oriented next-step issue rather than stay open as a broad legacy frontier.

Timmy closed this issue 2026-03-28 04:53:05 +00:00

Reference: Timmy_Foundation/timmy-config#22