[HARNESS] Smart model routing — narrow decision space to competency space #609

perplexity · 2026-03-27T01:10:18Z

Key Insight

"I just need to narrow the decision space based off the competency space. Claude Opus or Gemini for design discussions. A very dumb but very well-aligned model handles simple decisions."

Routing Strategy

| Decision Type | Model | Why |
|---|---|---|
| Architecture/design discussions | Claude Opus, Gemini | High reasoning ceiling |
| Simple yes/no routing decisions | Tiny aligned local model (Phi-3, Gemma 2B) | Fast, cheap, no cloud dependency |
| Code generation | Hermes 4 via harness | Self-improvement loops capture training data |
| Math/logic verification | Deterministic engine (SymPy/Z3) | No hallucination risk |
| Personality/conversation | Soul-file-tuned Hermes 4 | Vibe alignment |
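
The routing table above could be held as plain data so the router only ever performs a lookup. This is a minimal sketch, not the harness's actual code: the `DecisionType` enum, the `Route` dataclass, `route_for`, and every backend identifier string are hypothetical names chosen for illustration, since the issue names the models but not how they are addressed.

```python
from dataclasses import dataclass
from enum import Enum


class DecisionType(Enum):
    """The five decision types listed in the routing table."""
    DESIGN = "architecture/design discussion"
    SIMPLE_ROUTING = "simple yes/no routing decision"
    CODE_GENERATION = "code generation"
    VERIFICATION = "math/logic verification"
    CONVERSATION = "personality/conversation"


@dataclass(frozen=True)
class Route:
    backends: tuple[str, ...]  # candidate models or engines, in table order
    rationale: str             # the "Why" column


# Backend identifiers are placeholders (assumption); only the model names
# themselves come from the issue.
ROUTES: dict[DecisionType, Route] = {
    DecisionType.DESIGN: Route(("claude-opus", "gemini"), "high reasoning ceiling"),
    DecisionType.SIMPLE_ROUTING: Route(
        ("phi-3", "gemma-2b"), "fast, cheap, no cloud dependency"
    ),
    DecisionType.CODE_GENERATION: Route(
        ("hermes-4",), "self-improvement loops capture training data"
    ),
    DecisionType.VERIFICATION: Route(("sympy", "z3"), "no hallucination risk"),
    DecisionType.CONVERSATION: Route(("hermes-4-soul",), "vibe alignment"),
}


def route_for(decision_type: DecisionType) -> Route:
    """Look up the route for a decision type; raises KeyError if unmapped."""
    return ROUTES[decision_type]
```

Keeping the table as data means adding or retiring a model is a one-line change rather than a change to routing logic.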

Implementation

- [ ] Define competency tiers for available models
- [ ] Huey task router inspects task metadata → selects appropriate model
- [ ] Log which model handled which task (feeds into Prometheus metrics)
- [ ] Track cost-per-task across model tiers
- [ ] Sovereignty bonus: prefer local model when confidence is sufficient
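
The checklist above might be sketched as a small router class. Everything here is an assumption made for illustration: the `ModelTier` and `Router` names, the `required_competency` and `local_confidence` metadata keys, the 0.8 threshold, and the reading of "sovereignty bonus" as "override a cloud choice with the strongest local tier when confidence is high enough". The sketch uses only the standard library; wiring it into a Huey task and exporting the counters to Prometheus is left out.

```python
import logging
from collections import defaultdict
from dataclasses import dataclass

logger = logging.getLogger("model_router")


@dataclass(frozen=True)
class ModelTier:
    name: str        # hypothetical backend identifier
    competency: int  # higher value means a higher reasoning ceiling
    local: bool      # True when the model runs without a cloud dependency


class Router:
    def __init__(self, tiers, local_confidence_threshold=0.8):
        # Sort ascending so the lowest sufficient tier is found first.
        self.tiers = sorted(tiers, key=lambda t: t.competency)
        self.threshold = local_confidence_threshold
        self.task_counts = defaultdict(int)
        self.cost_totals = defaultdict(float)

    def select(self, metadata):
        """Pick a tier from task metadata (keys are assumed, not from the issue)."""
        required = metadata.get("required_competency", 1)
        confidence = metadata.get("local_confidence", 0.0)
        default = next((t for t in self.tiers if t.competency >= required), None)
        if default is None:
            raise ValueError(f"no tier satisfies competency {required}")
        if default.local:
            return default
        # Sovereignty bonus: swap a cloud choice for the strongest local tier
        # when the caller is confident enough that a local model will do.
        local_tiers = [t for t in self.tiers if t.local]
        if local_tiers and confidence >= self.threshold:
            return local_tiers[-1]
        return default

    def record(self, task_id, tier, cost):
        """Log which model handled the task and accumulate its cost."""
        self.task_counts[tier.name] += 1
        self.cost_totals[tier.name] += cost
        logger.info(
            "task=%s model=%s local=%s cost=%.4f",
            task_id, tier.name, tier.local, cost,
        )

    def cost_per_task(self):
        """Average recorded cost per task, keyed by tier name."""
        return {
            name: self.cost_totals[name] / count
            for name, count in self.task_counts.items()
        }


router = Router([ModelTier("tiny-local", 1, True), ModelTier("frontier-cloud", 3, False)])
chosen = router.select({"required_competency": 3, "local_confidence": 0.9})
router.record("task-1", chosen, cost=0.0)
```

The per-tier counts and cost totals are kept as plain dictionaries so they can be mapped onto labelled metrics later without changing the selection logic.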

Related

- Depends on local inference telemetry (already wired into Huey)
- Feeds into sovereignty rubric (% of tasks routed locally vs cloud)
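
The sovereignty rubric figure mentioned above (share of tasks routed locally versus to the cloud) reduces to a simple ratio over the routing log. A minimal sketch, assuming each log entry is a dict with a boolean `local` field; the function name and the entry shape are hypothetical.

```python
from collections import Counter


def sovereignty_score(routing_log):
    """Return the percentage of logged tasks that were handled locally."""
    counts = Counter("local" if entry["local"] else "cloud" for entry in routing_log)
    total = counts["local"] + counts["cloud"]
    if total == 0:
        return 0.0  # no tasks logged yet; treated as 0% by convention here
    return 100.0 * counts["local"] / total
```

Returning 0.0 for an empty log is a design choice of this sketch; a caller might prefer `None` to distinguish "no data" from "nothing routed locally".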

_Source: [Gemini brainstorm session 2026-03-26](https://g.co/gemini/share/3700c8d29b6b), triaged by Perplexity_

perplexity added the `modularization`, `harness`, and `p1-important` labels 2026-03-27 01:10:18 +00:00

Timmy commented

2026-03-27 01:10:28 +00:00

⚡ Dispatched to `claude`. Huey task queued.

Timmy commented

2026-03-27 01:10:31 +00:00

⚡ Dispatched to `gemini`. Huey task queued.

Timmy commented

2026-03-27 01:10:34 +00:00

⚡ Dispatched to `kimi`. Huey task queued.

Timmy commented

2026-03-27 01:10:36 +00:00

⚡ Dispatched to `grok`. Huey task queued.

Timmy commented

2026-03-27 01:10:37 +00:00

⚡ Dispatched to `perplexity`. Huey task queued.

Timmy commented

2026-03-27 01:15:26 +00:00

🔍 Triaged by Huey — needs assignment.

Timmy commented

2026-03-27 01:30:21 +00:00

🔍 Triaged by Huey — needs assignment.

gemini commented

2026-03-27 01:40:30 +00:00

🔧 `gemini` working on this via Huey. Branch: `gemini/issue-609`

grok commented

2026-03-27 01:40:39 +00:00

🔧 `grok` working on this via Huey. Branch: `grok/issue-609`

grok commented

2026-03-27 01:40:42 +00:00

⚠️ `grok` produced no changes for this issue. Skipping.

gemini referenced a pull request that will close this issue

2026-03-27 01:40:47 +00:00

[gemini] [HARNESS] Smart model routing — narrow decision space to competency space (#609) #612

gemini referenced this issue from a commit

2026-03-27 01:40:48 +00:00

[gemini] [HARNESS] Smart model routing — narrow decision space to competency space (#609)

Timmy commented

2026-03-27 01:45:21 +00:00

🔍 Triaged by Huey — needs assignment.

Timmy commented

2026-03-27 02:00:18 +00:00

🔍 Triaged by Huey — needs assignment.

perplexity referenced this issue

2026-03-27 16:55:32 +00:00

[PORTAL] Three-layer game architecture: Timmy → Reflex → Pilot #660

Timmy commented

2026-03-28 04:46:28 +00:00

Closing as duplicate during backlog burn-down. Canonical issue: #604.

Reason: identical title/workstream. Keeping one thread prevents duplicate agent labor and review waste.

Timmy closed this issue

2026-03-28 04:46:28 +00:00

Reference: Timmy_Foundation/the-nexus#609