[sovereignty] Cut the Cloud Umbilical — Close #94

THE BUG ======= Issue #94 flagged: the active config's fallback_model pointed to Google Gemini cloud. The enabled Health Monitor cron job had model=null, provider=null — so it inherited whatever the config defaulted to. If the default was ever accidentally changed back to cloud, every 5-minute cron tick would phone home. THE FIX ======= config.yaml: - fallback_model → local Ollama (hermes3:latest on localhost:11434) - Google Gemini custom_provider → renamed '(emergency only)' - tts.openai.model → disabled (use edge TTS locally) cron/jobs.json: - Health Monitor → explicit model/provider/base_url fields - No enabled job can ever inherit cloud defaults again tests/test_sovereignty_enforcement.py (NEW — 13 tests): - Default model is localhost - Fallback model is localhost (the #94 fix) - No enabled cron has null model/provider - No enabled cron uses cloud URLs - First custom_provider is local - TTS and STT default to local tests/test_local_runtime_defaults.py (UPDATED): - Now asserts fallback is Ollama, not Gemini WHAT STILL WORKS ================ Google Gemini is still available for explicit override: hermes --model gemini-2.5-pro It's just not automatic anymore. You have to ask for it. FULL SUITE ========== 36/36 pass. Zero regressions. Closes #94 Signed-off-by: gemini <gemini@hermes.local>
2026-03-31 08:29:58 -04:00
parent 40ccc88ff1
commit d5f8647ce5
4 changed files with 216 additions and 8 deletions
--- a/config.yaml
+++ b/config.yaml
@@ -114,7 +114,7 @@ tts:
    voice_id: pNInz6obpgDQGcFmaJgB
    model_id: eleven_multilingual_v2
  openai:
-    model: gpt-4o-mini-tts
+    model: ''  # disabled — use edge TTS locally
    voice: alloy
  neutts:
    ref_audio: ''
@@ -189,7 +189,9 @@ custom_providers:
  base_url: http://localhost:8081/v1
  api_key: none
  model: hermes4:14b
- name: Google Gemini
+# ── Emergency cloud provider — not used by default or any cron job.
+# Available for explicit override only: hermes --model gemini-2.5-pro
+- name: Google Gemini (emergency only)
  base_url: https://generativelanguage.googleapis.com/v1beta/openai
  api_key_env: GEMINI_API_KEY
  model: gemini-2.5-pro
@@ -213,7 +215,7 @@ mcp_servers:
    env: {}
    timeout: 30
 fallback_model:
-  provider: custom
-  model: gemini-2.5-pro
-  base_url: https://generativelanguage.googleapis.com/v1beta/openai
-  api_key_env: GEMINI_API_KEY
+  provider: ollama
+  model: hermes3:latest
+  base_url: http://localhost:11434/v1
+  api_key: ''