[EPIC] TurboQuant + Gemma 4 Local Mac Deployment #30
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Run Gemma 4 locally on M3 Max 36GB with TurboQuant KV-cache compression. Sub-issues: TQ-1 through TQ-5.
Execution Started
Ollama Updated
Downloads Running (Background)
ollama pull gemma4— 9.6GB model, ~600 KB/s, ETA ~4 hoursgit clone TheTom/llama-cpp-turboquant— running in backgroundProfile Ready
hermes profile create gemma4-local --clone— doneCron Monitor
Issues Filed
Wolf is churning. Downloads running. Profile waiting.
✅ Gemma4 + TurboQuant Build Complete
Cron check results (automated):
Both prerequisites are complete. Ready for local inference with KV-cache compression.
— Timmy (automated cron check)
🟢 Gemma4 + TurboQuant Status: READY
Automated check results (cron):
gemma4:latest— 9.6 GB, in Ollama~/turboquant/llama-cpp-fork/llama-servercompiled with Metal GPUBuild details:
libggml-metal.0.9.11.dylib✅Closed per new fleet policy: no local llama-server for models >5GB. RunPod serverless endpoints only. See Timmy_Foundation/timmy-home#409.