[BLOCKED] TurboQuant Gemma 4 compression — waiting on llama.cpp upstream #2
Status: BLOCKED
Blocker
llama.cpp (and the TurboQuant fork) does not recognize the gemma4 architecture.
What Exists
gemma-4-E4B-it-Q4_K_M.gguf (4.64 GB)
/root/wizards/bezalel/models/gemma-4-e4b/
Impact
Low. Bezalel runs fine on Claude Opus 4.6 + Ollama. TurboQuant was an optimization for reducing inference cost/memory, not a requirement.
When to Retry
gemma4 architecture support in upstream llama.cpp
Related
/root/wizards/bezalel/BLOCKED-TURBOQUANT-GEMMA4.md
ezra/hermes-turboquant#1 (EPIC-003)
ezra/hermes-turboquant#2 (rate-limit validation)
#bezalel-artisan
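Note for the retry: the blocker is about the architecture name the model file declares, so it may help to confirm what the existing GGUF actually reports before rebuilding anything. The sketch below reads the `general.architecture` metadata key directly from a GGUF header, following the published GGUF layout (version 2 or later, little-endian). The function name `read_gguf_architecture` and its helpers are hypothetical, and the expectation that the file reports `gemma4` is an assumption based on the blocker description, not something verified here. It only shows what the file declares; it says nothing about whether a given llama.cpp build supports that architecture.

```python
import struct
from typing import BinaryIO, Optional

# GGUF metadata value type ids mapped to struct formats (little-endian).
_SCALAR_FORMATS = {
    0: "<B", 1: "<b", 2: "<H", 3: "<h", 4: "<I", 5: "<i",
    6: "<f", 7: "<?", 10: "<Q", 11: "<q", 12: "<d",
}
_TYPE_STRING = 8
_TYPE_ARRAY = 9


def _read(f: BinaryIO, fmt: str):
    """Read and unpack exactly one value of the given struct format."""
    size = struct.calcsize(fmt)
    data = f.read(size)
    if len(data) != size:
        raise ValueError("unexpected end of file")
    return struct.unpack(fmt, data)[0]


def _read_string(f: BinaryIO) -> str:
    """GGUF strings are a uint64 byte length followed by UTF-8 bytes."""
    length = _read(f, "<Q")
    data = f.read(length)
    if len(data) != length:
        raise ValueError("unexpected end of file")
    return data.decode("utf-8")


def _read_value(f: BinaryIO, value_type: int):
    """Read one metadata value so later keys can be reached."""
    if value_type == _TYPE_STRING:
        return _read_string(f)
    if value_type == _TYPE_ARRAY:
        elem_type = _read(f, "<I")
        count = _read(f, "<Q")
        return [_read_value(f, elem_type) for _ in range(count)]
    if value_type in _SCALAR_FORMATS:
        return _read(f, _SCALAR_FORMATS[value_type])
    raise ValueError(f"unknown GGUF value type {value_type}")


def read_gguf_architecture(f: BinaryIO) -> Optional[str]:
    """Return the general.architecture string, or None if the key is absent."""
    if f.read(4) != b"GGUF":
        raise ValueError("not a GGUF file")
    version = _read(f, "<I")
    if version < 2:
        raise ValueError("GGUF v1 uses a different header layout")
    _read(f, "<Q")             # tensor count, not needed here
    kv_count = _read(f, "<Q")  # number of metadata key/value pairs
    for _ in range(kv_count):
        key = _read_string(f)
        value = _read_value(f, _read(f, "<I"))
        if key == "general.architecture":
            return value
    return None
```

To use it, open the model file in binary mode (`open(model_path, "rb")`, where `model_path` points at the `.gguf` file) and pass the handle in. The function stops at the first matching key, so it reads only the start of the file and does not load the tensor data.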