Files
turboquant/README.md
2026-03-30 17:08:45 +00:00

181 B

turboquant

TurboQuant KV cache compression for local inference — PolarQuant + QJL on M4 Max via llama.cpp/Ollama. Build spec from Strago, build by Cid, coordination by Frankie.