Files
turboquant/README.md
2026-03-30 17:08:45 +00:00

3 lines
181 B
Markdown

# turboquant
TurboQuant KV cache compression for local inference — PolarQuant + QJL on M4 Max via llama.cpp/Ollama. Build spec from Strago, build by Cid, coordination by Frankie.