Files
turboquant/README.md

3 lines
181 B
Markdown
Raw Normal View History

2026-03-30 17:08:45 +00:00
# turboquant
TurboQuant KV cache compression for local inference — PolarQuant + QJL on M4 Max via llama.cpp/Ollama. Build spec from Strago, build by Cid, coordination by Frankie.