Monitoring tool for tracking when TurboQuant lands in upstream
llama.cpp and Ollama. It searches GitHub PRs and issues for mentions
of TurboQuant, PolarQuant, and QJL, checks Ollama releases, and
compares fork freshness against upstream.
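
The keyword check described above could look roughly like the sketch
below. The keyword list comes from this description; the matching rules
(case-insensitive, not embedded in a longer word) and the function name
`matched_keywords` are assumptions for illustration, not necessarily
what scripts/upstream_watch.py does.

```python
import re

# Keywords named in the description; the matching rules are assumed.
KEYWORDS = ("TurboQuant", "PolarQuant", "QJL")

# Case-insensitive match that rejects hits embedded in a longer run of
# letters, but still accepts forms such as "turboquant_kv".
_PATTERN = re.compile(
    r"(?<![A-Za-z])(" + "|".join(map(re.escape, KEYWORDS)) + r")(?![A-Za-z])",
    re.IGNORECASE,
)


def matched_keywords(text):
    """Return the sorted canonical keywords mentioned in a PR/issue text."""
    canonical = {k.lower(): k for k in KEYWORDS}
    hits = _PATTERN.findall(text or "")
    return sorted({canonical[h.lower()] for h in hits})
```

Applied to a PR or issue title, it returns the canonical spellings of any
keywords found, or an empty list when nothing matches.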

scripts/upstream_watch.py: automated monitoring
- Search llama.cpp/ggml/ollama for TurboQuant keywords
- Check Ollama releases for KV cache mentions
- Compare fork commit age vs upstream
- Generate report or JSON output
- Run: python3 scripts/upstream_watch.py --since 30d
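
As a rough illustration of the `--since` window and the fork-age
comparison listed above, here is a minimal sketch. Only the `30d` form
appears in this description; the `w` and `h` units, the function names
`parse_since` and `fork_lag`, and the use of ISO 8601 commit timestamps
are assumptions.

```python
import re
from datetime import datetime, timedelta, timezone


def parse_since(value):
    """Turn a window such as '30d' into a timedelta.

    'd' (days) is the form shown in the usage line; 'w' and 'h' are assumed.
    """
    m = re.fullmatch(r"(\d+)([dwh])", value.strip().lower())
    if not m:
        raise ValueError(f"unsupported --since value: {value!r}")
    amount, unit = int(m.group(1)), m.group(2)
    units = {
        "d": timedelta(days=amount),
        "w": timedelta(weeks=amount),
        "h": timedelta(hours=amount),
    }
    return units[unit]


def _parse_iso(ts):
    # Accept a trailing 'Z' (UTC), which older fromisoformat() rejects.
    dt = datetime.fromisoformat(ts.replace("Z", "+00:00"))
    return dt if dt.tzinfo else dt.replace(tzinfo=timezone.utc)


def fork_lag(fork_commit_iso, upstream_commit_iso):
    """Return how far the fork's newest commit trails upstream's newest.

    The result is zero when the fork is not behind.
    """
    lag = _parse_iso(upstream_commit_iso) - _parse_iso(fork_commit_iso)
    return max(lag, timedelta(0))
```

A lag of zero (or below some chosen threshold) would correspond to the
fork being reported as current.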

docs/upstream-watch-report.md: current status
- TurboQuant has NOT landed upstream yet
- Fork is CURRENT with upstream llama.cpp
- Continue using the TheTom/llama-cpp-turboquant fork