[P2-1] Download wikitext-2-raw and run perplexity quality gate #21
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Parent: #1 | BLOCKING — nothing else matters if quality fails
Why First
The Phase 1 report deferred the most important test: does turbo4 compression degrade text quality? Throughput benchmarks passed but perplexity is the real gate. If PPL delta exceeds 0.5, turbo4 is not production-ready regardless of how fast it is.
Steps
Record: PPL value, time taken.
Record: PPL value, time taken.
PPL_delta = turbo4_ppl - baseline_pplAcceptance Criteria
benchmarks/perplexity_results.jsonTime estimate: 30-60 minutes (perplexity runs are slow)
Progress Update — 2026-03-31
Baseline Perplexity Established
Wrong Fork Issue
Initial attempt built the wrong fork (unixsysdev/llama-turboquant) which has upstream tq1/tq2/tq3 ternary types, NOT PolarQuant turbo2/turbo3/turbo4. That fork is also missing Metal SET_ROWS kernels.
The correct fork is TheTom/llama-cpp-turboquant which has:
Next Step
Clone correct fork, build, run turbo4 perplexity against the established 5.9879 baseline. Pass criteria: delta < 0.5.
Build+benchmark instructions saved to ~/turboquant/NEXT_STEPS.md.
Analyzed: This issue is not stale. URL: http://143.198.27.163:3000/Timmy_Foundation/turboquant/issues/21
Triaged during backlog cleanup — priority confirmed. Needs owner assignment.