turboquant

Timmy_Foundation/turboquant

Fork 0

Files

History

Timmy 064ab602dd

Smoke Test / smoke (pull_request) Successful in 16s

Details

feat: M1 benchmark suite — throughput, memory, perplexity comparison

Implements Issue #80: benchmark turboquant vs llama.cpp baseline on M1.

New files:
- benchmarks/run_m1_benchmark.py — comprehensive benchmark runner
- benchmarks/run_benchmark_m1.sh — shell wrapper for easy execution
- tests/test_m1_benchmark.py — unit tests for benchmark functions

Measures:
- Tokens/sec throughput (f16 vs turbo4, 3-run average)
- Memory usage (RSS monitoring during inference)
- Quality via perplexity (llama-perplexity on wikitext-2)

Generates:
- benchmarks/m1_benchmark_results.json — raw results
- benchmarks/m1_benchmark_report.md — markdown comparison table

Closes #80

2026-04-15 22:10:43 -04:00

roundtrip_test.cpp

feat: add standalone build system and roundtrip tests (Issue #17 )

2026-04-14 11:34:38 -04:00

test_m1_benchmark.py

feat: M1 benchmark suite — throughput, memory, perplexity comparison

2026-04-15 22:10:43 -04:00