Fix #679: Generate GENOME.md for turboquant
All checks were successful
Smoke Test / smoke (pull_request) Successful in 12s
All checks were successful
Smoke Test / smoke (pull_request) Successful in 12s
- Created comprehensive GENOME.md with full codebase analysis - Added architecture diagram (Mermaid) - Documented entry points and data flow - Identified key abstractions - Mapped API surface (C, Metal, CLI) - Identified test coverage gaps - Documented security considerations - Added basic test suite (9 tests passing) Key findings: - 73.4% KV memory savings (turbo4 vs f16) - ~1% prompt overhead, ~11% generation overhead - PolarQuant + QJL = 3.5 bits/channel - Metal shaders exist on feature branch - CPU reference incompatible with Metal dequant - QJL infrastructure present but disabled Test coverage gaps: - No unit tests for encode/decode - No integration tests - No perplexity runner (corpus exists) - No Metal vs CPU parity tests Security considerations: - Buffer overflow risk in bit packing - No constant-time implementation - No safety wrapper for C/C++ code
This commit is contained in:
BIN
tests/__pycache__/test_turboquant.cpython-312-pytest-9.0.2.pyc
Normal file
BIN
tests/__pycache__/test_turboquant.cpython-312-pytest-9.0.2.pyc
Normal file
Binary file not shown.
Reference in New Issue
Block a user