[TQ-2] Build TheTom/llama-cpp-turboquant for M3 Max Metal #27
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Parent: #30
Task
Clone the correct TurboQuant fork and build with Metal support.
Steps
Verify
Acceptance Criteria
llama-cli --helpshows turbo4 cache typellama-serverbinary exists and runsTQ-2: BUILD COMPLETE
Branch
feature/turboquant-kv-cachefrom TheTom/llama-cpp-turboquantBuild
Verified Types
Binaries
Location
~/turboquant/llama-cpp-fork/build/bin/Blocked on
Gemma 4 download (9% complete, ~3-4 hours remaining)
Closed per new fleet policy: no local llama-server for models >5GB. RunPod serverless endpoints only. See Timmy_Foundation/timmy-home#409.