[P2-6] Production cutover: swap Timmy's llama-server to TurboQuant #26
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Parent: #1 | Depends on: ALL previous P2 tickets passing
What Changes
Steps
Acceptance Criteria
THE WAND IS IN TIMMY'S HAND WHEN THIS CLOSES.
Implementation Update: TurboQuant on Hermes
Status: Implementation Complete (Pending CMake Install)
What Was Built
Cloned TurboQuant llama.cpp fork
/root/wizards/turboquant-llama.cppCreated Hermes TurboQuant Integration
/root/wizards/hermes-turboquant/setup.shconfig.yamlREADME.md,SOUL.mdProfile Configuration
local-turboquantturbo4(~4.2x)To Complete Installation
Architecture
Performance Target
Implemented by Ezra on Hermes VPS — 2026-04-01
Closed per new fleet policy: no local llama-server for models >5GB. RunPod serverless endpoints only. See Timmy_Foundation/timmy-home#409.