All checks were successful
Smoke Test / smoke (pull_request) Successful in 7s
- Adds ansible/ deploy_turboquant.yml playbook with per-node config - Adds turboquant-deploy role: OS-specific (darwin/debian) tasks - Adds health_check.sh and integration test (chat completion) - Adds inventory.ini.example with Mac/Allegro/Ezra groups - Deploys llama.cpp with TurboQuant (Metal on macOS) - Systemd service (Linux) with TURBO_LAYER_ADAPTIVE env
624 B
624 B
TurboQuant Ansible Deployment
Deploy TurboQuant-compressed Gemma 4 inference across fleet nodes.
Quick Start
# Copy and edit inventory
cp ansible/inventory.ini.example ansible/inventory.ini
# Deploy to all nodes
ansible-playbook -i ansible/inventory.ini ansible/deploy_turboquant.yml
# Run health check
ansible -i ansible/inventory.ini all -m shell -a "sudo /opt/turboquant/health_check.sh"
# Run integration test
ansible -i ansible/inventory.ini all -m shell -a "curl -s http://localhost:8081/v1/chat/completions -d '{\"model\":\"gemma-4\",\"messages\":[{\"role\":\"user\",\"content\":\"Hello\"}]}'"