20 lines
463 B
Markdown
20 lines
463 B
Markdown
|
|
# TurboQuant Deployment — Ezra VPS Configuration
|
||
|
|
|
||
|
|
Resolves #110.
|
||
|
|
|
||
|
|
## Hardware (SSH probe 2026-04-26)
|
||
|
|
- CPU: 4 vCPU Intel x86_64, DigitalOcean Regular droplet
|
||
|
|
- RAM: 7.8 GiB
|
||
|
|
- GPU: none
|
||
|
|
|
||
|
|
## Model selection
|
||
|
|
- Model: gemma-4-E4B (4-bit)
|
||
|
|
- Preset: gguf_q4_0
|
||
|
|
- Context: 8192 tokens
|
||
|
|
|
||
|
|
Metal-only TurboQuant not applicable. Uses vanilla GGUF.
|
||
|
|
|
||
|
|
## Deliverables
|
||
|
|
- Inventory entry added to ansible/inventory/hosts.yml
|
||
|
|
- Smoke test validates hardware, model, preset fields
|