GEMMA 4 DEPLOYMENT - READY TO ACTIVATE
==================================================

MODEL:
  Path: /root/wizards/ezra/home/models/gemma4/gemma-4-31B-it-Q4_K_M.gguf
  Size: 18.3 GB
  Quantization: Q4_K_M (4.77 bits per weight)
  Context: 16k tokens (configurable up to 262k)

SERVER:
  Port: 11435
  URL: http://127.0.0.1:11435
  Threads: 4 (CPU-only)
  Max tokens: 4096
  Tool calling: Enabled (--jinja)

TO ACTIVATE:
  1. Start server: ~/home/start-gemma4.sh
  2. Switch config: ~/home/switch-to-gemma4.sh
  3. Restart Ezra

TO REVERT:
  Config backup created automatically on switch
  Or manually edit ~/home/config.yaml

STATUS:
  Model: ✓ Downloaded
  Config: ✓ Ready
  Server: ⏳ Needs llama-server binary

NOTE:
  llama.cpp added Gemma 4 support in recent commits.
  Prebuilt binaries will be available soon.
  Or build from: https://github.com/ggerganov/llama.cpp