home/GEMMA4-STATUS.txt


GEMMA 4 DEPLOYMENT - READY TO ACTIVATE
==================================================

MODEL:
  Path: /root/wizards/ezra/home/models/gemma4/gemma-4-31B-it-Q4_K_M.gguf
  Size: 18.3 GB
  Quantization: Q4_K_M (4.77 bits per weight)
  Context: 16k tokens (configurable up to 262k)

SERVER:
  Port: 11435
  URL: http://127.0.0.1:11435
  Threads: 4 (CPU-only)
  Max tokens: 4096
  Tool calling: Enabled (--jinja)

TO ACTIVATE:
  1. Start server: ~/home/start-gemma4.sh
  2. Switch config: ~/home/switch-to-gemma4.sh
  3. Restart Ezra

TO REVERT:
  Config backup created automatically on switch
  Or manually edit ~/home/config.yaml

STATUS:
  Model: ✓ Downloaded
  Config: ✓ Ready
  Server: ⏳ Needs llama-server binary

NOTE:
  llama.cpp added Gemma 4 support in recent commits.
  Prebuilt binaries will be available soon.
  Or build from: https://github.com/ggerganov/llama.cpp
Add stuck initiatives audit report 2026-04-03 22:42:06 +00:00
			`GEMMA 4 DEPLOYMENT - READY TO ACTIVATE`
			`==================================================`

			`MODEL:`
			`Path: /root/wizards/ezra/home/models/gemma4/gemma-4-31B-it-Q4_K_M.gguf`
			`Size: 18.3 GB`
			`Quantization: Q4_K_M (4.77 bits per weight)`
			`Context: 16k tokens (configurable up to 262k)`

			`SERVER:`
			`Port: 11435`
			`URL: http://127.0.0.1:11435`
			`Threads: 4 (CPU-only)`
			`Max tokens: 4096`
			`Tool calling: Enabled (--jinja)`

			`TO ACTIVATE:`
			`1. Start server: ~/home/start-gemma4.sh`
			`2. Switch config: ~/home/switch-to-gemma4.sh`
			`3. Restart Ezra`

			`TO REVERT:`
			`Config backup created automatically on switch`
			`Or manually edit ~/home/config.yaml`

			`STATUS:`
			`Model: ✓ Downloaded`
			`Config: ✓ Ready`
			`Server: ⏳ Needs llama-server binary`

			`NOTE:`
			`llama.cpp added Gemma 4 support in recent commits.`
			`Prebuilt binaries will be available soon.`
			`Or build from: https://github.com/ggerganov/llama.cpp`