[PROVE-IT] Timmy: Wire RunPod/Vertex AI Gemma 4 to Mac Hermes #543
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Objective
Get "Big Brain" (Gemma 4) serving on a GPU instance and wired to the Mac Hermes harness.
Instructions
config.yamlwith the new providerbig_brain.hermes chat --model gemma4 --provider big_brain.Acceptance Criteria
Timmy GPU Setup Instructions
RunPod Provisioning
Get RunPod API:
Provision GPU Pod:
Deploy Model:
Wire to Mac Hermes:
Alternative: Vertex AI
Create Vertex Endpoint:
Wire to Hermes:
Acceptance Criteria
Deploying Gemma 4 Big Brain on RunPod + Ollama
Step 1: Create RunPod Pod
Step 2: Get Pod IP
Step 3: Deploy Gemma 4 via Ollama
Step 4: Wire to Mac Hermes
Alternative: Vertex AI Gemma 4
Vertex AI REST Endpoint Format:
Auth: Use the service account key at ~/.config/vertex/key (already provided)
Request Format:
Auth Header: Bearer token from Google Cloud auth
Acceptance Criteria
Deploying Gemma 4 Big Brain on RunPod + Ollama
Step 1: Create RunPod Pod
Step 2: Get Pod IP from RunPod dashboard or API
Step 3: Deploy Ollama + Gemma
Step 4: Wire to Mac Hermes
Add to
~/.hermes/config.yaml:Test:
Alternative: Vertex AI
Vertex AI REST endpoint format:
Auth: Use the service account key at
~/.config/vertex/keyRequest Format:
Acceptance Criteria
Deployment Status Update
RunPod: ❌ FAILED — No GPU Availability
Next Options:
Option 1: Try Different GPU on RunPod
Option 2: Vertex AI (Google Cloud)
Vertex AI REST Endpoint Format:
Authentication:
Gemma 4 Model IDs:
gemma-3-27b-it— 27B parameter instruction-tuned (big brain)gemma-3-12b-it— 12B parametergemma-3-4b-it— 4B parameter (fastest)Example Request:
Hermes Integration:
After getting Vertex working, add to Hermes config:
Recommendation
[BURN-DOWN] Dispatched to Code Claw (claw-code worker) as part of nightly burn-down cycle. Heartbeat active.
[BURN-DOWN UPDATE] Code Claw failed to produce work. Timmy is handling this directly after first attempt failed. Claw delegation deprecated for critical path.
🟠 Code Claw (OpenRouter qwen/qwen3.6-plus:free) picking up this issue via 15-minute heartbeat.
Timestamp: 2026-04-07T03:12:00Z
⚠️ Code Claw made no durable code changes on this pass.
Exit: 1
This likely means the issue is too broad, not code-fit, or needs human clarification.
🟠 Code Claw (OpenRouter qwen/qwen3.6-plus:free) picking up this issue via 15-minute heartbeat.
Timestamp: 2026-04-07T03:12:30Z
⚠️ Code Claw made no durable code changes on this pass.
Exit: 1
This likely means the issue is too broad, not code-fit, or needs human clarification.
🟠 Code Claw (OpenRouter qwen/qwen3.6-plus:free) picking up this issue via 15-minute heartbeat.
Timestamp: 2026-04-07T03:21:09Z
Claw Code failed to produce work (exit=1, has_work=false). Timmy taking over directly.