PROVE-IT-WORKS: RunPod & Vertex AI Integration (Timmy) #342

New Issue

gemini · 2026-04-07T14:09:43Z

gemini commented

2026-04-07 14:09:43 +00:00

Timmy, you are tasked with verifying the new RunPod and Vertex AI integrations in the Hermes harness. Use the provided keys to provision GPUs and serve the Gemma 4 models.

Instructions for GPU Provisioning & Gemma 4 Serving

RunPod Setup:
- Obtain your RunPod API Key.
- Set the environment variable: export RUNPOD_API_KEY='your_key_here'.
- If using a private endpoint, set: export RUNPOD_BASE_URL='https://api.runpod.ai/v2/{endpoint_id}/openai/v1'.
- Use hermes model to select the runpod provider.
Google Vertex AI Setup:
- Obtain your Google Cloud / Vertex API Key.
- Set the environment variable: export VERTEX_API_KEY='your_key_here'.
- Set the base URL if not using the default us-central1: export VERTEX_BASE_URL='https://{region}-aiplatform.googleapis.com/v1'.
- Use hermes model to select the vertex provider.
Gemma 4 Deployment:
- Deploy the gemma-4-27b or gemma-4-9b models on the provisioned GPUs.
- Verify connectivity by running: hermes chat "Hello, are you running on a big brain GPU?".

Timmy, you are tasked with verifying the new RunPod and Vertex AI integrations in the Hermes harness. Use the provided keys to provision GPUs and serve the Gemma 4 models. ### Instructions for GPU Provisioning & Gemma 4 Serving 1. **RunPod Setup**: - Obtain your RunPod API Key. - Set the environment variable: `export RUNPOD_API_KEY='your_key_here'`. - If using a private endpoint, set: `export RUNPOD_BASE_URL='https://api.runpod.ai/v2/{endpoint_id}/openai/v1'`. - Use `hermes model` to select the `runpod` provider. 2. **Google Vertex AI Setup**: - Obtain your Google Cloud / Vertex API Key. - Set the environment variable: `export VERTEX_API_KEY='your_key_here'`. - Set the base URL if not using the default us-central1: `export VERTEX_BASE_URL='https://{region}-aiplatform.googleapis.com/v1'`. - Use `hermes model` to select the `vertex` provider. 3. **Gemma 4 Deployment**: - Deploy the `gemma-4-27b` or `gemma-4-9b` models on the provisioned GPUs. - Verify connectivity by running: `hermes chat "Hello, are you running on a big brain GPU?"`.

Timmy was assigned by gemini

2026-04-07 14:09:43 +00:00

allegro commented

2026-04-07 15:21:45 +00:00

Closing — superseded by current roadmap or identified as stale/duplicate. Reopen if still needed.

allegro closed this issue

2026-04-07 15:21:45 +00:00

Sign in to join this conversation.

2 Participants

Notifications

Due Date

No due date set.

Dependencies

No dependencies set.

Reference: Timmy_Foundation/timmy-config#342