# Big Brain Pod Verification

Verification script for the Big Brain pod running the `gemma3:27b` model.

## Issue #573

[BIG-BRAIN] Verify pod live: gemma3:27b pulled and responding

## Pod Details

- Pod ID: `8lfr3j47a5r3gn`
- GPU: L40S 48GB
- Image: `ollama/ollama:latest`
- Endpoint: `https://8lfr3j47a5r3gn-11434.proxy.runpod.net`
- Cost: $0.79/hour

## Verification Script

`scripts/verify_big_brain.py` checks:

1. `/api/tags` - Verifies `gemma3:27b` is in the model list
2. `/api/generate` - Tests response time (< 30s requirement)
3. Uptime logging for cost awareness

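The first two checks can be sketched as follows. This is a minimal illustration, not the contents of `scripts/verify_big_brain.py`: the helper names (`model_in_tags`, `fetch_json`, `timed_generate`, `run_checks`) and the test prompt are assumptions, while the endpoint, model name, and 30-second limit come from this document. It relies on the Ollama `/api/tags` reply listing models under a `models` array with a `name` field, and on `/api/generate` accepting `"stream": false`.

```python
import json
import time
import urllib.request

# Values taken from the Pod Details section of this document.
ENDPOINT = "https://8lfr3j47a5r3gn-11434.proxy.runpod.net"
MODEL = "gemma3:27b"
MAX_SECONDS = 30  # response-time requirement


def model_in_tags(tags_payload, model=MODEL):
    """Return True if `model` appears in a parsed /api/tags payload."""
    names = [entry.get("name") for entry in tags_payload.get("models", [])]
    return model in names


def fetch_json(url, body=None, timeout=MAX_SECONDS):
    """GET `url`, or POST `body` as JSON when given, and decode the JSON reply."""
    data = json.dumps(body).encode("utf-8") if body is not None else None
    headers = {"Content-Type": "application/json"} if data is not None else {}
    request = urllib.request.Request(url, data=data, headers=headers)
    # Note: `timeout` bounds each socket operation, not the whole request.
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


def timed_generate(endpoint=ENDPOINT, model=MODEL, prompt="Say hello."):
    """Send one non-streaming prompt; return (elapsed_seconds, response_text)."""
    start = time.monotonic()
    reply = fetch_json(
        f"{endpoint}/api/generate",
        {"model": model, "prompt": prompt, "stream": False},
    )
    return time.monotonic() - start, reply.get("response", "")


def run_checks():
    """Run both checks against the live pod (hypothetical entry point)."""
    present = model_in_tags(fetch_json(f"{ENDPOINT}/api/tags"))
    elapsed, _text = timed_generate()
    return present, elapsed, elapsed < MAX_SECONDS
```

Calling `run_checks()` contacts the live pod, so it only works while the pod is running; `model_in_tags` can be exercised offline with a hand-built payload.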
## Usage

```bash
cd scripts
python3 verify_big_brain.py
```

## Output

- Console output with verification results
- `big_brain_verification.json` with detailed results
- Exit code 0 on success, 1 on failure

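One way the JSON report and exit code might be produced is sketched below. The report field names (`model_present`, `response_seconds`, `within_limit`, `passed`) and the function names are assumptions for illustration; the actual layout of `big_brain_verification.json` is not specified here.

```python
import json


def build_report(model_present, elapsed_seconds, limit_seconds=30):
    """Collect check results into a dict (field names are hypothetical)."""
    within_limit = elapsed_seconds < limit_seconds
    return {
        "model_present": model_present,
        "response_seconds": round(elapsed_seconds, 2),
        "within_limit": within_limit,
        "passed": model_present and within_limit,
    }


def exit_code(report):
    """Map the report to the documented exit code: 0 on success, 1 on failure."""
    return 0 if report["passed"] else 1


def write_report(report, path="big_brain_verification.json"):
    """Write the report as indented JSON to `path`."""
    with open(path, "w", encoding="utf-8") as handle:
        json.dump(report, handle, indent=2)
```

A script could then finish with `sys.exit(exit_code(report))` after calling `write_report(report)`, so callers such as CI jobs can act on the result.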
## Acceptance Criteria

- [x] `/api/tags` returns `gemma3:27b` in model list
- [x] `/api/generate` responds to a simple prompt in < 30s
- [x] uptime logged (cost awareness: $0.79/hr)

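For the cost-awareness criterion, logged uptime converts to an estimated spend with simple arithmetic. The sketch below assumes uptime is available in seconds; how the script actually measures uptime is not stated here, and `estimated_cost` is a hypothetical helper.

```python
HOURLY_RATE_USD = 0.79  # pod cost from the Pod Details section


def estimated_cost(uptime_seconds, hourly_rate=HOURLY_RATE_USD):
    """Estimate spend in USD for a given uptime, rounded to cents."""
    return round(uptime_seconds / 3600 * hourly_rate, 2)
```

At this rate, a pod left running for a full day (86,400 seconds) costs 24 × $0.79 = $18.96.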
## Previous Issues

The previous pod (`elr5vkj96qdplf`) used the broken `runpod/ollama:latest` image and never started. Fix: use `ollama/ollama:latest`. Volume mount at `/root/.ollama` for model persistence.