[P2-5] Download qwen3.5:27b and benchmark turbo4 at 64K/128K context #25
Parent: #1 | Depends on: P2-3 (server working), P2-1 (quality validated on 14B)
Why
Hermes-4-14B was the test model. The spec goal is qwen3.5:27b at 128K context, so we need to prove it actually fits and runs.
Steps
Send test prompts. Record: memory usage, tok/s, response quality.
Record same metrics. Check: does it OOM?
Expected from Phase 1 report: ~23.4 GB total, leaving 7.6 GB headroom in 31 GB.
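The metrics listed in the steps (memory usage, tok/s, OOM risk) could be recorded in a small structure like the one below. This is only a sketch: `BenchmarkRun`, its field names, and the 1 GB tolerance are hypothetical and not part of any existing tooling. The 31 GB total and 23.4 GB estimate are the Phase 1 figures quoted above. Measured values would have to come from whatever the server and OS report.

```python
from dataclasses import dataclass

# Figures quoted from the Phase 1 report in this issue.
TOTAL_MEMORY_GB = 31.0
EXPECTED_USAGE_GB = 23.4


@dataclass
class BenchmarkRun:
    """One benchmark run at a given context size (hypothetical record layout)."""

    context_tokens: int      # e.g. 65536 for 64K, 131072 for 128K
    peak_memory_gb: float    # peak memory observed during the run
    generated_tokens: int    # tokens produced by the model
    elapsed_seconds: float   # wall-clock generation time

    @property
    def tokens_per_second(self) -> float:
        # Throughput as generated tokens divided by wall-clock time.
        return self.generated_tokens / self.elapsed_seconds

    @property
    def headroom_gb(self) -> float:
        # Memory left over relative to the 31 GB total.
        return TOTAL_MEMORY_GB - self.peak_memory_gb

    def exceeds_estimate(self, tolerance_gb: float = 1.0) -> bool:
        # True when peak usage is above the Phase 1 estimate plus a tolerance
        # (the 1 GB default is an arbitrary assumption).
        return self.peak_memory_gb > EXPECTED_USAGE_GB + tolerance_gb
```

For example, constructing `BenchmarkRun(131072, 23.4, 512, 64.0)` with placeholder inputs (not measured results) gives a way to compare a 128K run against the expected headroom once real numbers are available.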
Acceptance Criteria
Closed per new fleet policy: no local llama-server for models >5GB. RunPod serverless endpoints only. See Timmy_Foundation/timmy-home#409.