[EVALUATE] Qwen 3.6 Plus — Integrate into Fleet Rotation and Start Churning #411
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Mission
Evaluate Qwen 3.6 Plus, integrate it into the fleet model rotation, and start churning production work through it.
Model Intel
qwen/qwen3.6-plus:freehermes_cli/models.pyInitial Evaluation Results (Ezra)
Observation: The model is verbose but coherent. The free endpoint is NOT suitable for high-frequency churn.
Implementation Plan
Phase 1 — Stabilize Access (BLOCKING)
Phase 2 — Profile & Tooling
qwen36using OpenRouter provider.toolsandtool_choiceare supported).Phase 3 — Churn Integration
Acceptance Criteria
qwen36Hermes profile committed totimmy-configor local profiles.Related
qwen/qwen3.6-plus:freein Hermes agentOwner
Unassigned — needs a wizard to pick up Phase 1 access stabilization.
Update: Profile Created + Rate-Limit Blocker Confirmed
Ezra created the Hermes profile for Qwen 3.6:
~/.hermes/profiles/qwen36/config.yamlwith OpenRouter provider + Qwen 3.6 as defaultRate-Limit Reality
The
qwen/qwen3.6-plus:freeendpoint on OpenRouter is heavily throttled by Alibaba upstream. In testing:Verdict: Free tier is NOT viable for churn. We need a paid access path.
Recommended Next Move
Two paths to unblock churn:
Path A — Alibaba Cloud BYOK (fastest)
Path B — RunPod Serverless (aligns with #409)
Deploy Qwen 3.6 as a vLLM serverless endpoint. Ezra can generate the deployment spec immediately.
Which path do you want to fund, Alexander?
Critical Finding: Qwen 3.6 is API-Only
Ezra searched HuggingFace for open-weight Qwen 3.6 variants. None exist.
Qwen3.6Qwen3.6-32BQwen3.6-Plusqwen-3.6Implication: RunPod serverless deployment (Path B) is NOT POSSIBLE for Qwen 3.6. There is no GGUF/AWQ/Safetensors to serve.
The Only Viable Path
DashScope direct API (Alibaba's official model API platform).
Ezra updated the
qwen36profile:dashscopeprovider with base URLhttps://dashscope.aliyuncs.com/compatible-mode/v1qwen3.6-plusREADME.mdwith setup instructionsWhat You Need to Do
DASHSCOPE_API_KEY~/.hermes/profiles/qwen36/.envhermes -p qwen36Until the DashScope key is in place, Qwen 3.6 churn is blocked.
Major Update: Model ID Confirmed + Churn Artifacts Ready
Ezra scraped the official Alibaba Cloud help docs (Chinese) and found definitive proof:
https://dashscope.aliyuncs.com/compatible-mode/v1qwen3.6-plusArtifacts Created & Committed
Ezra produced 3 ready-to-run files and committed them to
ezra/qwen36-churn:benchmark.pypython benchmark.py --provider dashscope --model qwen3.6-pluschurn_scoper.pypython churn_scoper.py --repo Timmy_Foundation/timmy-home --limit 5INTEGRATION_PLAN.mdCommit:
839b4d5inezra/ezra-environmentProfile Updated
~/.hermes/profiles/qwen36/config.yamlnow uses the confirmed exact model ID.What Remains
One blocker: You still need to obtain a
DASHSCOPE_API_KEYfrom https://dashscope.aliyun.com/Once that key is in
~/.hermes/profiles/qwen36/.env, churn can start immediately with the scripts above.Pricing Intel
DashScope pricing is behind login, but the
qwen3.6-plusmodel is a paid tier (not free). Expect Alibaba Cloud pay-as-you-go billing. Given the $200 RunPod budget, consider capping Qwen 3.6 burn at ~$50-100/month until quality is proven.Recommendation
Assign @allegro or @Timmy to:
benchmark.pychurn_scoper.pyon a low-stakes repo