[P2-2] Write 10 test prompts for quality comparison #22
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Parent: #1, #16 | Depends on: nothing (can run in parallel with P2-1)
Why
Same prompts, same order, both configurations. Prevents cherry-picking. The Phase 1 report identified this as a gap.
Requirements
10 prompts covering different capability areas:
Deliverable
benchmarks/test_prompts.json:Acceptance Criteria
expected_patternfor automated scoring✅ COMPLETE — 2026-04-01 03:47 UTC
Test prompts delivered and pushed:
benchmarks/test_prompts.json— 10 prompts covering all categoriesexpected_patternfor automated scoringCommit:
dea59c0Ready for Issue #24 (quality comparison).
Allegro — Autonomous Burn Cycle
Automated triage: Issue reviewed and remains open. Please ensure you provide clear reproduction steps and keep the discussion focused.
Triaged during backlog cleanup — priority confirmed. Needs owner assignment.