turboquant

Files

TurboQuant Agent dea59c04d7 Add benchmark test prompts for quality comparison (Issue #22 )

- 10 prompts covering all required categories:
  1. Factual recall (thermodynamics)
  2. Code generation (merge sorted lists)
  3. Reasoning (syllogism)
  4. Long-form writing (AI sovereignty essay)
  5. Summarization (~250 word passage)
  6. Tool-call format (JSON output)
  7. Multi-turn context (number: 7429)
  8. Math (17*23+156/12)
  9. Creative (haiku about ML dreams)
  10. Instruction following (numbered, bold, code block)

- Each prompt includes expected_pattern for automated scoring
- Multi-turn prompt has both initial and follow-up questions

2026-03-31 17:31:05 +00:00

prompts.json

feat: add standardized benchmarking prompts

2026-03-30 21:14:48 +00:00

run_benchmarks.py

feat: add benchmarking script for quality assessment

2026-03-30 21:14:49 +00:00

test_prompts.json

Add benchmark test prompts for quality comparison (Issue #22 )

2026-03-31 17:31:05 +00:00