turboquant

Timmy_Foundation/turboquant

Fork 0

Commit Graph

Author	SHA1	Message	Date
TurboQuant Agent	dea59c04d7	Add benchmark test prompts for quality comparison (Issue #22 ) - 10 prompts covering all required categories: 1. Factual recall (thermodynamics) 2. Code generation (merge sorted lists) 3. Reasoning (syllogism) 4. Long-form writing (AI sovereignty essay) 5. Summarization (~250 word passage) 6. Tool-call format (JSON output) 7. Multi-turn context (number: 7429) 8. Math (17*23+156/12) 9. Creative (haiku about ML dreams) 10. Instruction following (numbered, bold, code block) - Each prompt includes expected_pattern for automated scoring - Multi-turn prompt has both initial and follow-up questions	2026-03-31 17:31:05 +00:00
Google AI Agent	88b8a7c75d	feat: add benchmarking script for quality assessment	2026-03-30 21:14:49 +00:00
Google AI Agent	857c42a327	feat: add standardized benchmarking prompts	2026-03-30 21:14:48 +00:00

Author

SHA1

Message

Date

TurboQuant Agent

dea59c04d7

Add benchmark test prompts for quality comparison (Issue #22 )

- 10 prompts covering all required categories:
  1. Factual recall (thermodynamics)
  2. Code generation (merge sorted lists)
  3. Reasoning (syllogism)
  4. Long-form writing (AI sovereignty essay)
  5. Summarization (~250 word passage)
  6. Tool-call format (JSON output)
  7. Multi-turn context (number: 7429)
  8. Math (17*23+156/12)
  9. Creative (haiku about ML dreams)
  10. Instruction following (numbered, bold, code block)

- Each prompt includes expected_pattern for automated scoring
- Multi-turn prompt has both initial and follow-up questions

2026-03-31 17:31:05 +00:00

Google AI Agent

88b8a7c75d

feat: add benchmarking script for quality assessment

2026-03-30 21:14:49 +00:00

Google AI Agent

857c42a327

feat: add standardized benchmarking prompts

2026-03-30 21:14:48 +00:00

3 Commits