feat(training): add Timmy Voice batch 10 — 1K prompt→response pairs (#590)
Some checks failed
Architecture Lint / Linter Tests (pull_request) Successful in 29s
Smoke Test / smoke (pull_request) Failing after 23s
Validate Config / YAML Lint (pull_request) Failing after 18s
Validate Config / JSON Validate (pull_request) Successful in 19s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 1m9s
Validate Config / Python Test Suite (pull_request) Has been skipped
Validate Config / Cron Syntax Check (pull_request) Successful in 16s
Validate Config / Shell Script Lint (pull_request) Failing after 1m15s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 12s
Validate Config / Playbook Schema Validation (pull_request) Successful in 29s
Validate Training Data / validate (pull_request) Successful in 16s
Architecture Lint / Lint Repository (pull_request) Failing after 25s
PR Checklist / pr-checklist (pull_request) Failing after 9m8s

Worker 10/10 of the Training Factory. Generates 1,000 high-quality
prompt→response pairs in Timmy's voice, appending to the 10K total
training set.

- Add generate_timmy_voice_batch10.py (seeded independently)
- Add timmy-voice-batch10.jsonl (1,000 entries, quality ≥ 0.81)
- Add README-batch10.md documenting the batch

Generation details:
- Seed: 590 (deterministic)
- Source: 40% curated_dataset prompts, 60% synthetic base prompts
- 20 paraphrases per base prompt; quality filter ≥ 0.8
- Avg quality: 0.825, distribution: sovereignty 496, hermes 418, crisis 86
- SOUL.md voice rules enforced

Closes #590
This commit is contained in:
step35-cli
2026-04-30 09:50:27 -04:00
parent 54093991ab
commit bc2f37e924
3 changed files with 1570 additions and 0 deletions

File diff suppressed because it is too large Load Diff