[claude] Load fine-tuned Timmy model into Hermes harness (#1104) #1122

Merged
claude merged 1 commit from claude/issue-1104 into main 2026-03-23 18:21:33 +00:00

1 Commit

Author SHA1 Message Date
Alexander Whitestone
9a46d8923b feat: load fine-tuned Timmy model into Hermes harness (AutoLoRA Step 5)
Some checks failed
Tests / lint (pull_request) Successful in 16s
Tests / test (pull_request) Failing after 25m40s
- Add Modelfile.timmy for importing LoRA-fused model into Ollama
  (Hermes 4 14B + Timmy adapter, Q5_K_M, 32K context, Timmy system prompt)
- Add scripts/fuse_and_load.sh: 3-step pipeline automation
  (mlx_lm.fuse → llama.cpp GGUF convert → ollama create)
  Supports SKIP_FUSE and SKIP_CONVERT, plus env var overrides for all paths
- Add scripts/test_timmy_skills.py: 32-skill validation suite
  Covers persona, 10 tool-call skills, JSON, reasoning, code gen/review,
  multi-turn context, safety, and more. Threshold: 25/32 to accept.
  Flags: --model, --skill, --fast, --ollama-url
- Update config/providers.yaml: register the 'timmy' model and promote it
  to the top of the tools fallback chain
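
The three pipeline steps named above (mlx_lm.fuse, llama.cpp GGUF convert, ollama create) can be sketched roughly as follows. This is an illustrative Python sketch, not the contents of scripts/fuse_and_load.sh: every path default, every env var name other than SKIP_FUSE and SKIP_CONVERT, and the convention that "1" means skip are assumptions. It stops at an f16 GGUF; how the real script arrives at Q5_K_M is not stated in this commit message.

```python
import os
import shlex


def build_pipeline(env=None):
    """Return the fuse -> convert -> create commands as argument lists."""
    env = os.environ if env is None else env

    # Hypothetical defaults; the real script's variable names are not shown here.
    base = env.get("BASE_MODEL", "models/hermes-4-14b")
    adapter = env.get("ADAPTER_PATH", "adapters/timmy")
    fused = env.get("FUSED_DIR", "build/timmy-fused")
    gguf = env.get("GGUF_PATH", "build/timmy-f16.gguf")
    convert = env.get("CONVERT_SCRIPT", "llama.cpp/convert_hf_to_gguf.py")
    modelfile = env.get("MODELFILE", "Modelfile.timmy")
    name = env.get("OLLAMA_MODEL", "timmy")

    steps = []
    # Step 1: merge the LoRA adapter into the base weights.
    if env.get("SKIP_FUSE") != "1":  # assumed convention: "1" means skip
        steps.append(["mlx_lm.fuse", "--model", base,
                      "--adapter-path", adapter, "--save-path", fused])
    # Step 2: convert the fused Hugging Face checkpoint to GGUF.
    if env.get("SKIP_CONVERT") != "1":
        steps.append(["python", convert, fused,
                      "--outfile", gguf, "--outtype", "f16"])
    # Step 3: register the model with Ollama from the Modelfile.
    steps.append(["ollama", "create", name, "-f", modelfile])
    return steps


if __name__ == "__main__":
    # Dry run: print the commands instead of executing them.
    for cmd in build_pipeline():
        print(shlex.join(cmd))
```

Building the commands as lists and printing them keeps the sketch side-effect free; passing each list to subprocess.run(cmd, check=True) would be one way to execute it.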

Epic: #1091 Project Bannerlord — AutoLoRA Sovereignty Loop
Fixes #1104

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-23 14:20:44 -04:00
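
For reference, the acceptance rule and CLI flags described for scripts/test_timmy_skills.py in the commit message could look something like the sketch below. Only the flag names and the 25/32 threshold come from the commit message; the defaults, the meaning assigned to each flag, and the shape of the results mapping are assumptions made for illustration.

```python
import argparse

TOTAL_SKILLS = 32      # size of the suite, per the commit message
PASS_THRESHOLD = 25    # minimum passing skills to accept the model


def parse_args(argv=None):
    """Parse the four flags listed in the commit message."""
    parser = argparse.ArgumentParser(description="Timmy skill validation (sketch)")
    parser.add_argument("--model", default="timmy",
                        help="Ollama model name (default is an assumption)")
    parser.add_argument("--skill", default=None,
                        help="run a single skill by name (assumed meaning)")
    parser.add_argument("--fast", action="store_true",
                        help="run a reduced subset (assumed meaning)")
    parser.add_argument("--ollama-url", default="http://localhost:11434",
                        help="Ollama server URL (11434 is Ollama's default port)")
    return parser.parse_args(argv)


def accept(results):
    """Return True when at least PASS_THRESHOLD skills passed.

    `results` maps a skill name to a boolean pass/fail (hypothetical shape).
    """
    passed = sum(1 for ok in results.values() if ok)
    return passed >= PASS_THRESHOLD
```

A caller would collect one boolean per skill and exit non-zero when accept(results) is false, which is one way a CI job could gate on the threshold.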