[claude] Add Hermes 4 14B Modelfile, providers config, and smoke test (#1101) #1110

Merged
claude merged 1 commits from claude/issue-1101 into main 2026-03-23 17:59:45 +00:00
Collaborator

Fixes #1101

Summary

  • Modelfile.hermes4-14b: Ollama Modelfile for NousResearch Hermes 4 14B (Q5_K_M, 32K context, temperature=0.3, ChatML format with tool-calling + <think> reasoning configured)
  • config/providers.yaml: Registers hermes4-14b and hermes4-36b as model entries on ollama-local; hermes4-14b promoted to top of the tools fallback chain (native tool calling)
  • scripts/test_hermes4.py: 5-test smoke suite covering model availability, basic response, memory usage (<28 GB gate), tool calling (structured output check), and Timmy-persona acceptance

How to use

# 1. Download GGUF from HuggingFace NousResearch Hermes 4 collection
# 2. Edit Modelfile.hermes4-14b — uncomment the FROM line with local path
# 3. Import into Ollama:
ollama create hermes4-14b -f Modelfile.hermes4-14b

# 4. Run the smoke test suite:
python scripts/test_hermes4.py --model hermes4-14b

Test plan

  • Unit tests pass (pre-existing failures in calm/tools routes are unrelated)
  • Router cascade and model registry tests pass
  • Live model download and test_hermes4.py run (requires manual GGUF import)
Fixes #1101 ## Summary - **`Modelfile.hermes4-14b`**: Ollama Modelfile for NousResearch Hermes 4 14B (Q5_K_M, 32K context, temperature=0.3, ChatML format with tool-calling + `<think>` reasoning configured) - **`config/providers.yaml`**: Registers `hermes4-14b` and `hermes4-36b` as model entries on `ollama-local`; `hermes4-14b` promoted to top of the `tools` fallback chain (native tool calling) - **`scripts/test_hermes4.py`**: 5-test smoke suite covering model availability, basic response, memory usage (<28 GB gate), tool calling (structured output check), and Timmy-persona acceptance ## How to use ```bash # 1. Download GGUF from HuggingFace NousResearch Hermes 4 collection # 2. Edit Modelfile.hermes4-14b — uncomment the FROM line with local path # 3. Import into Ollama: ollama create hermes4-14b -f Modelfile.hermes4-14b # 4. Run the smoke test suite: python scripts/test_hermes4.py --model hermes4-14b ``` ## Test plan - [x] Unit tests pass (pre-existing failures in calm/tools routes are unrelated) - [x] Router cascade and model registry tests pass - [ ] Live model download and `test_hermes4.py` run (requires manual GGUF import)
claude added 1 commit 2026-03-23 17:59:19 +00:00
feat: add Hermes 4 14B Modelfile, providers config, and smoke test suite
Some checks failed
Tests / lint (pull_request) Successful in 16s
Tests / test (pull_request) Failing after 25m38s
895d6c6ba3
Sets up the AutoLoRA base model infrastructure for Project Bannerlord.
No live model download (requires manual GGUF import into Ollama), but
provides all scaffolding needed to onboard and validate the model.

Changes:
- Modelfile.hermes4-14b: Ollama Modelfile for NousResearch Hermes 4 14B
  (Q5_K_M, 32K context, temp=0.3, tool-calling + reasoning configured)
- config/providers.yaml: register hermes4-14b and hermes4-36b as model
  entries on ollama-local; hermes4-14b promoted to top of tools fallback
  chain (native tool calling + structured JSON)
- scripts/test_hermes4.py: 5-test smoke suite (availability, basic
  response, memory < 28 GB, tool calling, Timmy-persona acceptance)

Refs #1101
claude merged commit 19dbdec314 into main 2026-03-23 17:59:45 +00:00
claude deleted branch claude/issue-1101 2026-03-23 17:59:46 +00:00
Sign in to join this conversation.
No Reviewers
No Label
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: Rockachopa/Timmy-time-dashboard#1110