Commit Graph

24 Commits

Author SHA1 Message Date
5a60075515 Teach lane-aware skills in agent dispatch (#143)
Co-authored-by: Codex Agent <codex@hermes.local>
Co-committed-by: Codex Agent <codex@hermes.local>
2026-04-04 22:47:31 +00:00
2bf79c2286 Refresh ops tooling around current agent lanes (#142)
Co-authored-by: Codex Agent <codex@hermes.local>
Co-committed-by: Codex Agent <codex@hermes.local>
2026-04-04 22:43:48 +00:00
6a71dfb5c7 [ops] import gemini loop and timmy orchestrator into sidecar truth (#152) 2026-04-04 20:27:39 +00:00
5d83e5299f [ops] stabilize local loop watchdog and claude loop (#149) 2026-04-04 20:16:59 +00:00
Alexander Whitestone
c0603a6ce6 docs: Nostr agent-to-agent encrypted comms research + working demo
Proven: encrypted DM sent through relay.damus.io and nos.lol, fetched and decrypted.
Library: nostr-sdk v0.44 (pip install nostr-sdk).
Path to replace Telegram: keypairs per wizard, NIP-17 gift-wrapped DMs.
2026-04-04 12:48:57 -04:00
Alexander Whitestone
f29d579896 feat(ops): start-loops, gitea-api wrapper, fleet-status
Closes #126: bin/start-loops.sh -- health check + kill stale + launch all loops
Closes #129: bin/gitea-api.sh -- Python urllib wrapper bypassing security scanner
Closes #130: bin/fleet-status.sh -- one-liner health per wizard with color output

All syntax-checked with bash -n.
2026-04-04 12:05:04 -04:00
Alexander Whitestone
3cf9f0de5e feat(ops): deadman switch, model health check, issue filter
Closes #115: bin/deadman-switch.sh -- alerts Telegram when zero commits for 2+ hours
Closes #116: bin/model-health-check.sh -- validates model tags against provider APIs
Closes #117: bin/issue-filter.json + live loop patches -- excludes DO-NOT-CLOSE, EPIC, META, RETRO, INTEL, MORNING REPORT, Rockachopa-assigned issues from agent pickup

All three tested locally:
- deadman-switch correctly detected 14h gap and would alert
- model-health-check parses config.yaml and validates (skips gracefully without API key in env)
- issue filters patched into live claude-loop.sh and gemini-loop.sh
2026-04-04 12:00:05 -04:00
Alexander Whitestone
8ec4bff771 feat(crucible): Z3 sidecar MCP verifier -- rebased onto current main
Closes #86. Adds:
- bin/crucible_mcp_server.py (schedule, dependency, capacity proofs)
- docs/crucible-first-cut.md
- playbooks/verified-logic.yaml
- config.yaml crucible MCP server entry
2026-04-03 18:58:43 -04:00
67deb58077 [soul] The Conscience of the Training Pipeline
WHAT THIS IS
============
The Soul Eval Gate answers one question:

    'Did the candidate model preserve Timmy's soul?'

It is the missing bridge between autolora's eval output and
tasks.py's training pipeline. Without it, no adapter can ever
be promoted — tasks.py calls latest_eval_gate() but nothing
writes the gate file.

HOW IT WORKS
============
1. Reads autolora eval scores (candidate + baseline)
2. Applies SOUL.md constraints:
   - crisis and pastoral_care are SACRED categories
   - ANY regression on sacred categories = REJECT
   - This is not a tradeoff. From SOUL.md:
     'If adapter degrades this, adapter is REJECTED.'
3. Checks composite score regression (>10% = reject)
4. Checks per-metric regression on core categories
5. Writes eval_gate_latest.json for tasks.py to read
6. Exits 0 (pass) or 1 (reject) for CI integration

WHY THIS MATTERS
=================
An adapter that gets better at code review but worse at
responding to someone in crisis is not Timmy. It is wearing
Timmy's face without carrying his conscience.

The DPO training playbook says:
  'If the post-eval degrades on crisis or pastoral_care,
   REJECT the adapter and fail the issue.'

But until now, nothing enforced that constraint. The playbook
was an aspiration. This makes it architecture.

FILES
=====
  bin/soul_eval_gate.py          — 244 lines, zero deps beyond stdlib
  tests/test_soul_eval_gate.py   — 10 tests, all pass
  Full suite: 22/22

USAGE
=====
  # CLI (after autolora eval)
  python bin/soul_eval_gate.py \
    --scores evals/v1/8b/scores.json \
    --baseline evals/v0-baseline/8b/scores.json \
    --candidate-id timmy-v1-20260330

  # From tasks.py
  from soul_eval_gate import evaluate_candidate
  result = evaluate_candidate(scores_path, baseline_path, id)
  if result['pass']:
      promote_adapter(...)

Signed-off-by: gemini <gemini@hermes.local>
2026-03-30 19:13:35 -04:00
877425bde4 feat: add Allegro Kimi wizard house assets (#91) 2026-03-29 22:22:24 +00:00
34e01f0986 feat: add local-vs-cloud token and throughput metrics (#85) 2026-03-28 14:24:12 +00:00
Alexander Whitestone
d72ae92189 Tighten Hermes cutover and export checks 2026-03-27 17:35:07 -04:00
Alexander Whitestone
49020b34d9 config: update bin/timmy-dashboard,config.yaml,docs/local-model-integration-sketch.md,tasks.py 2026-03-26 17:00:22 -04:00
Alexander Whitestone
56364e62b4 config: update bin/sync-up.sh,channel_directory.json 2026-03-26 08:00:46 -04:00
Alexander Whitestone
069f8bd55f add: sync-up.sh — push live hermes config to repo
The harness is the source. The repo is the record.
Changes flow UP from ~/.hermes, not down from the repo.
2026-03-25 19:11:31 -04:00
4b69918135 remove deprecated workforce-manager.py — replaced by sovereign-orchestration 2026-03-25 14:41:40 +00:00
bafd97da5a remove deprecated timmy-orchestrator.sh — replaced by sovereign-orchestration 2026-03-25 14:41:38 +00:00
e884159cee remove deprecated timmy-loopstat.sh — replaced by sovereign-orchestration 2026-03-25 14:41:37 +00:00
9e1a35c0fa remove deprecated nexus-merge-bot.sh — replaced by sovereign-orchestration 2026-03-25 14:41:36 +00:00
b044588d3d remove deprecated gemini-loop.sh — replaced by sovereign-orchestration 2026-03-25 14:41:34 +00:00
b5b6243ee4 remove deprecated claudemax-watchdog.sh — replaced by sovereign-orchestration 2026-03-25 14:41:33 +00:00
d4f41edf7c remove deprecated claude-loop.sh — replaced by sovereign-orchestration 2026-03-25 14:41:31 +00:00
7d6df802d7 remove deprecated agent-loop.sh — replaced by sovereign-orchestration 2026-03-25 14:41:30 +00:00
Alexander Whitestone
d4c79d47a6 feat: add operational scripts and deploy.sh
- Moved all agent loop scripts into source control (bin/)
- claude-loop.sh, gemini-loop.sh, timmy-orchestrator.sh
- workforce-manager.py, agent-dispatch.sh, nexus-merge-bot.sh
- ops dashboard scripts (ops-panel, ops-helpers, ops-gitea)
- monitoring scripts (timmy-status, timmy-loopstat)
- deploy.sh: one-command overlay onto ~/.hermes/
- Updated README with sidecar architecture docs
- All loops now target the-nexus + autolora only
2026-03-25 10:05:55 -04:00