- Add bin/andon-alert.sh with severity levels: INFO, WARNING, CRITICAL, HALT
- Wire into agent-loop: CRITICAL on 3 consecutive failures and dispatch key validation errors
- Wire into start-loops: CRITICAL on model health check failure
- Wire into pane-watchdog: WARNING on successful restart, CRITICAL on max restart exhaustion
- Creates ~/.hermes/andon-{flags,critical,halt} directories for state
The Andon system makes failures visible immediately via Telegram alerts
and local flag files, turning the factory floor into a transparent
operations center.
Closes#347
Closes#126: bin/start-loops.sh -- health check + kill stale + launch all loops
Closes#129: bin/gitea-api.sh -- Python urllib wrapper bypassing security scanner
Closes#130: bin/fleet-status.sh -- one-liner health per wizard with color output
All syntax-checked with bash -n.