[SELF-IMPROVE] CI testbed dead-man alert — auto-file p0 when 67.205.155.108 unreachable #1477

Open
opened 2026-04-05 23:20:24 +00:00 by Timmy · 1 comment
Owner

Problem

CI testbed at 67.205.155.108 went down silently. Nobody noticed until ordered burn-down check. Dead CI silently blocks all auto-merges.

What To Do

Create ~/.hermes/bin/ci-deadman.sh:

  1. Every 5 minutes, SSH to 67.205.155.108 with 10s timeout
  2. If SSH succeeds: confirm Docker running and act_runner alive
  3. If SSH fails: write to ALERT.log with timestamp
  4. If down for 3 consecutive checks (>15 min): auto-file p0 issue titled "[P0] CI DEAD"
  5. If CI recovers: close the p0 issue with recovery timestamp
  6. State file: ~/.hermes/ci-deadman-state.json

Acceptance Criteria

  • Script at ~/.hermes/bin/ci-deadman.sh exists and is executable
  • Running it shows current CI status (reachable/unreachable, docker status)
  • Script writes to ~/.hermes/ci-deadman-state.json
  • 3 consecutive failures auto-file a p0 issue
  • Comment on epic #1474 with script path, sample output, state file

Labels: self-improve, priority-high

Assignee: claude

## Problem CI testbed at 67.205.155.108 went down silently. Nobody noticed until ordered burn-down check. Dead CI silently blocks all auto-merges. ## What To Do Create ~/.hermes/bin/ci-deadman.sh: 1. Every 5 minutes, SSH to 67.205.155.108 with 10s timeout 2. If SSH succeeds: confirm Docker running and act_runner alive 3. If SSH fails: write to ALERT.log with timestamp 4. If down for 3 consecutive checks (>15 min): auto-file p0 issue titled "[P0] CI DEAD" 5. If CI recovers: close the p0 issue with recovery timestamp 6. State file: ~/.hermes/ci-deadman-state.json ## Acceptance Criteria - [ ] Script at ~/.hermes/bin/ci-deadman.sh exists and is executable - [ ] Running it shows current CI status (reachable/unreachable, docker status) - [ ] Script writes to ~/.hermes/ci-deadman-state.json - [ ] 3 consecutive failures auto-file a p0 issue - [ ] Comment on epic #1474 with script path, sample output, state file ## Labels: self-improve, priority-high ## Assignee: claude
claude was assigned by Timmy 2026-04-05 23:20:24 +00:00
Author
Owner

Alexander ordered: go to Issue #427 and comment on OpenProse as force multiplier. Do it now.

Alexander ordered: go to [Issue #427](https://forge.alexanderwhitestone.com/Timmy_Foundation/timmy-home/issues/427) and comment on OpenProse as force multiplier. Do it now.
Sign in to join this conversation.
No Label
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: Rockachopa/Timmy-time-dashboard#1477