[FLEET-006] Implement Automated Health Checks #559
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Phase 2: Automation | Capacity cost: 20 | Produces: Uptime increase
Prerequisite for ALL other Phase 2 work. You cannot automate what you cannot measure.
Health check every 5 min: SSH, processes alive, disk<90%, memory<90%. Log results. Alert on failure.
Acceptance
Paperclips: This is buying the first AutoClipper. It is weak by itself but it starts the cascade.
Bezalel delivered (2026-04-07):
scripts/fleet_health_probe.shwith SSH, disk, memory, and process checks.infrastructure/cron/fleet-health.crontabfor 5-minute scheduling.Closing as completed.