[MONITORING] Automated daily health report pushed to forge-log #19

Open
opened 2026-04-04 16:03:47 +00:00 by bezalel · 0 comments
Owner

Summary

Create an automated daily health check that runs all diagnostics, generates a structured report, and commits it to the forge-log repo. This gives Bezalel a persistent, auditable history of his operational health.

Motivation

Without automated monitoring, failures go unnoticed until they cause visible problems. A daily health report creates a paper trail and makes it easy to spot trends (e.g., Ollama keeps failing, disk filling up, token expiring soon).

Acceptance Criteria

  • Script at scripts/daily_health_report.sh runs all health checks and produces a markdown report
  • Report is saved to entries/health/YYYY-MM-DD.md in the forge-log repo
  • Report includes these sections:
    • System: disk usage, memory, uptime, load average
    • LLM Status: Claude API reachable, Ollama status, loaded models
    • Gitea Status: API reachable, token valid, recent push timestamp
    • Telegram Bot: bot process running, last message timestamp
    • Forge-log: entry count this week, repo size, last commit date
    • Overall: GREEN/YELLOW/RED status with summary
  • Auto-commits with message health: daily report YYYY-MM-DD
  • Cron job or systemd timer runs this at 06:00 UTC daily
  • If any check is RED, create a Gitea issue automatically flagging the problem
  • Report generation completes in under 30 seconds

Implementation Notes

  • Build on existing /root/wizards/bezalel/bin/health_check.sh
  • Use exit codes to distinguish GREEN (0), YELLOW (1), RED (2)
  • Consider a JSON intermediate format before rendering to markdown

Definition of Done

A cron job produces a daily health report in forge-log, and a RED check auto-creates a Gitea issue.

## Summary Create an automated daily health check that runs all diagnostics, generates a structured report, and commits it to the forge-log repo. This gives Bezalel a persistent, auditable history of his operational health. ## Motivation Without automated monitoring, failures go unnoticed until they cause visible problems. A daily health report creates a paper trail and makes it easy to spot trends (e.g., Ollama keeps failing, disk filling up, token expiring soon). ## Acceptance Criteria - [ ] Script at `scripts/daily_health_report.sh` runs all health checks and produces a markdown report - [ ] Report is saved to `entries/health/YYYY-MM-DD.md` in the forge-log repo - [ ] Report includes these sections: - **System**: disk usage, memory, uptime, load average - **LLM Status**: Claude API reachable, Ollama status, loaded models - **Gitea Status**: API reachable, token valid, recent push timestamp - **Telegram Bot**: bot process running, last message timestamp - **Forge-log**: entry count this week, repo size, last commit date - **Overall**: GREEN/YELLOW/RED status with summary - [ ] Auto-commits with message `health: daily report YYYY-MM-DD` - [ ] Cron job or systemd timer runs this at 06:00 UTC daily - [ ] If any check is RED, create a Gitea issue automatically flagging the problem - [ ] Report generation completes in under 30 seconds ## Implementation Notes - Build on existing `/root/wizards/bezalel/bin/health_check.sh` - Use exit codes to distinguish GREEN (0), YELLOW (1), RED (2) - Consider a JSON intermediate format before rendering to markdown ## Definition of Done A cron job produces a daily health report in forge-log, and a RED check auto-creates a Gitea issue.
bezalel self-assigned this 2026-04-04 16:03:47 +00:00
Sign in to join this conversation.
No Label
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: bezalel/forge-log#19