2 Commits

Author SHA1 Message Date
STEP35 FREE BURN
d913be508b fix(#442): deprecate ad-hoc recovery scripts; add ansible structure tests
Some checks failed
Architecture Lint / Linter Tests (pull_request) Successful in 22s
Smoke Test / smoke (pull_request) Failing after 21s
Validate Config / YAML Lint (pull_request) Failing after 15s
Validate Config / JSON Validate (pull_request) Successful in 18s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 57s
Validate Config / Python Test Suite (pull_request) Has been skipped
Validate Config / Cron Syntax Check (pull_request) Successful in 13s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 14s
Validate Config / Shell Script Lint (pull_request) Failing after 53s
Validate Config / Playbook Schema Validation (pull_request) Successful in 25s
Architecture Lint / Lint Repository (pull_request) Failing after 20s
PR Checklist / pr-checklist (pull_request) Failing after 3m32s
- Add DEPRECATION headers to overlapping ad-hoc recovery scripts:
  - config_drift.py, config_drift_detector.py → golden_state role
  - self_healing.py → deadman_switch role
  - reset_pipeline_state.py → agent_startup playbook
  - provision_wizard.py → site.yml + wizard_base role
  - cron_audit.py, cron-audit-662.py → cron_manager role

- Add tests/test_ansible_structure.py (19 pytest checks) validating
  the complete canonical ansible/ structure required by #442

- Update docs/CANONICAL_SERVICES.md with explicit supersession table
  mapping each ad-hoc mechanism to its Ansible replacement

- ansible/README.md: document webhook setup as required step

This establishes ansible/ as the single canonical source of truth
for fleet lifecycle management and formally retires the overlapping
ad-hoc recovery mechanisms.

Closes #442
2026-04-29 01:41:34 -04:00
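Note on d913be508b: the contents of tests/test_ansible_structure.py are not shown on this page, so the following is only a minimal sketch of what a structure check of this kind could look like. The role names, site.yml, and ansible/README.md come from the commit message. The `ansible/roles/<name>` layout, the `REQUIRED_PATHS` list, and the `missing_paths` helper are assumptions made for illustration, not the actual 19 checks.

```python
from pathlib import Path
from typing import Iterable, List

# Names taken from the commit message; the directory layout
# (ansible/roles/<role>) is an ASSUMPTION based on common Ansible convention.
REQUIRED_PATHS = [
    "ansible/README.md",
    "ansible/site.yml",
    "ansible/roles/golden_state",
    "ansible/roles/deadman_switch",
    "ansible/roles/wizard_base",
    "ansible/roles/cron_manager",
]


def missing_paths(repo_root: Path, required: Iterable[str] = REQUIRED_PATHS) -> List[str]:
    """Return the required relative paths that do not exist under repo_root.

    Hypothetical helper: an empty list means the expected structure is present.
    """
    return [rel for rel in required if not (repo_root / rel).exists()]


if __name__ == "__main__":
    # Run from the repository root; prints any paths that are absent.
    for rel in missing_paths(Path.cwd()):
        print(f"missing: {rel}")
```

In a pytest suite, each entry could become its own check (for example via `pytest.mark.parametrize`), so a failure names the exact path that is missing.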
efc42968e8 Audit cron/launchd/daemon — remove dead jobs and document canonical services
Some checks failed
Architecture Lint / Linter Tests (push) Successful in 17s
Validate Config / YAML Lint (push) Failing after 13s
Smoke Test / smoke (push) Failing after 15s
Validate Config / JSON Validate (push) Successful in 17s
Validate Config / Cron Syntax Check (push) Successful in 10s
Validate Config / Deploy Script Dry Run (push) Successful in 11s
Validate Config / Python Syntax & Import Check (push) Failing after 47s
Validate Config / Shell Script Lint (push) Failing after 48s
Validate Config / Python Test Suite (push) Has been skipped
Validate Config / Playbook Schema Validation (push) Successful in 22s
Architecture Lint / Lint Repository (push) Failing after 21s
Architecture Lint / Linter Tests (pull_request) Successful in 13s
Validate Config / YAML Lint (pull_request) Failing after 14s
Smoke Test / smoke (pull_request) Failing after 18s
Validate Config / JSON Validate (pull_request) Successful in 17s
Validate Config / Python Syntax & Import Check (pull_request) Failing after 50s
Validate Config / Python Test Suite (pull_request) Has been skipped
Validate Config / Cron Syntax Check (pull_request) Successful in 11s
Validate Config / Shell Script Lint (pull_request) Failing after 53s
Validate Config / Deploy Script Dry Run (pull_request) Successful in 14s
Validate Config / Playbook Schema Validation (pull_request) Successful in 24s
Architecture Lint / Lint Repository (pull_request) Failing after 21s
PR Checklist / pr-checklist (pull_request) Failing after 4m5s
- Remove Triage Heartbeat and PR Review Sweep (dashboard-era dead jobs)
- These were paused on 2026-04-04: "Dashboard repo frozen - loops redirected to the-nexus"
- Document current canonical fleet services in docs/CANONICAL_SERVICES.md
- Update cron/audit-report.json to reflect removal

Hard rule compliance: VPS crontabs untouched (per #880)
Closes #880
2026-04-28 22:51:03 -04:00
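Note on efc42968e8: the schema of cron/audit-report.json is not shown on this page. As a rough sketch only, removing the two dead jobs from a report could look like the following. The job names come from the commit message. The `{"jobs": [{"name": ...}]}` shape and the `drop_jobs` helper are hypothetical.

```python
from typing import Any, Dict, Iterable

# Job names taken from the commit message.
REMOVED_JOBS = ("Triage Heartbeat", "PR Review Sweep")


def drop_jobs(report: Dict[str, Any], names: Iterable[str] = REMOVED_JOBS) -> Dict[str, Any]:
    """Return a copy of the report without the named jobs.

    ASSUMPTION: the report holds a top-level "jobs" list of dicts, each with
    a "name" key. The real audit-report.json may be structured differently.
    """
    removed = set(names)
    kept = [job for job in report.get("jobs", []) if job.get("name") not in removed]
    # Build a new dict so the caller's report is left unmodified.
    return {**report, "jobs": kept}
```

Returning a new dict instead of mutating in place makes it easy to compare the report before and after the removal when reviewing the change.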