Feature: Agent "Self-Correction" Dashboard #1007

Closed
opened 2026-03-22 23:04:54 +00:00 by gemini · 2 comments
Collaborator

Objective

Create a dedicated dashboard view that highlights instances where the agent detected its own errors and the subsequent steps it took to rectify them.

Scope

  • Implement an "Error Detection" hook in the agent's execution loop.
  • Log "Self-Correction" events with details: original intent, detected error, correction strategy, and final outcome.
  • Visualize these events in a timeline on the dashboard.
  • Use this data to identify recurring failure patterns for future automation.
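
A minimal sketch of the event record and the recurring-pattern step described in the scope above. The class name `SelfCorrectionEvent` and the helper `recurring_errors` are hypothetical and used only for illustration; the four fields mirror the details the scope lists.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class SelfCorrectionEvent:
    """Hypothetical record holding the four details named in the scope."""
    original_intent: str
    detected_error: str
    correction_strategy: str
    final_outcome: str


def recurring_errors(events, min_count=2):
    """Return (detected_error, count) pairs seen at least min_count times."""
    counts = Counter(e.detected_error for e in events)
    return [(err, n) for err, n in counts.most_common() if n >= min_count]


# Illustrative data only, not taken from a real agent run.
events = [
    SelfCorrectionEvent("fetch page", "timeout", "retry with backoff", "succeeded"),
    SelfCorrectionEvent("parse JSON", "invalid syntax", "re-prompt", "succeeded"),
    SelfCorrectionEvent("fetch feed", "timeout", "retry with backoff", "failed"),
]
print(recurring_errors(events))
```

Grouping on the raw error string is the simplest choice; a real implementation would probably normalize or classify errors first so that near-identical messages fall into one pattern.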
claude was assigned by Rockachopa 2026-03-22 23:30:24 +00:00
claude added the gemini-review label 2026-03-23 13:51:30 +00:00
Collaborator

🧹 Closed — Gemini auto-generated, not aligned with harness-first strategy

Reason: Self-Correction Dashboard — dashboard is infra, self-correction is already in heartbeat loop

This was auto-generated by Gemini. The concept may be revisited if it becomes relevant to the harness roadmap. Reopen if needed.

Ref: Deep Backlog Triage #1076.

Collaborator

PR created: #1269

Summary of changes

  • src/infrastructure/self_correction.py — New SQLite-backed logger. Records original_intent, detected_error, correction_strategy, final_outcome, outcome_status, and error_type. Public API: log_self_correction(), get_corrections(), get_patterns(), get_stats().
  • src/timmy/agentic_loop.py — Hook in _handle_step_failure emits a self-correction event every time the loop adapts (or fails to adapt) a step. Best-effort, never raises.
  • src/dashboard/routes/self_correction.py — Three routes: /self-correction/ui (dashboard), /timeline (HTMX, 30 s refresh), /patterns (HTMX, 60 s refresh).
  • Templates — self_correction.html with stats panel + correction-rate progress bar; two partials for timeline and recurring-pattern table.
  • Navigation — SELF-CORRECT link added to INTEL dropdown (desktop + mobile).
  • CSS — .sc-* styles appended to mission-control.css.
  • Tests — 18 unit tests; all 435 existing tests pass.
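
For readers without access to the PR, here is a rough sketch of what a SQLite-backed, best-effort logger of this shape could look like. It is not the code from #1269: the function names and column names are taken from the summary above, but the signatures, the table name `self_corrections`, the helper `open_db`, and the `'success'` status value are all assumptions.

```python
import sqlite3

# Assumed schema; column names follow the fields listed in the summary.
SCHEMA = """
CREATE TABLE IF NOT EXISTS self_corrections (
    id INTEGER PRIMARY KEY AUTOINCREMENT,
    created_at TEXT NOT NULL DEFAULT CURRENT_TIMESTAMP,
    original_intent TEXT NOT NULL,
    detected_error TEXT NOT NULL,
    correction_strategy TEXT NOT NULL,
    final_outcome TEXT NOT NULL,
    outcome_status TEXT NOT NULL,
    error_type TEXT NOT NULL
)
"""

INSERT = """
INSERT INTO self_corrections
    (original_intent, detected_error, correction_strategy,
     final_outcome, outcome_status, error_type)
VALUES
    (:original_intent, :detected_error, :correction_strategy,
     :final_outcome, :outcome_status, :error_type)
"""


def open_db(path=":memory:"):
    """Hypothetical helper: open the database and ensure the table exists."""
    conn = sqlite3.connect(path)
    conn.execute(SCHEMA)
    return conn


def log_self_correction(conn, **fields):
    """Best-effort insert: True on success, False on any failure, never raises."""
    try:
        with conn:  # commits on success, rolls back on error
            conn.execute(INSERT, fields)
        return True
    except Exception:
        return False


def get_stats(conn):
    """Total events and the share whose outcome_status is 'success' (assumed value)."""
    total, corrected = conn.execute(
        "SELECT COUNT(*), COALESCE(SUM(outcome_status = 'success'), 0) "
        "FROM self_corrections"
    ).fetchone()
    rate = corrected / total if total else 0.0
    return {"total": total, "corrected": corrected, "correction_rate": rate}


def get_patterns(conn, min_count=2):
    """Error types that occur at least min_count times, most frequent first."""
    return conn.execute(
        "SELECT error_type, COUNT(*) FROM self_corrections "
        "GROUP BY error_type HAVING COUNT(*) >= ? "
        "ORDER BY COUNT(*) DESC, error_type",
        (min_count,),
    ).fetchall()
```

Because `log_self_correction` swallows every exception and only reports a boolean, a hook such as the one in `_handle_step_failure` could call it without risking a crash in the agent loop, which matches the "best-effort, never raises" behaviour described above.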
Reference: Rockachopa/Timmy-time-dashboard#1007