PR for #981: [P2] Research sovereignty metrics + dashboard tracking #1044

Closed
gemini wants to merge 0 commits from feature/issue-981 into main
Collaborator

Closes #981

Parent

  • #972 — [GOVERNING] Replacing Claude — Autonomous Research Pipeline Spec

Objective

Track research sovereignty progress against the graduation targets defined in the spec.

Metrics to Track

| Metric | Week 1 | Month 1 | Month 3 | Graduation |
|--------|--------|---------|---------|------------|
| Research queries answered locally | 10% | 40% | 80% | >90% |
| API cost per research task | $1.50 | $0.50 | $0.10 | <$0.01 |
| Time from question to report | 3 hours | 30 min | 5 min | <1 min |
| Human involvement per task | 100% | Review only | Approve only | None |
| Research artifacts in local index | 6 | 30+ | 100+ | Comprehensive |
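The graduation column can be expressed as a simple predicate over measured values. The following is a minimal sketch, not code from this PR: the `ResearchMetrics` class, its field names, and the string labels for human involvement are all hypothetical. The "Comprehensive" artifact target has no numeric definition in the spec, so it is left out of the check.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ResearchMetrics:
    """Hypothetical snapshot of the tracked metrics (names are assumptions)."""
    local_answer_rate: float   # fraction of queries answered locally, 0.0 to 1.0
    api_cost_usd: float        # average API cost per research task, in USD
    time_to_report_s: float    # seconds from question to report
    human_involvement: str     # assumed labels: "full", "review", "approve", "none"


def meets_graduation(m: ResearchMetrics) -> bool:
    """Return True when every numeric graduation target in the table is met."""
    return (
        m.local_answer_rate > 0.90        # >90% answered locally
        and m.api_cost_usd < 0.01         # <$0.01 per task
        and m.time_to_report_s < 60       # <1 min to report
        and m.human_involvement == "none" # no human involvement
    )
```

For example, a Month 3 snapshot such as `ResearchMetrics(0.80, 0.10, 300, "approve")` would not pass this check, since every value still falls short of its graduation target.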

Scope

  • Emit metrics from ResearchOrchestrator: cache_hit_rate, api_cost, time_to_report, human_involvement
  • Feed into metrics emitter (#954) from Sovereignty Loop
  • Dashboard widget showing trend over time
  • Alert when API cost exceeds threshold
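The emit-and-alert flow in this scope list could look roughly like the sketch below. It assumes nothing about the real `ResearchOrchestrator` or the #954 emitter: `ResearchTaskMetrics`, `emit_task_metrics`, the `emit` and `alert` callables, and the default threshold are hypothetical stand-ins. The four field names are the ones listed in the scope.

```python
import time
from dataclasses import asdict, dataclass
from typing import Any, Callable, Dict


@dataclass
class ResearchTaskMetrics:
    """Per-task metrics named in the scope list (types are assumptions)."""
    cache_hit_rate: float     # fraction of lookups served from the local index
    api_cost: float           # USD spent on external APIs for this task
    time_to_report: float     # seconds from question to finished report
    human_involvement: str    # e.g. "review", "approve", "none" (assumed labels)


def emit_task_metrics(
    metrics: ResearchTaskMetrics,
    emit: Callable[[Dict[str, Any]], None],   # stand-in for the #954 emitter
    alert: Callable[[str], None],             # stand-in for the alert channel
    cost_threshold_usd: float = 0.50,         # placeholder; the PR names no value
) -> bool:
    """Emit one metrics event; raise an alert if API cost exceeds the threshold.

    Returns True when an alert was sent.
    """
    payload = asdict(metrics)
    payload["timestamp"] = time.time()  # lets a dashboard plot the trend over time
    emit(payload)

    if metrics.api_cost > cost_threshold_usd:
        alert(
            f"API cost ${metrics.api_cost:.2f} exceeds "
            f"threshold ${cost_threshold_usd:.2f}"
        )
        return True
    return False
```

Passing the emitter and alert channel as callables keeps the sketch independent of whatever interface #954 actually exposes. In a test, `list.append` works for both.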

Graduation Test

Timmy receives a Gitea issue asking for a state-of-the-art evaluation. Timmy searches local knowledge, identifies the gap, formulates queries, searches the web, fetches pages, synthesizes a report, commits it to the repo, creates follow-up issues, and indexes the result, all without a single message to Claude or any human.
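One way to make this test checkable is to log each pipeline step and every outbound message, then assert on both. The sketch below is an assumption about how such an audit might be structured. The step names are paraphrased from the paragraph above, and `graduation_test_passed` is a hypothetical helper, not part of the existing codebase.

```python
from typing import Iterable, Sequence

# Step names paraphrased from the graduation test description (hypothetical identifiers).
REQUIRED_STEPS = (
    "search_local_knowledge",
    "identify_gap",
    "formulate_queries",
    "search_web",
    "fetch_pages",
    "synthesize_report",
    "commit_to_repo",
    "create_followup_issues",
    "index_result",
)


def graduation_test_passed(
    completed_steps: Iterable[str],
    external_messages: Sequence[str],
) -> bool:
    """Pass only if every step ran and no message went to Claude or a human.

    `external_messages` is assumed to hold one entry per message sent to
    Claude or to any human during the run.
    """
    done = set(completed_steps)
    all_steps_done = all(step in done for step in REQUIRED_STEPS)
    return all_steps_done and len(external_messages) == 0
```

A run that completes all nine steps but asks a human one clarifying question fails under this definition, which matches the "not a single message" wording.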

Effort Estimate

4 hours

Related

  • #954 — Metrics emitter (Sovereignty Loop)
  • #962 — Three-strike anomaly detector
Timmy closed this pull request 2026-03-23 15:02:57 +00:00
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
Tests / lint (pull_request) Successful in 19s
Tests / test (pull_request) Failing after 25m3s

Reference: Rockachopa/Timmy-time-dashboard#1044