Commit Graph

778 Commits

Author SHA1 Message Date
Alexander Whitestone
48cf182c6e test: add unit tests for scorecard_service.py edge cases
Some checks failed
Tests / lint (pull_request) Failing after 35s
Tests / test (pull_request) Has been skipped
Adds tests/dashboard/test_scorecard_service.py with 31 unit tests
covering gaps not addressed in test_scorecards.py:
- _aggregate_metrics: test.execution events, PR-closed-without-merge,
  push with no num_commits key, issue/PR events with number=0,
  untracked agent with agent_id field passthrough
- _detect_patterns: boundary conditions (< 3 PRs threshold, exactly 80%
  merge rate, commit count boundary at 10 vs 11, token net boundaries)
- _generate_narrative_bullets: singular/plural forms for all activity types
- generate_scorecard: token augmentation max() logic (event vs ledger)
- ScorecardSummary.to_dict: ISO timestamp format, tests_affected count

Fixes #1139

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-23 22:08:04 -04:00
8304cf50da [claude] Add unit tests for backlog_triage.py (#1293) (#1307)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-24 01:57:44 +00:00
16c4cc0f9f [claude] Add unit tests for research_tools.py (#1294) (#1308)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-24 01:57:39 +00:00
a48f30fee4 [claude] Add unit tests for quest_system.py (#1292) (#1309)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-24 01:57:29 +00:00
e44db42c1a [claude] Split thinking.py into focused sub-modules (#1279) (#1306)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-24 01:57:04 +00:00
de7744916c [claude] DeerFlow evaluation research note (#1283) (#1305)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-24 01:56:37 +00:00
bde7232ece [claude] Add unit tests for kimi_delegation.py (#1295) (#1303)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-24 01:54:44 +00:00
fc4426954e [claude] Add module docstrings to 9 undocumented files (#1296) (#1302)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
Co-authored-by: Claude (Opus 4.6) <claude@hermes.local>
Co-committed-by: Claude (Opus 4.6) <claude@hermes.local>
2026-03-24 01:54:18 +00:00
5be4ecb9ef [kimi] Add unit tests for sovereignty/perception_cache.py (#1261) (#1301)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
Co-authored-by: Kimi Agent <kimi@timmy.local>
Co-committed-by: Kimi Agent <kimi@timmy.local>
2026-03-24 01:53:44 +00:00
4f80cfcd58 [claude] Three-tier model router: Local 8B / Hermes 70B / Cloud API cascade (#882) (#1297)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
Co-authored-by: Claude (Opus 4.6) <claude@hermes.local>
Co-committed-by: Claude (Opus 4.6) <claude@hermes.local>
2026-03-24 01:53:25 +00:00
a7ccfbddc9 [claude] feat: SearXNG + Crawl4AI self-hosted search backend (#1282) (#1299)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-24 01:52:51 +00:00
f1f67e62a7 [claude] Document and validate AirLLM Apple Silicon requirements (#1284) (#1298)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-24 01:52:17 +00:00
00ef4fbd22 [claude] Document and validate AirLLM Apple Silicon requirements (#1284) (#1298)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-24 01:52:16 +00:00
fc0a94202f [claude] Implement graceful degradation test scenarios (#919) (#1291)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-24 01:49:58 +00:00
bd3e207c0d [loop-cycle-1] docs: add docstrings to VoiceTTS public methods (#774) (#1290)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-24 01:48:46 +00:00
cc8ed5b57d [claude] Fix empty commits: require git add before commit in Kimi workflow (#1268) (#1288)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-24 01:48:34 +00:00
823216db60 [claude] Add unit tests for events system backbone (#917) (#1289)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-24 01:48:16 +00:00
75ecfaba64 [claude] Wire delegate_task to DistributedWorker for actual execution (#985) (#1273)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
Co-authored-by: Claude (Opus 4.6) <claude@hermes.local>
Co-committed-by: Claude (Opus 4.6) <claude@hermes.local>
2026-03-24 01:47:09 +00:00
55beaf241f [claude] Research summary: Kimi creative blueprint (#891) (#1286)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-24 01:46:28 +00:00
69498c9add [claude] Screenshot dump triage — 5 issues created (#1275) (#1287)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-24 01:46:22 +00:00
6c76bf2f66 [claude] Integrate health snapshot into Daily Run pre-flight (#923) (#1280)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-24 01:43:49 +00:00
0436dfd4c4 [claude] Dashboard: Agent Scorecards panel in Mission Control (#929) (#1276)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-24 01:43:21 +00:00
9eeb49a6f1 [claude] Autonomous research pipeline — orchestrator + SOVEREIGNTY.md (#972) (#1274)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-24 01:40:53 +00:00
2d6bfe6ba1 [claude] Agent Self-Correction Dashboard (#1007) (#1269)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
Co-authored-by: Claude (Opus 4.6) <claude@hermes.local>
Co-committed-by: Claude (Opus 4.6) <claude@hermes.local>
2026-03-24 01:40:40 +00:00
ebb2cad552 [claude] feat: Session Sovereignty Report Generator (#957) v3 (#1263)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
Co-authored-by: Claude (Opus 4.6) <claude@hermes.local>
Co-committed-by: Claude (Opus 4.6) <claude@hermes.local>
2026-03-24 01:40:24 +00:00
003e3883fb [claude] Restore self-modification loop (#983) (#1270)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
Co-authored-by: Claude (Opus 4.6) <claude@hermes.local>
Co-committed-by: Claude (Opus 4.6) <claude@hermes.local>
2026-03-24 01:40:16 +00:00
7dfbf05867 [claude] Run 5-test benchmark suite against local model candidates (#1066) (#1271)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-24 01:38:59 +00:00
1cce28d1bb [claude] Investigate: document paths to resolution for 5 closed PRs (#1219) (#1266)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
Tests / lint (pull_request) Failing after 29s
Tests / test (pull_request) Has been skipped
Co-authored-by: Claude (Opus 4.6) <claude@hermes.local>
Co-committed-by: Claude (Opus 4.6) <claude@hermes.local>
2026-03-24 01:36:06 +00:00
4c6b69885d [claude] feat: Agent Energy Budget Monitoring (#1009) (#1267)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-24 01:35:50 +00:00
6b2e6d9e8c [claude] feat: Agent Energy Budget Monitoring (#1009) (#1267)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-24 01:35:49 +00:00
2b238d1d23 [loop-cycle-1] fix: ruff format error on test_autoresearch.py (#1256) (#1257)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-24 01:27:38 +00:00
b7ad5bf1d9 fix: remove unused variable in test_loop_guard_seed (ruff F841) (#1255)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
Tests / lint (pull_request) Failing after 33s
Tests / test (pull_request) Has been skipped
2026-03-24 01:20:42 +00:00
2240ddb632 [loop-cycle] fix: three-strike route test isolation for xdist (#1254)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-23 23:49:00 +00:00
35d2547a0b [claude] Fix cycle-metrics pipeline: seed issue= from queue so retro is never null (#1250) (#1253)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-23 23:42:23 +00:00
f62220eb61 [claude] Autoresearch H1: Apple Silicon support + M3 Max baseline doc (#905) (#1252)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-23 23:38:38 +00:00
72992b7cc5 [claude] Fix ImportError: memory_write missing from memory_system (#1249) (#1251)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-23 23:37:21 +00:00
b5fb6a85cf [claude] Fix pre-existing ruff lint errors blocking git hooks (#1247) (#1248)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-23 23:33:37 +00:00
fedd164686 [claude] Fix 10 vassal tests flaky under xdist parallel execution (#1243) (#1245)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-23 23:29:25 +00:00
261b7be468 [kimi] Refactor autoresearch.py -> SystemExperiment class (#906) (#1244)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
Co-authored-by: Kimi Agent <kimi@timmy.local>
Co-committed-by: Kimi Agent <kimi@timmy.local>
2026-03-23 23:28:54 +00:00
6691f4d1f3 [claude] Add timmy learn autoresearch entry point (#907) (#1240)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
Tests / lint (pull_request) Failing after 16s
Tests / test (pull_request) Has been skipped
Co-authored-by: Claude (Opus 4.6) <claude@hermes.local>
Co-committed-by: Claude (Opus 4.6) <claude@hermes.local>
2026-03-23 23:14:09 +00:00
ea76af068a [kimi] Add unit tests for paperclip.py (#1236) (#1241)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-23 23:13:54 +00:00
b61fcd3495 [claude] Add unit tests for research_tools.py (#1237) (#1239)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-23 23:06:06 +00:00
1e1689f931 [claude] Qwen3 two-model routing via task complexity classifier (#1065) v2 (#1233)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
Co-authored-by: Claude (Opus 4.6) <claude@hermes.local>
Co-committed-by: Claude (Opus 4.6) <claude@hermes.local>
2026-03-23 22:58:21 +00:00
acc0df00cf [claude] Three-Strike Detector (#962) v2 (#1232)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
Co-authored-by: Claude (Opus 4.6) <claude@hermes.local>
Co-committed-by: Claude (Opus 4.6) <claude@hermes.local>
2026-03-23 22:50:59 +00:00
a0c35202f3 [claude] ADR-024: canonical Nostr identity in timmy-nostr (#1223) (#1230)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-23 22:47:25 +00:00
fe1d576c3c [claude] Gitea activity & branch audit across all repos (#1210) (#1228)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-23 22:46:16 +00:00
3e65271af6 [claude] Rescue unmerged work: open PRs for 3 abandoned branches (#1218) (#1229)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-23 22:46:10 +00:00
697575e561 [gemini] Implement semantic index for research outputs (#976) (#1227)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-23 22:45:29 +00:00
e6391c599d [claude] Enforce one-agent-per-issue via labels, document auto-delete branches (#1220) (#1222)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-23 22:44:50 +00:00
d697c3d93e [claude] refactor: break up monolithic tools.py into a tools/ package (#1215) (#1221)
Some checks failed
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-23 22:43:09 +00:00