ee1b5cd744
[loop-cycle-2] feat: add VIDEO enum and fallback chain ( #1494 , #1495 ) ( #1496 )
2026-04-15 15:44:11 +00:00
Alexander Whitestone
8298f6edc3
Merge feat/gemma4-multimodal-support: resolve MULTIMODAL_BACKLOG.md conflict
...
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
Combined main's rich epic structure with PR #1493 's Task 3/4 items.
Code changes to multimodal.py merged cleanly.
2026-04-13 15:29:20 -04:00
Alexander Whitestone
1ed8e91a44
feat: add Gemma 4 to multimodal capabilities registry
...
Tests / lint (pull_request) Has been cancelled
Tests / test (pull_request) Has been cancelled
- Add gemma4 (4b, 12b, 27b) to KNOWN_MODEL_CAPABILITIES with VISION, TOOLS, TEXT, JSON, STREAMING
- Add gemma4:12b to VISION fallback chain (preferred over llama3.2:3b)
- Add Gemma 4 variants to vision model test assertions
- Create MULTIMODAL_BACKLOG.md for tracking multimodal improvements
2026-04-13 14:59:14 -04:00
d05ce8bf86
feat: add automated Nexus zone screenshot capture script ( #1492 )
...
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
Co-authored-by: Alexander Whitestone <alexander@alexanderwhitestone.com >
Co-committed-by: Alexander Whitestone <alexander@alexanderwhitestone.com >
2026-04-13 03:05:53 +00:00
4d2aeb937f
[loop-cycle-7] refactor: split research.py into research/ subpackage ( #1405 ) ( #1458 )
Tests / lint (push) Successful in 11s
Tests / test (push) Has been cancelled
2026-03-24 19:53:04 +00:00
8a7a34499c
[loop-cycle-1] refactor: split cascade.py into focused modules ( #1342 ) ( #1448 )
Tests / lint (push) Failing after 8s
Tests / test (push) Has been skipped
2026-03-24 18:39:06 +00:00
e09082a8a8
[kimi] Add comprehensive unit tests for models/budget.py ( #1316 ) ( #1347 )
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-24 02:48:51 +00:00
d4e5a5d293
[claude] TES3MP server hardening — multi-player stability & anti-grief ( #860 ) ( #1321 )
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-24 02:13:57 +00:00
f0841bd34e
[claude] Automated Episode Compiler — Highlights to Published Video ( #880 ) ( #1318 )
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-24 02:05:14 +00:00
4f80cfcd58
[claude] Three-tier model router: Local 8B / Hermes 70B / Cloud API cascade ( #882 ) ( #1297 )
...
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
Co-authored-by: Claude (Opus 4.6) <claude@hermes.local >
Co-committed-by: Claude (Opus 4.6) <claude@hermes.local >
2026-03-24 01:53:25 +00:00
fc0a94202f
[claude] Implement graceful degradation test scenarios ( #919 ) ( #1291 )
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-24 01:49:58 +00:00
823216db60
[claude] Add unit tests for events system backbone ( #917 ) ( #1289 )
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-24 01:48:16 +00:00
b5fb6a85cf
[claude] Fix pre-existing ruff lint errors blocking git hooks ( #1247 ) ( #1248 )
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-23 23:33:37 +00:00
1e1689f931
[claude] Qwen3 two-model routing via task complexity classifier ( #1065 ) v2 ( #1233 )
...
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
Co-authored-by: Claude (Opus 4.6) <claude@hermes.local >
Co-committed-by: Claude (Opus 4.6) <claude@hermes.local >
2026-03-23 22:58:21 +00:00
25157a71a8
[loop-cycle] fix: remove unused imports and fix formatting (lint) ( #1209 )
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-23 22:30:03 +00:00
7d20d18af1
[claude] test: improve event bus unit test coverage to 99% ( #1191 ) ( #1201 )
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-23 21:59:59 +00:00
7afb72209a
[claude] Add unit tests for chat_store.py ( #1192 ) ( #1198 )
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-23 21:58:38 +00:00
8f8061e224
[claude] refactor: break up cascade.py complete() ( #1185 ) ( #1190 )
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-23 21:52:27 +00:00
0d73a4ff7a
[claude] Fix ruff S105/S106/B017/E402 errors in bannerlord ( #1161 ) ( #1165 )
...
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
Co-authored-by: Claude (Opus 4.6) <claude@hermes.local >
Co-committed-by: Claude (Opus 4.6) <claude@hermes.local >
2026-03-23 19:56:07 +00:00
08d337e03d
[claude] Implement three-tier metabolic LLM router ( #966 ) ( #1160 )
...
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
Co-authored-by: Claude (Opus 4.6) <claude@hermes.local >
Co-committed-by: Claude (Opus 4.6) <claude@hermes.local >
2026-03-23 19:45:56 +00:00
495c1ac2bd
[claude] Fix 27 ruff lint errors blocking all pushes ( #1149 ) ( #1153 )
...
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
Co-authored-by: Claude (Opus 4.6) <claude@hermes.local >
Co-committed-by: Claude (Opus 4.6) <claude@hermes.local >
2026-03-23 19:06:11 +00:00
0e5948632d
[claude] Add unit tests for cascade.py ( #1138 ) ( #1150 )
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-23 18:47:28 +00:00
510d890eb2
[claude] Wire QuotaMonitor.select_model() into cascade router ( #1106 ) ( #1113 )
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-23 18:13:17 +00:00
f2a277f7b5
[claude] Add vllm-mlx as high-performance local inference backend ( #1069 ) ( #1089 )
...
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
Co-authored-by: Claude (Opus 4.6) <claude@hermes.local >
Co-committed-by: Claude (Opus 4.6) <claude@hermes.local >
2026-03-23 15:34:13 +00:00
48f667c76b
[claude] Integrate Claude Quota Monitor + Metabolic Protocol into cascade router ( #1075 ) ( #1086 )
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-23 15:18:11 +00:00
af0963a8c7
[loop-cycle-1] refactor: break up run_agentic_loop ( #531 ) ( #1084 )
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-23 15:06:59 +00:00
7f875398fc
[claude] Add sovereignty metrics tracking + dashboard panel ( #981 ) ( #1083 )
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-23 14:09:03 +00:00
1697e55cdb
[claude] Add content moderation pipeline (Llama Guard + game-context prompts) ( #1056 ) ( #1059 )
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-23 02:14:42 +00:00
45bde4df58
[claude] Add agent performance regression benchmark suite ( #1015 ) ( #1053 )
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-22 23:55:27 +00:00
9656a5e0d0
[claude] Add connection leak and pragma unit tests for db_pool.py ( #944 ) ( #1001 )
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-22 22:56:58 +00:00
1fa5cff5dc
[kimi] Fix GITEA_API configuration in triage scripts ( #951 ) ( #994 )
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-22 22:28:23 +00:00
bebbe442b4
feat: WorldInterface + Heartbeat v2 ( #871 , #872 ) ( #900 )
...
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
Co-authored-by: Perplexity Computer <perplexity@tower.local >
Co-committed-by: Perplexity Computer <perplexity@tower.local >
2026-03-22 13:44:49 +00:00
2ab6539564
[kimi] Add ConnectionPool class with unit tests ( #769 ) ( #830 )
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-21 22:02:08 +00:00
cc30bdb391
[kimi] test: add comprehensive tests for multimodal.py ( #658 ) ( #694 )
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
2026-03-21 04:00:53 +00:00
2381d0b6d0
refactor: break up _create_bug_report — extract helpers ( #645 )
...
Tests / lint (push) Has been cancelled
Tests / test (push) Has been cancelled
Tests / lint (pull_request) Has been cancelled
Tests / test (pull_request) Has been cancelled
Co-authored-by: Kimi Agent <kimi@timmy.local >
Co-committed-by: Kimi Agent <kimi@timmy.local >
2026-03-20 22:03:40 +00:00
f3b3d1e648
[loop-cycle-1658] feat: provider health history endpoint ( #457 ) ( #611 )
Tests / lint (push) Successful in 3s
Tests / test (push) Has been cancelled
2026-03-20 16:09:20 -04:00
b331aa6139
refactor: break up capture_error() into testable helpers ( #523 )
...
Tests / lint (push) Successful in 2s
Tests / test (push) Successful in 1m25s
Co-authored-by: Kimi Agent <kimi@timmy.local >
Co-committed-by: Kimi Agent <kimi@timmy.local >
2026-03-19 20:03:28 -04:00
15eb7c3b45
[loop-cycle-538] refactor: remove dead airllm provider from cascade router ( #459 ) ( #481 )
Tests / lint (push) Successful in 3s
Tests / test (push) Successful in 1m28s
2026-03-19 15:44:10 -04:00
3df526f6ef
[loop-cycle-2] feat: hot-reload providers.yaml without restart ( #458 ) ( #470 )
Tests / lint (push) Failing after 3s
Tests / test (push) Has been skipped
2026-03-19 15:11:40 -04:00
bcd6d7e321
[loop-cycle-50] refactor: replace bare sqlite3.connect() with context managers batch 2 ( #157 ) ( #180 )
Tests / lint (push) Successful in 4s
Tests / test (push) Successful in 1m55s
2026-03-15 11:58:43 -04:00
d2c51763d0
[loop-cycle-43] refactor: remove 1035 lines of dead code ( #136 ) ( #146 )
Tests / lint (push) Successful in 4s
Tests / test (push) Successful in 1m4s
2026-03-15 10:10:12 -04:00
9535dd86de
test: push event system coverage to ≥80% on all three modules
...
Tests / lint (pull_request) Successful in 4s
Tests / test (pull_request) Failing after 16s
Add 3 targeted tests for infrastructure/error_capture.py:
- test_stale_entries_pruned: exercises dedup cache pruning (line 61)
- test_git_context_fallback_on_failure: exercises exception path (lines 90-91)
- test_returns_none_when_feedback_disabled: exercises early return (line 112)
Coverage results (63 tests, all passing):
- error_capture.py: 75.6% → 80.0%
- broadcaster.py: 93.9% (unchanged)
- bus.py: 92.9% (unchanged)
- Total: 88.1% → 89.4%
Closes #45
2026-03-14 16:01:05 -04:00
2f623826bd
cleanup: delete dead modules — ~7,900 lines removed
...
Tests / lint (pull_request) Successful in 5s
Tests / test (pull_request) Successful in 1m30s
Closes #22 , Closes #23
Deleted: brain/, swarm/, openfang/, paperclip/, cascade_adapter,
memory_migrate, agents/timmy.py, dead routes + all corresponding tests.
Updated pyproject.toml, app.py, loop_qa.py for removed imports.
2026-03-14 09:49:24 -04:00
Trip T
ea2dbdb4b5
fix: test DB isolation, Discord recovery, and over-mocked tests
...
Tests / lint (pull_request) Successful in 8s
Tests / test (pull_request) Successful in 33s
Test data was bleeding into production tasks.db because
swarm.task_queue.models.DB_PATH (relative path) was never patched in
conftest.clean_database. Fixed by switching to absolute paths via
settings.repo_root and adding the missing module to the patching list.
Discord bot could leak orphaned clients on retry after ERROR state.
Added _cleanup_stale() to close stale client/task before each start()
attempt, with improved logging in the token watcher.
Rewrote test_paperclip_client.py to use httpx.MockTransport instead of
patching _get/_post/_delete — tests now exercise real HTTP status codes,
error handling, and JSON parsing. Added end-to-end test for
capture_error → create_task DB isolation.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com >
2026-03-11 20:33:59 -04:00
Alexander Whitestone
68115fe477
fix: update agno to v2 and fix airllm availability tests ( #170 )
...
The agno dependency was pinned to <2.0 but the code uses agno.db.sqlite
(a 2.x API), breaking all tests in CI. Also fix airllm provider tests
to patch importlib.util.find_spec (what the production code uses) instead
of builtins.__import__.
Co-authored-by: Trip T <trip@local>
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com >
2026-03-11 12:40:45 -04:00
Alexander Whitestone
9d78eb31d1
ruff ( #169 )
...
* polish: streamline nav, extract inline styles, improve tablet UX
- Restructure desktop nav from 8+ flat links + overflow dropdown into
5 grouped dropdowns (Core, Agents, Intel, System, More) matching
the mobile menu structure to reduce decision fatigue
- Extract all inline styles from mission_control.html and base.html
notification elements into mission-control.css with semantic classes
- Replace JS-built innerHTML with secure DOM construction in
notification loader and chat history
- Add CONNECTING state to connection indicator (amber) instead of
showing OFFLINE before WebSocket connects
- Add tablet breakpoint (1024px) with larger touch targets for
Apple Pencil / stylus use and safe-area padding for iPad toolbar
- Add active-link highlighting in desktop dropdown menus
- Rename "Mission Control" page title to "System Overview" to
disambiguate from the chat home page
- Add "Home — Timmy Time" page title to index.html
https://claude.ai/code/session_015uPUoKyYa8M2UAcyk5Gt6h
* fix(security): move auth-gate credentials to environment variables
Hardcoded username, password, and HMAC secret in auth-gate.py replaced
with os.environ lookups. Startup now refuses to run if any variable is
unset. Added AUTH_GATE_SECRET/USER/PASS to .env.example.
https://claude.ai/code/session_015uPUoKyYa8M2UAcyk5Gt6h
* refactor(tooling): migrate from black+isort+bandit to ruff
Replace three separate linting/formatting tools with a single ruff
invocation. Updates tox.ini (lint, format, pre-push, pre-commit envs),
.pre-commit-config.yaml, and CI workflow. Fixes all ruff errors
including unused imports, missing raise-from, and undefined names.
Ruff config maps existing bandit skips to equivalent S-rules.
https://claude.ai/code/session_015uPUoKyYa8M2UAcyk5Gt6h
---------
Co-authored-by: Claude <noreply@anthropic.com >
2026-03-11 12:23:35 -04:00
Alexander Whitestone
82fb2417e3
feat: enable SQLite WAL mode for all databases (AGI ticket #1 ) ( #153 )
2026-03-08 16:07:02 -04:00
Alexander Whitestone
ae3bb1cc21
feat: code quality audit + autoresearch integration + infra hardening ( #150 )
2026-03-08 12:50:44 -04:00
Alexander Whitestone
3f06e7231d
Improve test coverage from 63.6% to 73.4% and fix test infrastructure ( #137 )
2026-03-06 13:21:05 -05:00
Alexander Whitestone
584eeb679e
Operation Darling Purge: slim to wealth core (-33,783 lines) ( #121 )
2026-03-02 13:17:38 -05:00