Timmy-time-dashboard

Author	SHA1	Message	Date
Kimi Agent	9e9dd5309a	[kimi] Fix: stub cv2 in tests to prevent timeout (#1336 ) (#1356 ) Some checks failed Tests / lint (push) Has been cancelled Details Tests / test (push) Has been cancelled Details Co-authored-by: Kimi Agent <kimi@timmy.local> Co-committed-by: Kimi Agent <kimi@timmy.local>	2026-03-24 02:59:52 +00:00
Claude (Opus 4.6)	7f875398fc	[claude] Add sovereignty metrics tracking + dashboard panel (#981 ) (#1083 ) Some checks failed Tests / lint (push) Has been cancelled Details Tests / test (push) Has been cancelled Details	2026-03-23 14:09:03 +00:00
Kimi Agent	c9445e3056	[kimi] refactor: extract helpers from CSRFMiddleware.dispatch (#628 ) (#691 ) Some checks failed Tests / lint (push) Has been cancelled Details Tests / test (push) Has been cancelled Details	2026-03-21 03:41:09 +00:00
Timmy Time	7da434c85b	[loop-cycle-946] refactor: complete airllm removal (#486 ) (#545 ) Some checks failed Tests / lint (push) Successful in 3s Details Tests / test (push) Has been cancelled Details	2026-03-19 20:46:20 -04:00
hermes	b4cb3e9975	[loop-cycle-54] refactor: consolidate three memory stores into single table (#37 ) (#223 ) Some checks failed Tests / lint (push) Failing after 2s Details Tests / test (push) Has been skipped Details	2026-03-15 13:33:24 -04:00
Kimi Agent	79edfd1106	feat: persist chat history in SQLite — survives server restarts Some checks failed Tests / lint (pull_request) Successful in 2s Details Tests / test (pull_request) Failing after 13s Details Replace in-memory MessageLog with SQLite-backed implementation. Same API surface (append/all/clear/len) so zero caller changes needed. - data/chat.db stores messages with role, content, timestamp, source - Lazy DB connection (opened on first use, not at import time) - Retention policy: oldest messages pruned when count > 500 - New .recent(limit) method for efficient last-N queries - Thread-safe with explicit locking - WAL mode for concurrent read performance - Test isolation: conftest redirects DB to tmp_path per test - 8 new tests: persistence, retention, concurrency, source field Closes #46	2026-03-14 16:09:26 -04:00
Trip T	bd1aa55904	fix: use StdioServerParameters to bypass Agno executable whitelist All checks were successful Tests / lint (pull_request) Successful in 23s Details Tests / test (pull_request) Successful in 32s Details Agno's MCPTools has an undocumented executable whitelist that blocks gitea-mcp (Go binary). Switch to server_params=StdioServerParameters() which bypasses this restriction. Also fixes: - Use tools.session.call_tool() for standalone invocation (MCPTools doesn't expose call_tool() directly) - Use close() instead of disconnect() for cleanup - Resolve gitea-mcp path via ~/go/bin fallback when not on PATH - Stub mcp.client.stdio in test conftest Smoke-tested end-to-end against real Gitea: connect, list_issues, create issue, close issue, create_gitea_issue_via_mcp — all pass. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-12 22:03:45 -04:00
Trip T	78167675f2	feat: replace custom Gitea client with MCP servers All checks were successful Tests / lint (pull_request) Successful in 3s Details Tests / test (pull_request) Successful in 29s Details Replace the bespoke GiteaHand httpx client and tools_gitea.py wrappers with official MCP tool servers (gitea-mcp + filesystem MCP), wired into Agno via MCPTools. Switch all session functions to async (arun/acontinue_run) so MCP tools auto-connect. Delete ~1070 lines of custom Gitea code. - Create src/timmy/mcp_tools.py with MCP factories + standalone issue bridge - Wire MCPTools into agent.py tool list (Gitea + filesystem) - Switch session.py chat/chat_with_tools/continue_chat to async - Update all callers (dashboard routes, Discord vendor, CLI, thinking engine) - Add gitea_token fallback from ~/.config/gitea/token - Add MCP session cleanup to app shutdown hook - Update tool_safety.py for MCP tool names - 11 new tests, all 1417 passing, coverage 74.2% Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-12 21:40:32 -04:00
Trip T	b2f12ca97c	feat: consolidate memory into unified memory.db with 4-type model All checks were successful Tests / lint (pull_request) Successful in 3s Details Tests / test (pull_request) Successful in 30s Details Consolidates 3 separate memory databases (semantic_memory.db, swarm.db memory_entries, brain.db) into a single data/memory.db with facts, chunks, and episodes tables. Key changes: - Add unified schema (timmy/memory/unified.py) with 3 core tables - Redirect vector_store.py and semantic_memory.py to memory.db - Add thought distillation: every Nth thought extracts lasting facts - Enrich agent context with known facts in system prompt - Add memory_forget tool for removing outdated memories - Unify embeddings: vector_store delegates to semantic_memory.embed_text - Bridge spark events to unified event log - Add pruning for thoughts and events with configurable retention - Add data migration script (timmy/memory_migrate.py) - Deprecate brain.memory in favor of unified system Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-12 11:23:18 -04:00
Trip T	ea2dbdb4b5	fix: test DB isolation, Discord recovery, and over-mocked tests All checks were successful Tests / lint (pull_request) Successful in 8s Details Tests / test (pull_request) Successful in 33s Details Test data was bleeding into production tasks.db because swarm.task_queue.models.DB_PATH (relative path) was never patched in conftest.clean_database. Fixed by switching to absolute paths via settings.repo_root and adding the missing module to the patching list. Discord bot could leak orphaned clients on retry after ERROR state. Added _cleanup_stale() to close stale client/task before each start() attempt, with improved logging in the token watcher. Rewrote test_paperclip_client.py to use httpx.MockTransport instead of patching _get/_post/_delete — tests now exercise real HTTP status codes, error handling, and JSON parsing. Added end-to-end test for capture_error → create_task DB isolation. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-11 20:33:59 -04:00
Alexander Whitestone	9d78eb31d1	ruff (#169 ) * polish: streamline nav, extract inline styles, improve tablet UX - Restructure desktop nav from 8+ flat links + overflow dropdown into 5 grouped dropdowns (Core, Agents, Intel, System, More) matching the mobile menu structure to reduce decision fatigue - Extract all inline styles from mission_control.html and base.html notification elements into mission-control.css with semantic classes - Replace JS-built innerHTML with secure DOM construction in notification loader and chat history - Add CONNECTING state to connection indicator (amber) instead of showing OFFLINE before WebSocket connects - Add tablet breakpoint (1024px) with larger touch targets for Apple Pencil / stylus use and safe-area padding for iPad toolbar - Add active-link highlighting in desktop dropdown menus - Rename "Mission Control" page title to "System Overview" to disambiguate from the chat home page - Add "Home — Timmy Time" page title to index.html https://claude.ai/code/session_015uPUoKyYa8M2UAcyk5Gt6h * fix(security): move auth-gate credentials to environment variables Hardcoded username, password, and HMAC secret in auth-gate.py replaced with os.environ lookups. Startup now refuses to run if any variable is unset. Added AUTH_GATE_SECRET/USER/PASS to .env.example. https://claude.ai/code/session_015uPUoKyYa8M2UAcyk5Gt6h * refactor(tooling): migrate from black+isort+bandit to ruff Replace three separate linting/formatting tools with a single ruff invocation. Updates tox.ini (lint, format, pre-push, pre-commit envs), .pre-commit-config.yaml, and CI workflow. Fixes all ruff errors including unused imports, missing raise-from, and undefined names. Ruff config maps existing bandit skips to equivalent S-rules. https://claude.ai/code/session_015uPUoKyYa8M2UAcyk5Gt6h --------- Co-authored-by: Claude <noreply@anthropic.com>	2026-03-11 12:23:35 -04:00
Alexander Whitestone	904a7c564e	feat: migrate to Agno native HITL tool confirmation flow (#158 ) Replace the homebrew regex-based tool extraction and manual dispatch (tool_executor.py) with Agno's built-in Human-In-The-Loop confirmation: - Toolkit(requires_confirmation_tools=...) marks dangerous tools - agent.run() returns RunOutput with status=paused when confirmation needed - RunRequirement.confirm()/reject() + agent.continue_run() resumes execution Dashboard and Discord vendor both use the native flow. DuckDuckGo import isolated so its absence doesn't kill all tools. Test stubs cleaned up (agno is a real dependency, only truly optional packages stubbed). 1384 tests pass in parallel (~14s). Co-authored-by: Trip T <trip@local> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-09 21:54:04 -04:00
Alexander Whitestone	82fb2417e3	feat: enable SQLite WAL mode for all databases (AGI ticket #1 ) (#153 )	2026-03-08 16:07:02 -04:00
Alexander Whitestone	ae3bb1cc21	feat: code quality audit + autoresearch integration + infra hardening (#150 )	2026-03-08 12:50:44 -04:00
Alexander Whitestone	e36a1dc939	fix: resolve 6 dashboard bugs and rebuild Task Queue + Work Orders (#144 ) (#144 ) Round 2+3 bug fix batch: 1. Ollama timeout: Add request_timeout=300 to prevent socket read errors on complex 30-60s prompts (production crash fix) 2. Memory API: Create missing HTMX partial templates (memory_facts.html, memory_results.html) so Save/Search buttons work 3. CALM page: Add create_tables() call so SQLAlchemy tables exist on first request (was returning HTTP 500) 4. Task Queue: Full SQLite-backed rebuild with CRUD endpoints, HTMX partials, and action buttons (approve/veto/pause/cancel/retry) 5. Work Orders: Full SQLite-backed rebuild with submit/approve/reject/ execute pipeline and HTMX polling partials 6. Memory READ tool: Add memory_read function so Timmy stops calling read_file when trying to recall stored facts Also: Close GitHub issues #115, #114, #112, #110 as won't-fix. Comment on #107 confirming prune_memories() already wired to startup. Tests: 33 new tests across 4 test files, all passing. Full suite: 1155 passed, 2 pre-existing failures (hands_shell). Co-authored-by: Trip T <trip@local> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-07 23:21:30 -05:00
Alexander Whitestone	fb97625404	Consolidate architecture: flatten agents, kill Redis/Celery, thin routes (#133 )	2026-03-05 20:27:02 -05:00
Alexander Whitestone	aff3edb06a	Audit cleanup: security fixes, code reduction, test hygiene (#131 )	2026-03-05 18:56:52 -05:00
Alexander Whitestone	e8f1dea3ec	Remove unused deps from poetry build, speed test suite to ~16s (#130 )	2026-03-05 18:07:59 -05:00
Alexander Whitestone	f2dacf4ee0	Integrate Celery task queue for background task processing (#129 )	2026-03-05 12:09:51 -05:00
AlexanderWhitestone	5e8766cef0	Fix build issues, implement missing routes, and stabilize e2e tests for production readiness	2026-03-04 17:15:46 -05:00
Alexander Whitestone	584eeb679e	Operation Darling Purge: slim to wealth core (-33,783 lines) (#121 )	2026-03-02 13:17:38 -05:00
Alexander Whitestone	62ef1120a4	Memory Unification + Canonical Identity: -11,074 lines of homebrew (#119 )	2026-03-02 09:58:07 -05:00
Alexander Whitestone	b7c89d1101	feat: dockerize OpenFang as vendored tool runtime sidecar (#96 )	2026-02-28 19:27:48 -05:00
Alexander Whitestone	d7d7a5a80a	audit: clean Docker architecture, consolidate test fixtures, add containerized test runner (#94 )	2026-02-28 16:11:58 -05:00
Alexander Whitestone	e5190b248a	CI/CD Optimization: Guard Rails, Pre-commit Checks, and Test Fixes (#90 ) * CI/CD Optimization: Guard Rails, Black Linting, and Pre-commit Hooks - Fixed all test collection errors (Selenium imports, fixture paths, syntax) - Implemented pre-commit hooks with Black formatting and isort - Created comprehensive Makefile with test targets (unit, integration, functional, e2e) - Added pytest.ini with marker definitions for test categorization - Established guard rails to prevent future collection errors - Wrapped optional dependencies (Selenium, MoviePy) in try-except blocks - Added conftest_markers for automatic test categorization This ensures a smooth development stream with: - Fast feedback loops (pre-commit checks before push) - Consistent code formatting (Black) - Reliable CI/CD (no collection errors, proper test isolation) - Clear test organization (unit, integration, functional, E2E) * Fix CI/CD test failures: - Export templates from dashboard.app - Fix model name assertion in test_agent.py - Fix platform-agnostic path resolution in test_path_resolution.py - Skip Docker tests in test_docker_deployment.py if docker not available - Fix test_model_fallback_chain logic in test_ollama_integration.py * Add preventative pre-commit checks and Docker test skipif decorators: - Create pre_commit_checks.py script for common CI failures - Add skipif decorators to Docker tests - Improve test robustness for CI environments	2026-02-28 11:36:50 -05:00
Alexander Whitestone	a5fd680428	feat: microservices refactoring with TDD and Docker optimization (#88 ) ## Summary Complete refactoring of Timmy Time from monolithic architecture to microservices using Test-Driven Development (TDD) and optimized Docker builds. ## Changes ### Core Improvements - Optimized dashboard startup: moved blocking tasks to async background processes - Fixed model fallback logic in agent configuration - Enhanced test fixtures with comprehensive conftest.py ### Microservices Architecture - Created separate Dockerfiles for dashboard, Ollama, and agent services - Implemented docker-compose.microservices.yml for service orchestration - Added health checks and non-root user execution for security - Multi-stage Docker builds for lean, fast images ### Testing - Added E2E tests for dashboard responsiveness - Added E2E tests for Ollama integration - Added E2E tests for microservices architecture validation - All 36 tests passing, 8 skipped (environment-specific) ### Documentation - Created comprehensive final report - Generated issue resolution plan - Added interview transcript demonstrating core agent functionality ### New Modules - skill_absorption.py: Dynamic skill loading and integration system for Timmy ## Test Results ✅ 36 passed, 8 skipped, 6 warnings ✅ All microservices tests passing ✅ Dashboard responsiveness verified ✅ Ollama integration validated ## Files Added/Modified - docker/: Multi-stage Dockerfiles for all services - tests/e2e/: Comprehensive E2E test suite - src/timmy/skill_absorption.py: Skill absorption system - src/dashboard/app.py: Optimized startup logic - tests/conftest.py: Enhanced test fixtures - docker-compose.microservices.yml: Service orchestration ## Breaking Changes None - all changes are backward compatible ## Next Steps - Integrate skill absorption system into agent workflow - Test with microservices-tdd-refactor skill - Deploy to production with docker-compose orchestration	2026-02-28 11:07:19 -05:00
Claude	eb501c43da	fix: resolve 8 test failures from missing requests stub and wrong python path - Add `requests` to conftest.py module stubs so patch("requests.post") works in reward scoring tests without the package installed - Use sys.executable instead of bare "python" in git safety tests so the subprocess finds pytest from the venv rather than system python https://claude.ai/code/session_012Ye9nyFEiw2QQfx4bZeDmn	2026-02-27 02:06:45 +00:00
Claude	15596ca325	feat: add Discord integration with chat_bridge abstraction layer Introduces a vendor-agnostic chat platform architecture: - chat_bridge/base.py: ChatPlatform ABC, ChatMessage, ChatThread - chat_bridge/registry.py: PlatformRegistry singleton - chat_bridge/invite_parser.py: QR + Ollama vision invite extraction - chat_bridge/vendors/discord.py: DiscordVendor with native threads Workflow: paste a screenshot of a Discord invite or QR code at POST /discord/join → Timmy extracts the invite automatically. Every Discord conversation gets its own thread, keeping channels clean. Bot responds to @mentions and DMs, routes through Timmy agent. 43 new tests (base classes, registry, invite parser, vendor, routes). https://claude.ai/code/session_01WU4h3cQQiouMwmgYmAgkMM	2026-02-25 01:11:14 +00:00
Alexander Payne	c5df954d44	feat: Lightning interface, swarm routing, sovereignty audit, embodiment prep Lightning Backend Interface: - Abstract LightningBackend with pluggable implementations - MockBackend for development (auto-settle invoices) - LndBackend stub with gRPC integration path documented - Backend factory for runtime selection via LIGHTNING_BACKEND env Intelligent Swarm Routing: - CapabilityManifest for agent skill declarations - Task scoring based on keywords + capabilities + bid price - RoutingDecision audit logging to SQLite - Agent stats tracking (wins, consideration rate) Sovereignty Audit: - Comprehensive audit report (docs/SOVEREIGNTY_AUDIT.md) - 9.2/10 sovereignty score - Documented all external dependencies and local alternatives Substrate-Agnostic Agent Interface: - TimAgent abstract base class - Perception/Action/Memory/Communication types - OllamaAdapter implementation - Foundation for future embodiment (robot, VR) Tests: - 36 new tests for Lightning and routing - 472 total tests passing - Maintained 0 warning policy	2026-02-22 20:20:11 -05:00
Alexander Payne	f0aa43533f	feat: swarm E2E, MCP tools, timmy-serve L402, tests, notifications Major Features: - Auto-spawn persona agents (Echo, Forge, Seer) on app startup - WebSocket broadcasts for real-time swarm UI updates - MCP tool integration: web search, file I/O, shell, Python execution - New /tools dashboard page showing agent capabilities - Real timmy-serve start with L402 payment gating middleware - Browser push notifications for briefings and task events Tests: - test_docker_agent.py: 9 tests for Docker agent runner - test_swarm_integration_full.py: 18 E2E lifecycle tests - Fixed all pytest warnings (436 tests, 0 warnings) Improvements: - Fixed coroutine warnings in coordinator broadcasts - Fixed ResourceWarning for unclosed process pipes - Added pytest-asyncio config to pyproject.toml - Test isolation with proper event loop cleanup	2026-02-22 19:01:04 -05:00
Alexander Payne	4020b5222f	feat: add Docker-based swarm agent containerization Add infrastructure for running swarm agents as isolated Docker containers with HTTP-based coordination, startup recovery, and enhanced dashboard UI for agent management. - Dockerfile and docker-compose.yml for multi-service orchestration - DockerAgentRunner for programmatic container lifecycle management - Internal HTTP API for container agents to poll tasks and submit bids - Startup recovery system to reconcile orphaned tasks and stale agents - Enhanced UI partials for agent panels, chat, and task assignment - Timmy docker entry point with heartbeat and task polling - New Makefile targets for Docker workflows - Tests for swarm recovery Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 16:21:32 -05:00
Claude	bb93697b92	feat: add Telegram bot integration Bridges Telegram messages to Timmy via python-telegram-bot (optional dependency). The bot token can be supplied through the TELEGRAM_TOKEN env var or at runtime via the new POST /telegram/setup dashboard endpoint, which (re)starts the bot without a restart. Changes: - src/telegram_bot/bot.py — TelegramBot singleton: token persistence (telegram_state.json), lifecycle (start/stop), /start command and message handler that forwards to Timmy - src/dashboard/routes/telegram.py — /telegram/setup and /telegram/status FastAPI routes - src/dashboard/app.py — register telegram router; auto-start/stop bot in lifespan hook - src/config.py — TELEGRAM_TOKEN setting (pydantic-settings) - pyproject.toml — [telegram] optional extra (python-telegram-bot>=21), telegram_bot wheel include - .env.example — TELEGRAM_TOKEN section - .gitignore — exclude telegram_state.json (contains token) - tests/conftest.py — stub telegram/telegram.ext for offline test runs - tests/test_telegram_bot.py — 16 tests covering token helpers, lifecycle, and all dashboard routes (370 total, all passing) https://claude.ai/code/session_01CNBm3ZLobtx3Z1YogHq8ZS	2026-02-22 17:16:12 +00:00
Claude	19af4ae540	feat: integrate AirLLM as optional high-performance backend Adds the `bigbrain` optional dependency group (airllm>=2.9.0) and a complete second inference path that runs 8B / 70B / 405B Llama models locally via layer-by-layer loading — no GPU required, no cloud, fully sovereign. Key changes: - src/timmy/backends.py — TimmyAirLLMAgent (same print_response interface as Agno Agent); auto-selects AirLLMMLX on Apple Silicon, AutoModel (PyTorch) everywhere else - src/timmy/agent.py — _resolve_backend() routing with explicit override, env-config, and 'auto' Apple-Silicon detection - src/timmy/cli.py — --backend / --model-size flags on all commands - src/config.py — timmy_model_backend + airllm_model_size settings - src/timmy/prompts.py — mentions AirLLM "even bigger brains, still fully sovereign" - pyproject.toml — bigbrain optional dep; wheel includes updated - .env.example — TIMMY_MODEL_BACKEND + AIRLLM_MODEL_SIZE docs - tests/conftest.py — stubs 'airllm' module so tests run without GPU - tests/test_backends.py — 13 new tests covering helpers + TimmyAirLLMAgent - tests/test_agent.py — 7 new tests for backend routing - README.md — Big Brain section with one-line install - activate_self_tdd.sh — bootstrap script (venv + install + tests + watchdog + dashboard); --big-brain flag All 61 tests pass. Self-TDD watchdog unaffected. https://claude.ai/code/session_01DMjQ5qMZ8iHeyix1j3GS7c	2026-02-21 16:53:16 +00:00
Claude	0d14be291a	feat: persistent chat history with clear button - Add dashboard/store.py: MessageLog dataclass singleton tracking user/agent/error messages for the lifetime of the server process - agents.py: write each chat turn to MessageLog; add GET and DELETE /agents/timmy/history routes returning the history.html partial - partials/history.html: render stored messages by role (YOU / TIMMY / SYSTEM); falls back to the Mission Control init message when empty - index.html: chat-log loads history via hx-get on page start; new CLEAR button in panel header sends hx-delete to reset the log - style.css: add .mc-btn-clear (muted, red-on-hover for the header) - tests: autouse reset_message_log fixture in conftest; 5 new history tests covering empty state, recording, offline errors, clear, and post-clear state → 32 tests total, all passing https://claude.ai/code/session_01KZMfwBpLuiv6x9GbzTqbys	2026-02-20 18:55:14 +00:00
Claude	5e7d805245	feat: scaffold Timmy Time Mission Control (v1.0.0 Genesis) - src/timmy/ — Agno agent wrapper (llama3.2 via Ollama, SQLite memory, TIMMY_SYSTEM_PROMPT) - src/dashboard/ — FastAPI + HTMX + Jinja2 Mission Control UI - /health + /health/status (Ollama ping, HTMX 30s poll) - /agents list + /agents/timmy/chat (HTMX form submission) - static/style.css — dark terminal mission-control aesthetic - tests/ — 27 pytest tests (prompts, agent config, dashboard routes); no Ollama required - pyproject.toml — hatchling build, pytest configured with pythonpath=src https://claude.ai/code/session_01M4L3R98N5fgXFZRvV8X9b6	2026-02-19 19:05:01 +00:00

35 Commits