Timmy-time-dashboard

Author	SHA1	Message	Date
Alexander Payne	8fec9c41a5	feat: autonomous self-modifying agent with multi-backend LLM support Adds SelfModifyLoop — an edit→validate→test→commit cycle that can read its own failure reports, diagnose root causes, and restart autonomously. Key capabilities: - Multi-backend LLM: Anthropic Claude API, Ollama, or auto-detect - Syntax validation via compile() before writing to disk - Autonomous self-correction loop with configurable max cycles - XML-based output format to avoid triple-quote delimiter conflicts - Branch creation skipped by default to prevent container restarts - CLI: self-modify run "instruction" --backend auto --autonomous - 939 tests passing, 30 skipped Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 17:23:47 -05:00
Alexander Whitestone	c430f8002c	Merge pull request #29 from AlexanderWhitestone/fix/xss-prevention-mobile-test Security: XSS Prevention in Mobile Test Page	2026-02-25 08:01:05 -05:00
Alexander Payne	3463f4e4a4	fix: rename src/websocket to src/ws_manager to avoid websocket-client clash selenium depends on websocket-client which installs a top-level `websocket` package that shadows our src/websocket/ module on CI. Renaming to ws_manager eliminates the conflict entirely — no more sys.path hacks needed in conftest or Selenium tests. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 07:57:28 -05:00
Alexander Payne	e483748816	fix: resolve websocket-client shadowing src/websocket on CI selenium depends on websocket-client which installs a top-level `websocket` package that shadows our src/websocket/ module. Ensure src/ is inserted at the front of sys.path in conftest so the project module wins the import race. Fixes collection errors for test_websocket.py and test_websocket_extended.py on GitHub Actions. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 07:32:57 -05:00
Alexander Payne	29292cfb84	feat: single-command Docker startup, fix UI bugs, add Selenium tests - Add `make up` / `make up DEV=1` for one-command Docker startup with optional hot-reload via docker-compose.dev.yml overlay - Add `timmy up --dev` / `timmy down` CLI commands - Fix cross-platform font resolution in creative assembler (7 test failures) - Fix Ollama host URL not passed to Agno model (container connectivity) - Fix task panel route shadowing by reordering literal routes before parameterized routes in swarm.py - Fix chat input not clearing after send (hx-on::after-request) - Fix chat scroll overflow (CSS min-height: 0 on flex children) - Add Selenium UI smoke tests (17 tests, gated behind SELENIUM_UI=1) - Install fonts-dejavu-core in Dockerfile for container font support - Remove obsolete docker-compose version key - Bump CSS cache-bust to v4 833 unit tests pass, 15 Selenium tests pass (2 skipped). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 07:20:56 -05:00
AlexanderWhitestone	bc1be23e23	security: prevent XSS in mobile-test by using textContent	2026-02-25 02:08:02 -05:00
Claude	78cf91697c	feat: add functional Ollama chat tests with containerised LLM Add an ollama service (behind --profile ollama) to the test compose stack and a new test suite that verifies real LLM inference end-to-end: - docker-compose.test.yml: add ollama/ollama service with health check, make OLLAMA_URL and OLLAMA_MODEL configurable via env vars - tests/functional/test_ollama_chat.py: session-scoped fixture that brings up Ollama + dashboard, pulls qwen2.5:0.5b (~400MB, CPU-only), and runs chat/history/multi-turn tests against the live stack - Makefile: add `make test-ollama` target Run with: make test-ollama (or FUNCTIONAL_DOCKER=1 pytest tests/functional/test_ollama_chat.py -v) https://claude.ai/code/session_01NTEzfRHSZQCfkfypxgyHKk	2026-02-25 02:44:36 +00:00
Claude	15596ca325	feat: add Discord integration with chat_bridge abstraction layer Introduces a vendor-agnostic chat platform architecture: - chat_bridge/base.py: ChatPlatform ABC, ChatMessage, ChatThread - chat_bridge/registry.py: PlatformRegistry singleton - chat_bridge/invite_parser.py: QR + Ollama vision invite extraction - chat_bridge/vendors/discord.py: DiscordVendor with native threads Workflow: paste a screenshot of a Discord invite or QR code at POST /discord/join → Timmy extracts the invite automatically. Every Discord conversation gets its own thread, keeping channels clean. Bot responds to @mentions and DMs, routes through Timmy agent. 43 new tests (base classes, registry, invite parser, vendor, routes). https://claude.ai/code/session_01WU4h3cQQiouMwmgYmAgkMM	2026-02-25 01:11:14 +00:00
Claude	2c419a777d	fix: skip Docker tests gracefully when daemon is unavailable The docker_stack fixture now checks `docker info` before attempting `compose up`. If the daemon isn't reachable, tests skip instead of erroring with pytest.fail. https://claude.ai/code/session_01WU4h3cQQiouMwmgYmAgkMM	2026-02-25 00:49:06 +00:00
Claude	c91e02e7c5	test: add functional test suite with real fixtures, no mocking Three-tier functional test infrastructure: - CLI tests via Typer CliRunner (timmy, timmy-serve, self-tdd) - Dashboard integration tests with real TestClient, real SQLite, real coordinator (no patch/mock — Ollama offline = graceful degradation) - Docker compose container-level tests (gated by FUNCTIONAL_DOCKER=1) - End-to-end L402 payment flow with real mock-lightning backend 42 new tests (8 Docker tests skipped without FUNCTIONAL_DOCKER=1). All 849 tests pass. https://claude.ai/code/session_01WU4h3cQQiouMwmgYmAgkMM	2026-02-25 00:46:22 +00:00
Claude	3e51434b4b	test: add 157 functional tests covering 8 low-coverage modules Analyze test coverage (75.3% → 85.4%) and add functional test suites for the major gaps identified: - test_agent_core.py: Full coverage for agent_core/interface.py (0→100%) and agent_core/ollama_adapter.py (0→100%) — data classes, factories, abstract enforcement, perceive/reason/act/recall workflow, effect logging - test_docker_runner.py: Full coverage for swarm/docker_runner.py (0→100%) — container spawn/stop/list lifecycle with mocked subprocess - test_timmy_tools.py: Tool usage tracking, persona toolkit mapping, catalog generation, graceful degradation without Agno - test_routes_tools.py: /tools page, API stats endpoint, and WebSocket /swarm/live connect/disconnect/send lifecycle (41→82%) - test_voice_tts_functional.py: VoiceTTS init, speak, volume clamping, voice listing, graceful degradation (41→94%) - test_watchdog_functional.py: _run_tests, watch loop state transitions, regression detection, KeyboardInterrupt (47→97%) - test_lnd_backend.py: LND init from params/env, grpc stub enforcement, method-level BackendNotAvailableError, settle returns False (25→61%) - test_swarm_routes_functional.py: Agent spawn/stop, task CRUD, auction, insights, UI partials, error paths (63→92%) https://claude.ai/code/session_01WU4h3cQQiouMwmgYmAgkMM	2026-02-24 23:36:50 +00:00
Claude	65a278dbee	fix: comprehensive iPhone UI overhaul — glassmorphism, responsive layouts, theme unification - base.html: add missing {% block extra_styles %}, mobile hamburger menu with slide-out nav, interactive-widget viewport meta, -webkit-text-size-adjust - style.css: define 15+ missing CSS variables (--bg-secondary, --text-muted, --accent, --success, --danger, etc.), add missing utility classes (.grid, .stat, .agent-card, .agent-avatar, .form-group), glassmorphism card effects, iPhone breakpoints (768px, 390px), 44pt min touch targets, smooth animations - mobile.html: rewrite with proper theme variables, glass cards, touch-friendly quick actions grid, chat with proper message bubbles - swarm_live.html: replace undefined CSS vars, use mc-panel theme cards - marketplace.html: responsive agent cards that stack on iPhone, themed pricing - voice_button.html & voice_enhanced.html: proper theme integration, touch-sized buttons, themed result containers - create_task.html: mobile-friendly forms with 16px font (prevents iOS zoom) - tools.html & creative.html: themed headers, responsive column stacking - spark.html: replace all hardcoded blue (#00d4ff) colors with theme purple/orange - briefing.html: replace hardcoded bootstrap colors with theme variables Fixes: header nav overflow on iPhone (7 links in single row), missing extra_styles block silently dropping child template styles, undefined CSS variables breaking mobile/swarm/marketplace/voice pages, sub-44pt touch targets, missing -webkit-text-size-adjust, inconsistent color themes. 97 UI tests pass (91 UI-specific + 6 creative route). https://claude.ai/code/session_01JiyhGyee2zoMN4p8xWYqEe	2026-02-24 22:25:04 +00:00
Alexander Whitestone	7018a756b3	Merge pull request #22 from AlexanderWhitestone/claude/audit-timmy-dashboard-ft27r	2026-02-24 14:18:29 -05:00
Claude	96c9f1b02f	fix: address audit low-hanging fruit — docs accuracy, auction timing, stubs, tests - Docs: "No Cloud" → "No Cloud AI" (frontend uses CDN for Bootstrap/HTMX/fonts) - Docs: "600+" → "640+" tests, "20+" → "58" endpoints (actual counts) - Docs: LND described as "scaffolded" not "gRPC-ready"; remove "agents earn sats" - Fix auction timing: coordinator sleep(0) → sleep(AUCTION_DURATION_SECONDS) - agent_core: implement remember() with dedup/eviction, communicate() via swarm comms - Tests: add CLI tests for chat, think, and backend/model-size forwarding (647 passing) https://claude.ai/code/session_01SZTwAkTg6v4ybv8g9NLxqN	2026-02-24 18:29:21 +00:00
Alexander Whitestone	03ff505c4b	Merge pull request #23 from AlexanderWhitestone/security/macaroon-forgery-and-xss-1771955896	2026-02-24 13:00:52 -05:00
AlexanderWhitestone	4daf382819	security: fix L402 macaroon forgery and XSS in templates	2026-02-24 12:58:19 -05:00
Claude	832478f0d0	fix: serve_chat endpoint bug, stale docs, and license mismatch - Fix /serve/chat AttributeError: split Request and ChatRequest params so auth headers are read from HTTP request, not Pydantic body - Add regression tests for the serve_chat endpoint bug - Add agent_core and lightning to pyproject.toml wheel includes - Replace Apache 2.0 LICENSE with MIT to match pyproject.toml - Update test count from "228" to "600+" across README, docs, AGENTS.md - Add 5 missing subsystems to README table (Spark, Creative, Tools, Telegram, agent_core/lightning) - Update AGENTS.md project structure with 6 missing modules - Mark completed v2 roadmap items (personas, MCP tools) in AGENTS.md https://claude.ai/code/session_01GMiccXbo77GkV3TA69x6KS	2026-02-24 17:18:29 +00:00
Claude	b098b00959	test: add integration tests with real media for music video pipeline Build real PNG, WAV, and MP4 fixtures (no AI models) and exercise the full assembler and Creative Director pipeline end-to-end. Fix MoviePy v2 crossfade API (vfx.CrossFadeIn) and font resolution (DejaVu-Sans). 14 new integration tests — 638 total, all passing. https://claude.ai/code/session_01KJm6jQkNi3aA3yoQJn636c	2026-02-24 16:48:14 +00:00
Claude	1103da339c	feat: add full creative studio + DevOps tools (Pixel, Lyra, Reel personas) Adds 3 new personas (Pixel, Lyra, Reel) and 5 new tool modules: - Git/DevOps tools (GitPython): clone, status, diff, log, blame, branch, add, commit, push, pull, stash — wired to Forge and Helm personas - Image generation (FLUX via diffusers): text-to-image, storyboards, variations — Pixel persona - Music generation (ACE-Step 1.5): full songs with vocals+instrumentals, instrumental tracks, vocal-only tracks — Lyra persona - Video generation (Wan 2.1 via diffusers): text-to-video, image-to-video clips — Reel persona - Creative Director pipeline: multi-step orchestration that chains storyboard → music → video → assembly into 3+ minute final videos - Video assembler (MoviePy + FFmpeg): stitch clips, overlay audio, title cards, subtitles, final export Also includes: - Spark Intelligence tool-level + creative pipeline event capture - Creative Studio dashboard page (/creative/ui) with 4 tabs - Config settings for all new models and output directories - pyproject.toml creative optional extra for GPU dependencies - 107 new tests covering all modules (624 total, all passing) https://claude.ai/code/session_01KJm6jQkNi3aA3yoQJn636c	2026-02-24 16:31:47 +00:00
Claude	1ab26d30ad	feat: integrate Spark Intelligence into Timmy swarm system Adds a self-evolving cognitive layer inspired by vibeship-spark-intelligence, adapted for Timmy's agent architecture. Spark captures swarm events, runs EIDOS prediction-evaluation loops, consolidates memories, and generates advisory recommendations — all backed by SQLite consistent with existing patterns. New modules: - spark/memory.py — event capture with importance scoring + memory consolidation - spark/eidos.py — EIDOS cognitive loop (predict → observe → evaluate → learn) - spark/advisor.py — ranked advisory generation from accumulated intelligence - spark/engine.py — top-level API wiring all subsystems together Dashboard: - /spark/ui — full Spark Intelligence dashboard (3-column: status/advisories, predictions/memories, event timeline) with HTMX auto-refresh - /spark — JSON API for programmatic access - SPARK link added to navigation header Integration: - Coordinator hooks emit Spark events on task post, bid, assign, complete, fail - EIDOS predictions generated when tasks are posted, evaluated on completion - Memory consolidation triggers when agents accumulate enough outcomes - SPARK_ENABLED config toggle (default: true) Tests: 47 new tests covering all Spark subsystems + dashboard routes. Full suite: 538 tests passing. https://claude.ai/code/session_01KJm6jQkNi3aA3yoQJn636c	2026-02-24 15:51:15 +00:00
Alexander Payne	ace5bfdf5f	feat: Mission Control dashboard with sovereignty audit + scary path tests Mission Control Dashboard: - /swarm/mission-control page with real-time system status - Sovereignty score display with visual progress bar - Dependency health grid (Ollama, Redis, Lightning, SQLite) - Recommendations based on dependency status - Heartbeat monitor with tick counter - System metrics: uptime, agents, tasks, sats earned Health Endpoints: - /health/sovereignty - Full sovereignty audit report - /health/components - Component status and config Tests (TDD approach): - 11 Mission Control tests (all passing) - 23 scary path tests for production scenarios - Concurrent load, memory persistence, edge cases Total: 525 tests passing	2026-02-22 20:48:14 -05:00
Alexander Payne	ca60483268	feat: pytest-cov configuration and test audit cleanup Add full pytest-cov configuration with fail_under=60% threshold, HTML/XML report targets, and proper exclude_lines. Fix websocket history test to use public broadcast() API instead of manually manipulating internals. Audit confirmed 491 tests at 71.2% coverage. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 20:42:58 -05:00
Alexander Payne	14072f9bb5	feat: MCP tools integration for swarm agents ToolExecutor: - Persona-specific toolkit selection (forge gets code tools, echo gets search) - Tool inference from task keywords (search→web_search, code→python) - LLM-powered reasoning about tool selection - Graceful degradation when Agno unavailable PersonaNode Updates: - Subscribe to swarm:events for task assignments - Execute tasks using ToolExecutor when assigned - Complete tasks via comms.complete_task() - Track current_task for status monitoring Tests: - 19 new tests for tool execution - All 6 personas covered - Tool inference verification - Edge cases (no toolkit, unknown tasks) Total: 491 tests passing	2026-02-22 20:33:26 -05:00
Alexander Payne	c5df954d44	feat: Lightning interface, swarm routing, sovereignty audit, embodiment prep Lightning Backend Interface: - Abstract LightningBackend with pluggable implementations - MockBackend for development (auto-settle invoices) - LndBackend stub with gRPC integration path documented - Backend factory for runtime selection via LIGHTNING_BACKEND env Intelligent Swarm Routing: - CapabilityManifest for agent skill declarations - Task scoring based on keywords + capabilities + bid price - RoutingDecision audit logging to SQLite - Agent stats tracking (wins, consideration rate) Sovereignty Audit: - Comprehensive audit report (docs/SOVEREIGNTY_AUDIT.md) - 9.2/10 sovereignty score - Documented all external dependencies and local alternatives Substrate-Agnostic Agent Interface: - TimAgent abstract base class - Perception/Action/Memory/Communication types - OllamaAdapter implementation - Foundation for future embodiment (robot, VR) Tests: - 36 new tests for Lightning and routing - 472 total tests passing - Maintained 0 warning policy	2026-02-22 20:20:11 -05:00
Alexander Payne	f0aa43533f	feat: swarm E2E, MCP tools, timmy-serve L402, tests, notifications Major Features: - Auto-spawn persona agents (Echo, Forge, Seer) on app startup - WebSocket broadcasts for real-time swarm UI updates - MCP tool integration: web search, file I/O, shell, Python execution - New /tools dashboard page showing agent capabilities - Real timmy-serve start with L402 payment gating middleware - Browser push notifications for briefings and task events Tests: - test_docker_agent.py: 9 tests for Docker agent runner - test_swarm_integration_full.py: 18 E2E lifecycle tests - Fixed all pytest warnings (436 tests, 0 warnings) Improvements: - Fixed coroutine warnings in coordinator broadcasts - Fixed ResourceWarning for unclosed process pipes - Added pytest-asyncio config to pyproject.toml - Test isolation with proper event loop cleanup	2026-02-22 19:01:04 -05:00
Claude	167fd0a7b4	Add outcome-based learning system for swarm agents Introduce a feedback loop where task outcomes (win/loss, success/failure) feed back into agent bidding strategy. Borrows the "learn from outcomes" concept from Spark Intelligence but builds it natively on Timmy's existing SQLite + swarm architecture. New module: src/swarm/learner.py - Records every bid outcome with task description context - Computes per-agent metrics: win rate, success rate, keyword performance - suggest_bid() adjusts bids based on historical performance - learned_keywords() discovers what task types agents actually excel at Changes: - persona_node: _compute_bid() now consults learner for adaptive adjustments - coordinator: complete_task/fail_task feed results into learner - coordinator: run_auction_and_assign records all bid outcomes - routes/swarm: add /swarm/insights and /swarm/insights/{agent_id} endpoints - routes/swarm: add POST /swarm/tasks/{task_id}/fail endpoint All 413 tests pass (23 new + 390 existing). https://claude.ai/code/session_01E5jhTCwSUnJk9p9zrTMVUJ	2026-02-22 22:04:37 +00:00
Alexander Payne	4020b5222f	feat: add Docker-based swarm agent containerization Add infrastructure for running swarm agents as isolated Docker containers with HTTP-based coordination, startup recovery, and enhanced dashboard UI for agent management. - Dockerfile and docker-compose.yml for multi-service orchestration - DockerAgentRunner for programmatic container lifecycle management - Internal HTTP API for container agents to poll tasks and submit bids - Startup recovery system to reconcile orphaned tasks and stale agents - Enhanced UI partials for agent panels, chat, and task assignment - Timmy docker entry point with heartbeat and task polling - New Makefile targets for Docker workflows - Tests for swarm recovery Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 16:21:32 -05:00
Claude	bb93697b92	feat: add Telegram bot integration Bridges Telegram messages to Timmy via python-telegram-bot (optional dependency). The bot token can be supplied through the TELEGRAM_TOKEN env var or at runtime via the new POST /telegram/setup dashboard endpoint, which (re)starts the bot without a restart. Changes: - src/telegram_bot/bot.py — TelegramBot singleton: token persistence (telegram_state.json), lifecycle (start/stop), /start command and message handler that forwards to Timmy - src/dashboard/routes/telegram.py — /telegram/setup and /telegram/status FastAPI routes - src/dashboard/app.py — register telegram router; auto-start/stop bot in lifespan hook - src/config.py — TELEGRAM_TOKEN setting (pydantic-settings) - pyproject.toml — [telegram] optional extra (python-telegram-bot>=21), telegram_bot wheel include - .env.example — TELEGRAM_TOKEN section - .gitignore — exclude telegram_state.json (contains token) - tests/conftest.py — stub telegram/telegram.ext for offline test runs - tests/test_telegram_bot.py — 16 tests covering token helpers, lifecycle, and all dashboard routes (370 total, all passing) https://claude.ai/code/session_01CNBm3ZLobtx3Z1YogHq8ZS	2026-02-22 17:16:12 +00:00
Claude	ce6077be0c	feat(briefing): morning briefing + approval queue Implements the Morning Briefing and Approval Queue feature — the first step from tool to companion. Timmy now shows up before the owner asks. New modules ----------- • src/timmy/approvals.py — ApprovalItem dataclass, GOLDEN_TIMMY governance constant, full SQLite CRUD (create / list / approve / reject / expire). Items auto-expire after 7 days if not actioned. • src/timmy/briefing.py — BriefingEngine that queries swarm activity and chat history, calls Timmy's Agno agent for a prose summary, and caches the result in SQLite (~/.timmy/briefings.db). get_or_generate() skips regeneration if a fresh briefing (< 30 min) already exists. New routes (src/dashboard/routes/briefing.py) ---------------------------------------------- GET /briefing — full briefing page GET /briefing/approvals — HTMX partial: pending approval cards POST /briefing/approvals/{id}/approve — approve via HTMX (no page reload) POST /briefing/approvals/{id}/reject — reject via HTMX (no page reload) New templates ------------- • briefing.html — clean, mobile-first prose layout (max 680px) • partials/approval_cards.html — list of approval cards • partials/approval_card_single.html — single approval card with Approve/Reject HTMX buttons App wiring (src/dashboard/app.py) ---------------------------------- • Added asynccontextmanager lifespan with _briefing_scheduler background task. Generates a briefing at startup and every 6 hours; skips if fresh. Push notification hook (src/notifications/push.py) --------------------------------------------------- • notify_briefing_ready(briefing) — logs + triggers local notifier. Placeholder for APNs/Pushover wiring later. Navigation ---------- • Added BRIEFING link to the header nav in base.html. Tests ----- • tests/test_approvals.py — 17 tests: GOLDEN_TIMMY, CRUD, expiry, ordering • tests/test_briefing.py — 22 tests: dataclass, freshness, cache round-trip, generate/get_or_generate, push notification hook 354 tests, 354 passing. https://claude.ai/code/session_01D7p5w91KX3grBeioGiiGy8	2026-02-22 14:04:20 +00:00
Claude	cc7f0bdc0a	feat(swarm): agent personas, bid stats persistence, marketplace frontend v2.0.0 Exodus — three roadmap items implemented in one PR: 1. Agent Personas (Echo, Mace, Helm, Seer, Forge, Quill) - src/swarm/personas.py — PERSONAS dict with role, description, capabilities, rate_sats, bid_base/jitter, and preferred_keywords for each of the 6 agents - src/swarm/persona_node.py — PersonaNode extends SwarmNode with capability- aware bidding: bids lower when the task description contains a preferred keyword (specialist advantage), higher otherwise (off-spec inflation) - SwarmCoordinator.spawn_persona(persona_id) — registers the persona in the SQLite registry with its full capabilities string and wires it into the shared AuctionManager via comms subscription 2. Bid History Persistence (prerequisite for marketplace stats) - src/swarm/stats.py — bid_history table in data/swarm.db: record_bid(), mark_winner(), get_agent_stats(), get_all_agent_stats() - coordinator.run_auction_and_assign() now calls swarm_stats.mark_winner() when a winner is chosen, so tasks_won/total_earned survive restarts - spawn_persona() records each bid for stats tracking 3. Marketplace Frontend wired to real data - /marketplace/ui — new HTML route renders marketplace.html with live registry status (idle/busy/offline/planned) and cumulative bid stats - /marketplace JSON endpoint enriched with same registry+stats data - marketplace.html — fixed field names (rate_sats, tasks_completed, total_earned), added role subtitle, comma-split capabilities string, FREE label for Timmy, "planned_count" display - base.html — added MARKET nav link pointing to /marketplace/ui Tests: 315 passed (87 new) covering personas, persona_node, stats CRUD, marketplace UI route, and enriched catalog data. https://claude.ai/code/session_013CPPgLc589wfdS8LDNuarL	2026-02-22 12:21:50 +00:00
Manus AI	2e055635a8	test: close coverage gaps for timmy_serve CLI, voice_enhanced, WebSocket, and swarm live page - Add 8 tests for timmy_serve/cli.py (start, invoice, status commands) covering default args, custom args, and output validation - Add 8 tests for voice_enhanced route covering all intent types (status, help, swarm, voice, chat fallback) plus error handling - Add 17 tests for websocket/handler.py covering broadcast to multiple clients, dead connection cleanup, history trimming, connect/disconnect, and all convenience broadcast methods - Add 4 tests for the new GET /swarm/live page route Total new tests: 37	2026-02-21 13:45:24 -05:00
Manus AI	ee45a16267	feat(swarm): add in-process agent spawning with live auction bidding - Add spawn_in_process_agent() to SwarmCoordinator: creates lightweight SwarmNode instances that share the coordinator's comms layer and AuctionManager, enabling synchronous bid submission - Fix post_task() to open the auction BEFORE announcing via comms so in-process agent callbacks can submit bids into an open auction - Fix run_auction_and_assign() to close an already-open auction instead of re-opening (which would discard bids) - Add POST /swarm/tasks/auction route for atomic task+auction flow - Add 7 integration tests (TDD) covering the full lifecycle: spawn → post → auction → assign → complete	2026-02-21 13:42:31 -05:00
Claude	95555b3738	feat: senior architect quality analysis + XSS fixes + HITL guide - Add QUALITY_ANALYSIS.md — 10-point architect review covering architecture coherence, completeness (~35-40% vs vision), mobile UX, security, test coverage, code quality, and DX - Fix P0 XSS: mobile.html chat input now uses DOM textContent instead of innerHTML string interpolation with raw user input - Fix P0 XSS: swarm_live.html agent/auction rendering rewritten with safe DOM methods (_t/_el helpers) — no more ${agent.name} in innerHTML - Add M7xx test category (4 new tests) covering XSS prevention assertions; total suite now 232 passing (was 228) - HITL session guide included in analysis with step-by-step phone test instructions and critical scenario priority ordering https://claude.ai/code/session_0183Nzcy7TMqjrAopnTtygds	2026-02-21 18:11:22 +00:00
Alexspayne	f9b84c1e2f	feat: Mission Control v2 — swarm, L402, voice, marketplace, React dashboard Major expansion of the Timmy Time Dashboard: Backend modules: - Swarm subsystem: registry, manager, bidder, coordinator, agent_runner, swarm_node, tasks, comms - L402/Lightning: payment_handler, l402_proxy with HMAC macaroons - Voice NLU: regex-based intent detection (chat, status, swarm, task, help, voice) - Notifications: push notifier for swarm events - Shortcuts: Siri Shortcuts iOS integration endpoints - WebSocket: live dashboard event manager - Inter-agent: agent-to-agent messaging layer Dashboard routes: - /swarm/* — swarm management and agent registry - /marketplace — agent catalog with sat pricing - /voice/* — voice command processing - /mobile — mobile status endpoint - /swarm/live — WebSocket live feed React web dashboard (dashboard-web/): - Sovereign Terminal design — dark theme with Bitcoin orange accents - Three-column layout: status sidebar, workspace tabs, context panel - Chat, Swarm, Tasks, Marketplace tab views - JetBrains Mono typography, terminal aesthetic - Framer Motion animations throughout Tests: 228 passing (expanded from 93) Includes Kimi's additional templates and QA work.	2026-02-21 12:57:38 -05:00
Claude	c8aa6a5fbb	feat: quality analysis — bug fixes, mobile tests, HITL checklist Senior architect review findings + remediations: BUG FIX — critical interface mismatch - TimmyAirLLMAgent only exposed print_response(); dashboard route calls agent.run() → AttributeError when AirLLM backend is selected. Added run() → RunResult(content) as primary inference entry point; print_response() now delegates to run() so both call sites share one inference path. - Added RunResult dataclass for Agno-compatible structured return. BUG FIX — hardcoded model name in health status partial - health_status.html rendered literal "llama3.2" regardless of OLLAMA_MODEL env var. Route now passes settings.ollama_model to the template context; partial renders {{ model }} instead. FEATURE — /mobile-test HITL checklist page - 22 human-executable test scenarios across: Layout, Touch & Input, Chat behaviour, Health, Scroll, Notch/Home Bar, Live UI. - Pass/Fail/Skip buttons with sessionStorage state persistence. - Live progress bar + final score summary. - TEST link added to Mission Control header for quick access on phone. TEST — 32 new automated mobile quality tests (M1xx–M6xx) - M1xx: viewport/meta tags (8 tests) - M2xx: touch target sizing — 44 px min-height, manipulation (4 tests) - M3xx: iOS zoom prevention, autocapitalize, enterkeyhint (5 tests) - M4xx: HTMX robustness — hx-sync drop, disabled-elt, polling (5 tests) - M5xx: safe-area insets, overscroll, dvh units (5 tests) - M6xx: AirLLM interface contract — run(), RunResult, delegation (5 tests) Total test count: 61 → 93 (all passing). https://claude.ai/code/session_01RBuRCBXZNkAQQXXGiJNDmt	2026-02-21 17:21:47 +00:00
Claude	19af4ae540	feat: integrate AirLLM as optional high-performance backend Adds the `bigbrain` optional dependency group (airllm>=2.9.0) and a complete second inference path that runs 8B / 70B / 405B Llama models locally via layer-by-layer loading — no GPU required, no cloud, fully sovereign. Key changes: - src/timmy/backends.py — TimmyAirLLMAgent (same print_response interface as Agno Agent); auto-selects AirLLMMLX on Apple Silicon, AutoModel (PyTorch) everywhere else - src/timmy/agent.py — _resolve_backend() routing with explicit override, env-config, and 'auto' Apple-Silicon detection - src/timmy/cli.py — --backend / --model-size flags on all commands - src/config.py — timmy_model_backend + airllm_model_size settings - src/timmy/prompts.py — mentions AirLLM "even bigger brains, still fully sovereign" - pyproject.toml — bigbrain optional dep; wheel includes updated - .env.example — TIMMY_MODEL_BACKEND + AIRLLM_MODEL_SIZE docs - tests/conftest.py — stubs 'airllm' module so tests run without GPU - tests/test_backends.py — 13 new tests covering helpers + TimmyAirLLMAgent - tests/test_agent.py — 7 new tests for backend routing - README.md — Big Brain section with one-line install - activate_self_tdd.sh — bootstrap script (venv + install + tests + watchdog + dashboard); --big-brain flag All 61 tests pass. Self-TDD watchdog unaffected. https://claude.ai/code/session_01DMjQ5qMZ8iHeyix1j3GS7c	2026-02-21 16:53:16 +00:00
Claude	7619407b63	feat: add self-tdd watchdog — continuous test runner CLI Adds `src/self_tdd/watchdog.py` with a `_run_tests()` function that shells out to pytest and a `watch` command that polls on a configurable interval, printing green on recovery and full short-traceback output on regression. No files are modified and no commits are made automatically. Usage: self-tdd watch # default 60s interval self-tdd watch -i 15 # poll every 15s Also adds 6 unit tests and wires the `self-tdd` entry point + `src/self_tdd` wheel include into pyproject.toml. https://claude.ai/code/session_01DMjQ5qMZ8iHeyix1j3GS7c	2026-02-21 16:36:56 +00:00
Claude	982c42ba45	Use TIMMY_STATUS_PROMPT in CLI status command TIMMY_STATUS_PROMPT was defined in timmy/prompts.py and covered by tests, but never wired into the application. The CLI status command was passing a hardcoded inline string instead. Replace the inline string with the canonical prompt and add two CLI tests that verify the correct prompt is used. https://claude.ai/code/session_01DMjQ5qMZ8iHeyix1j3GS7c	2026-02-21 16:17:55 +00:00
Claude	0d14be291a	feat: persistent chat history with clear button - Add dashboard/store.py: MessageLog dataclass singleton tracking user/agent/error messages for the lifetime of the server process - agents.py: write each chat turn to MessageLog; add GET and DELETE /agents/timmy/history routes returning the history.html partial - partials/history.html: render stored messages by role (YOU / TIMMY / SYSTEM); falls back to the Mission Control init message when empty - index.html: chat-log loads history via hx-get on page start; new CLEAR button in panel header sends hx-delete to reset the log - style.css: add .mc-btn-clear (muted, red-on-hover for the header) - tests: autouse reset_message_log fixture in conftest; 5 new history tests covering empty state, recording, offline errors, clear, and post-clear state → 32 tests total, all passing https://claude.ai/code/session_01KZMfwBpLuiv6x9GbzTqbys	2026-02-20 18:55:14 +00:00
Claude	5e7d805245	feat: scaffold Timmy Time Mission Control (v1.0.0 Genesis) - src/timmy/ — Agno agent wrapper (llama3.2 via Ollama, SQLite memory, TIMMY_SYSTEM_PROMPT) - src/dashboard/ — FastAPI + HTMX + Jinja2 Mission Control UI - /health + /health/status (Ollama ping, HTMX 30s poll) - /agents list + /agents/timmy/chat (HTMX form submission) - static/style.css — dark terminal mission-control aesthetic - tests/ — 27 pytest tests (prompts, agent config, dashboard routes); no Ollama required - pyproject.toml — hatchling build, pytest configured with pythonpath=src https://claude.ai/code/session_01M4L3R98N5fgXFZRvV8X9b6	2026-02-19 19:05:01 +00:00
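
Note on commit 8fec9c41a5: its message mentions "Syntax validation via compile() before writing to disk". The following is a minimal Python sketch of that idea only. The function names validate_python_source and write_if_valid are hypothetical and are not taken from the repository; the actual SelfModifyLoop may be structured quite differently.

```python
from pathlib import Path
from typing import Optional


def validate_python_source(source: str, filename: str = "<proposed>") -> Optional[str]:
    """Return None if the source compiles, otherwise a short error description.

    Hypothetical helper: compile() only parses and byte-compiles the text,
    it does not execute it, so this is a cheap syntax gate.
    """
    try:
        compile(source, filename, "exec")
    except SyntaxError as exc:
        return f"{exc.msg} (line {exc.lineno})"
    except ValueError as exc:
        # Some Python versions raise ValueError for source containing null bytes.
        return str(exc)
    return None


def write_if_valid(path: Path, source: str) -> bool:
    """Write source to path only when it passes the syntax gate.

    Returns True when the file was written, False when it was rejected.
    """
    if validate_python_source(source, str(path)) is not None:
        return False
    path.write_text(source, encoding="utf-8")
    return True
```

A syntax gate of this kind rejects edits that would not even parse, so a broken file never reaches disk; it says nothing about whether the code behaves correctly, which is presumably why the commit pairs validation with a test step.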

340 Commits