Timmy-time-dashboard

Author	SHA1	Message	Date
Claude	f15559482b	docs: Update REFACTORING_PLAN.md with execution status Mark completed phases (1, 2a, 3, 4, 6) and document remaining work (full module consolidation, package extraction) with guidance on incremental execution approach. https://claude.ai/code/session_019oMFNvD8uSGSSmBMGkBfQN	2026-02-26 21:32:18 +00:00
Claude	d2c80fbf4c	refactor: Phase 2a — consolidate dashboard routes (27→22 files) Merge related route files to reduce sprawl: - voice.py ← voice_enhanced.py (enhanced pipeline merged in) - swarm.py ← swarm_internal.py + swarm_ws.py (internal API + WebSocket) - self_coding.py ← self_modify.py (self-modify endpoints merged in) - Delete mobile_test.py route + template (test-only page, not for prod) - Delete test_xss_prevention.py (tested the deleted mobile_test page) Update app.py to use consolidated imports. Update test_voice_enhanced.py patch paths. Remove mobile_test.py from coverage omit (file deleted). 27 route files → 22. Tests: 1502 passed (1 removed with deleted page). https://claude.ai/code/session_019oMFNvD8uSGSSmBMGkBfQN	2026-02-26 21:30:39 +00:00
Claude	4e11dd2490	refactor: Phase 3 — reorganize tests into module-mirroring subdirectories Move 97 test files from flat tests/ into 13 subdirectories: tests/dashboard/ (8 files — routes, mobile, mission control) tests/swarm/ (17 files — coordinator, docker, routing, tasks) tests/timmy/ (12 files — agent, backends, CLI, tools) tests/self_coding/ (14 files — git safety, indexer, self-modify) tests/lightning/ (3 files — L402, LND, interface) tests/creative/ (8 files — assembler, director, image/music/video) tests/integrations/ (10 files — chat bridge, telegram, voice, websocket) tests/mcp/ (4 files — bootstrap, discovery, executor) tests/spark/ (3 files — engine, tools, events) tests/hands/ (3 files — registry, oracle, phase5) tests/scripture/ (1 file) tests/infrastructure/ (3 files — router cascade, API) tests/security/ (3 files — XSS, regression) Fix Path(__file__) reference in test_mobile_scenarios.py for new depth. Add __init__.py to all test subdirectories. Tests: 1503 passed, 9 failed (pre-existing), 53 errors (pre-existing) https://claude.ai/code/session_019oMFNvD8uSGSSmBMGkBfQN	2026-02-26 21:21:28 +00:00
Claude	6045077144	refactor: Phase 1/4/6 — doc cleanup, config fix, token optimization Phase 1 — Documentation cleanup: - Slim README 303→93 lines (remove duplicated architecture, config tables) - Slim CLAUDE.md 267→80 lines (remove project layout, env vars, CI section) - Slim AGENTS.md 342→72 lines (remove duplicated patterns, running locally) - Delete MEMORY.md, WORKSET_PLAN.md, WORKSET_PLAN_PHASE2.md (session docs) - Archive PLAN.md, IMPLEMENTATION_SUMMARY.md to docs/ - Move QUALITY_ANALYSIS.md, QUALITY_REVIEW_REPORT.md to docs/ - Move apply_security_fixes.py, activate_self_tdd.sh to scripts/ Phase 4 — Config & build cleanup: - Fix wheel build: add 11 missing modules to pyproject.toml include list - Add pytest markers (unit, integration, dashboard, swarm, slow) - Add data/self_modify_reports/ and .handoff/ to .gitignore Phase 6 — Token optimization: - Add docstrings to 15 __init__.py files that were empty - Create __init__.py for events/, memory/, upgrades/ modules Root markdown: 87KB → ~18KB (79% reduction) https://claude.ai/code/session_019oMFNvD8uSGSSmBMGkBfQN	2026-02-26 21:03:15 +00:00
Claude	31760682f6	docs: Add comprehensive architectural refactoring plan Full VP-engineering-level review of the codebase identifying 8 problems (monolith sprawl, dashboard gravity well, doc entropy, test skeleton bloat, unclear project boundaries, broken wheel build, dashboard coupling, overscoped conftest) and proposing 6 phases of incremental refactoring from low-risk doc cleanup to potential package extraction. Key findings: - 28 modules in src/, 11 missing from wheel build - 87KB of root markdown with massive duplication - 61 of 97 test files are empty skeletons (0 test functions) - Dashboard routes: 27 files, 4,562 lines (gravity well) - 4 autouse fixtures run on every test regardless of need https://claude.ai/code/session_019oMFNvD8uSGSSmBMGkBfQN	2026-02-26 20:42:02 +00:00
Alexander Payne	d9e556d4c1	fix: Upgrade model to llama3.1:8b-instruct + fix git tool cwd Change 1: Model Upgrade (Primary Fix) - Changed default model from llama3.2 to llama3.1:8b-instruct - llama3.1:8b-instruct is fine-tuned for reliable tool/function calling - llama3.2 (3B) consistently hallucinated tool output in testing - Added fallback to qwen2.5:14b if primary unavailable Change 2: Structured Output Foundation - Enhanced session init to load real data on first message - Preparation for JSON schema enforcement Change 3: Git Tool Working Directory Fix - Rewrote git_tools.py to use subprocess with cwd=REPO_ROOT - REPO_ROOT auto-detected at module load time - All git commands now run from correct directory Change 4: Session Init with Git Log - _session_init() reads git log --oneline -15 on first message - Recent commits prepended to system prompt - Timmy can now answer 'what's new?' from actual commit data Change 5: Documentation - Updated README with new model requirement - Added CHANGELOG_2025-02-27.md User must run: ollama pull llama3.1:8b-instruct All 18 git tool tests pass.	2026-02-26 13:42:36 -05:00
Alexander Whitestone	f403d69bc1	Merge pull request #56 from AlexanderWhitestone/feature/hands-infrastructure-phase3 feat: Hands Infrastructure + 6 Autonomous Agents	2026-02-26 13:10:38 -05:00
Alexander Payne	9edcc627ea	docs: Update README with all 6 Hands Update Hands documentation to include Phase 5 additions: - Scout: hourly OSINT monitoring - Scribe: daily content production - Ledger: 6-hour treasury tracking - Weaver: weekly creative pipeline Total: 6 autonomous Hands using existing agent framework.	2026-02-26 13:09:03 -05:00
Alexander Payne	7b26922339	test: Phase 5 Hands tests Add comprehensive tests for new Hands: TestScoutHand: - Directory structure, TOML validity, SYSTEM.md - Registry loading TestScribeHand: - Same validation pattern TestLedgerHand: - Same validation pattern TestWeaverHand: - Same validation pattern TestPhase5Schedules: - Scout: hourly (0 * * * ) - Scribe: daily 9am (0 9 * ) - Ledger: every 6 hours (0 /6 * * ) - Weaver: Sunday 10am (0 10 * 0) TestPhase5ApprovalGates: - All 4 Hands have approval gates TestAllHandsLoad: - All 6 Hands load together 25 tests total, all passing.	2026-02-26 13:08:48 -05:00
Alexander Payne	a8f44c159e	feat: Phase 5 Additional Hands (Scout, Scribe, Ledger, Weaver) Add 4 new autonomous Hands using existing agent framework: Scout Hand (hands/scout/): - OSINT monitoring every hour - Monitors: HN, Reddit, RSS for Bitcoin/sovereign AI topics - Uses: web_search, rss_fetch, sentiment analysis Scribe Hand (hands/scribe/): - Content production daily at 9am - Produces: blog posts, docs, changelog - Uses: file ops, git tools, codebase indexer Ledger Hand (hands/ledger/): - Treasury tracking every 6 hours - Monitors: on-chain, Lightning balances, payment flows - Uses: lightning_balance, onchain_balance, payment_audit Weaver Hand (hands/weaver/): - Creative pipeline weekly on Sundays - Orchestrates: Pixel + Lyra + Reel for video production - Uses: creative_director, project management tools All Hands configured with: - HAND.toml manifests with schedules - SYSTEM.md prompts - Approval gates for write actions - Dashboard + Telegram output	2026-02-26 13:07:43 -05:00
Alexander Payne	b884884bad	docs: Hands documentation in README (Phase 4) Update README with Hands subsystem documentation: - Add Hands to 'What's built' table - New 'Hands — Autonomous Agents' section with: - Built-in Hands reference (Oracle, Sentinel) - Dashboard URL - HAND.toml example - Update project layout to include src/hands/ and hands/ - Update roadmap: Exodus now includes Hands Complete documentation for Phase 3-4 Hands infrastructure.	2026-02-26 12:58:21 -05:00
Alexander Payne	7508ef13c1	test: Oracle and Sentinel Hands tests (Phase 4) Add validation tests for the first two autonomous Hands: TestOracleHand: - Directory structure exists - HAND.toml is valid TOML with correct config - SYSTEM.md exists with proper content - Skills directory populated - Loads correctly in HandRegistry TestSentinelHand: - Same validation pattern as Oracle TestHandSchedules: - Oracle runs twice daily (7am, 7pm UTC) - Sentinel runs every 15 minutes TestHandApprovalGates: - Both Hands have approval gates configured - Safety model enforced 14 tests total, all passing.	2026-02-26 12:57:41 -05:00
Alexander Payne	1ba03e4ce2	feat: Oracle and Sentinel Hands (Phase 4) Add the first two autonomous Hands to validate infrastructure: Oracle Hand (hands/oracle/): - Bitcoin intelligence briefing, 2x daily (7am, 7pm) - Monitors: price action, on-chain metrics, macro context - Tools: mempool_fetch, fee_estimate, price_fetch, whale_alert - Output: Dashboard + Telegram, markdown format - Safety: Broadcast requires approval (5min auto) Sentinel Hand (hands/sentinel/): - System health monitoring, every 15 minutes - Monitors: dashboard, agents, database, disk, memory - Tools: system_stats, db_health, agent_status, disk_check - Output: Dashboard + Telegram, JSON format - Safety: Service restart requires approval (1min auto) Both include: - HAND.toml configuration with schedules - SYSTEM.md with complete prompts - skills/ directory with specialized knowledge - Approval gates for write actions	2026-02-26 12:57:07 -05:00
Alexander Whitestone	536a371d48	Merge pull request #55 from AlexanderWhitestone/feature/hands-infrastructure-phase3 Feature/hands infrastructure phase3	2026-02-26 12:55:28 -05:00
Alexander Payne	a1d00da2de	test: Hands infrastructure tests (Phase 3) Add comprehensive test suite for Hands framework: TestHandRegistry: - Load all Hands from directory - Get Hand by name (with not-found handling) - Get scheduled vs all Hands - State management (status updates) - Approval queue operations TestHandScheduler: - Scheduler initialization - Schedule Hand with cron - Get scheduled jobs list - Manual trigger execution TestHandRunner: - Load system prompts from SYSTEM.md - Load skills from skills/ directory - Build execution prompts TestHandConfig: - HandConfig creation and validation - Cron schedule validation TestHandModels: - HandStatus enum values - HandState serialization to dict 17 tests total, all passing.	2026-02-26 12:49:06 -05:00
Alexander Payne	d7aaae74d5	feat: Hands Dashboard Routes and UI (Phase 3.6) Add dashboard for managing autonomous Hands: Routes (src/dashboard/routes/hands.py): - GET /api/hands - List all Hands with status - GET /api/hands/{name} - Get Hand details - POST /api/hands/{name}/trigger - Manual trigger - POST /api/hands/{name}/pause - Pause scheduled Hand - POST /api/hands/{name}/resume - Resume paused Hand - GET /api/approvals - List pending approvals - POST /api/approvals/{id}/approve - Approve request - POST /api/approvals/{id}/reject - Reject request - GET /api/executions - List execution history Templates: - hands.html - Main dashboard page - partials/hands_list.html - Active Hands list - partials/approvals_list.html - Pending approvals - partials/hand_executions.html - Execution history Integration: - Wired up in app.py - Navigation links in base.html	2026-02-26 12:46:48 -05:00
Alexander Payne	73cf780656	feat: HandRunner and hands module init (Phase 3.5) Add HandRunner for executing Hands: - hands/runner.py: Hand execution engine - Load SYSTEM.md and SKILL.md files - Inject domain expertise into LLM context - Check and handle approval gates - Execute tool loop with LLM - Deliver output to dashboard/channel/file - Log execution records - hands/__init__.py: Module exports - Export all public classes and models - Usage documentation The HandRunner completes the core Hands infrastructure.	2026-02-26 12:43:40 -05:00
Alexander Payne	8a952f6818	feat: Hands Infrastructure - Models, Registry, Scheduler (Phase 3.1-3.3) Add core Hands infrastructure: - hands/models.py: Pydantic models for HAND.toml schema - HandConfig: Complete hand configuration - HandState: Runtime state tracking - HandExecution: Execution records - ApprovalRequest: Approval queue entries - hands/registry.py: HandRegistry for loading and indexing - Load Hands from hands/ directory - Parse HAND.toml manifests - SQLite indexing for fast lookup - Approval queue management - Execution history logging - hands/scheduler.py: APScheduler-based scheduling - Cron and interval triggers - Job management (schedule, pause, resume, unschedule) - Hand execution wrapper - Manual trigger support	2026-02-26 12:41:52 -05:00
Alexander Whitestone	7de9db32ea	Merge pull request #54 from AlexanderWhitestone/feature/self-coding-rebased Feature/self coding rebased	2026-02-26 12:36:37 -05:00
Alexander Payne	4d3995012a	test: Self-Coding Dashboard Tests Add tests for dashboard routes: - Page routes (main page, journal partial, stats partial, execute form) - API routes (journal list/detail, stats, codebase summary/reindex) - Execute endpoints (API and HTMX) - Navigation integration (link in header) Tests verify endpoints return correct status codes and content types.	2026-02-26 12:28:30 -05:00
Alexander Payne	62365cc9b2	feat: Wire up Self-Coding Dashboard Integrate self-coding routes into dashboard: Changes: - Add import for self_coding_router in app.py - Include self_coding_router in FastAPI app - Add SELF-CODING link to desktop navigation - Add SELF-CODING link to mobile navigation The self-coding dashboard is now accessible at /self-coding	2026-02-26 12:28:30 -05:00
Alexander Payne	e81be8aed7	feat: Self-Coding Dashboard HTMX Templates Add complete UI for self-coding dashboard: Templates: - self_coding.html - Main dashboard page with layout - partials/self_coding_stats.html - Stats cards (total, success rate, etc) - partials/journal_entries.html - List of modification attempts - partials/journal_entry_detail.html - Expanded view of single attempt - partials/execute_form.html - Task execution form - partials/execute_result.html - Execution result display - partials/error.html - Error message display Features: - HTMX-powered dynamic updates - Real-time journal filtering (all/success/failure) - Modal dialog for task execution - Responsive Bootstrap 5 styling - Automatic refresh after successful execution	2026-02-26 12:28:05 -05:00
Alexander Payne	cb70cb392a	feat: Self-Coding Dashboard API Routes Add FastAPI routes for self-coding dashboard: API Endpoints: - GET /api/journal - List modification journal entries - GET /api/journal/{id} - Get detailed attempt info - GET /api/stats - Get success rate metrics - POST /api/execute - Execute self-edit task - GET /api/codebase/summary - Get codebase summary - POST /api/codebase/reindex - Trigger reindex HTMX Partials: - GET /self-coding/ - Main dashboard page - GET /self-coding/journal - Journal entries list - GET /self-coding/stats - Stats cards - GET /self-coding/execute-form - Task execution form - POST /self-coding/execute - Execute task endpoint - GET /journal/{id}/detail - Entry detail view	2026-02-26 12:28:05 -05:00
Alexander Payne	49ca4dad43	feat: Self-Edit MCP Tool (Phase 2.1) Implements the Self-Edit MCP Tool that orchestrates the self-coding foundation: ## Core Features 1. SelfEditTool (src/tools/self_edit.py) - Complete self-modification orchestrator - Pre-flight safety checks (clean repo, on main branch) - Context gathering (codebase indexer + modification journal) - Feature branch creation (timmy/self-edit/{timestamp}) - LLM-based edit planning with fallback - Safety constraint validation - Aider integration (preferred) with fallback to direct editing - Automatic test execution via pytest - Commit on success, rollback on failure - Modification journaling with reflections 2. Safety Constraints - Max 3 files per commit - Max 100 lines changed - Protected files list (self-edit tool, foundation services) - Only modify files with test coverage - Max 3 retries on failure - Requires user confirmation (MCP tool registration) 3. Execution Backends - Aider integration: --auto-test --test-cmd pytest --yes --no-git - Direct editing fallback: LLM-based file modification with AST validation - Automatic backend selection based on availability ## Test Coverage - 19 new tests covering: - Basic functionality (initialization, preflight checks) - Edit planning (with/without LLM) - Safety validation (file limits, protected files) - Execution flow (success and failure paths) - Error handling (exceptions, LLM failures) - MCP registration ## Usage from tools.self_edit import register_self_edit_tool from mcp.registry import tool_registry # Register with MCP register_self_edit_tool(tool_registry, llm_adapter) Phase 2.2 will add Dashboard API endpoints and UI.	2026-02-26 12:28:05 -05:00
Alexander Whitestone	bb13052da2	Merge pull request #53 from AlexanderWhitestone/claude/sovereign-biblical-ai-design-0nuHW Add scripture module: ESV text storage, parsing, and meditation	2026-02-26 12:16:56 -05:00
Claude	485b704145	chore: include pre-existing self-modify report artifacts https://claude.ai/code/session_015wv7FM6BFsgZ35Us6WeY7H	2026-02-26 17:07:01 +00:00
Claude	63bbe2a288	feat: add sovereign biblical text integration module (scripture) Implement the core scripture module for local-first ESV text storage, verse retrieval, reference parsing, original language support, cross-referencing, topical mapping, and automated meditation workflows. Architecture: - scripture/constants.py: 66-book Protestant canon with aliases and metadata - scripture/models.py: Pydantic models with integer-encoded verse IDs - scripture/parser.py: Regex-based reference extraction and formatting - scripture/store.py: SQLite-backed verse/xref/topic/Strong's storage - scripture/memory.py: Tripartite memory (working/long-term/associative) - scripture/meditation.py: Sequential/thematic/lectionary meditation scheduler - dashboard/routes/scripture.py: REST endpoints for all scripture operations - config.py: scripture_enabled, translation, meditation settings - 95 comprehensive tests covering all modules and routes https://claude.ai/code/session_015wv7FM6BFsgZ35Us6WeY7H	2026-02-26 17:06:00 +00:00
Alexander Whitestone	166e9f7544	Merge pull request #52 from AlexanderWhitestone/fix/chat-eval-bugs Fix chat evaluation bugs: task pipeline, prompt grounding, markdown rendering	2026-02-26 11:47:51 -05:00
Alexander Payne	431cf3e020	merge: resolve conflicts with main, keep comprehensive chat pipeline Resolved merge conflicts in agents.py and test_task_queue.py: - Keep full chat-to-task pipeline (agent/priority extraction, question filtering, context injection) over simpler main version - Incorporate test_briefing_task_queue_summary from main - All 64 task queue tests pass Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 11:47:34 -05:00
Alexander Payne	3ca8e9f2d6	fix: chat evaluation bugs — task pipeline, prompt grounding, markdown rendering Addresses 14 bugs from 3 rounds of deep chat evaluation: - Add chat-to-task pipeline in agents.py with regex-based intent detection, agent extraction, priority extraction, and title cleaning - Filter meta-questions ("how do I create a task?") from task creation - Inject real-time date/time context into every chat message - Inject live queue state when user asks about tasks - Ground system prompts with agent roster, honesty guardrails, self-knowledge, math delegation template, anti-filler rules, values-conflict guidance - Add CSS for markdown code blocks, inline code, lists, blockquotes in chat - Add highlight.js CDN for syntax highlighting in chat responses - Reduce small-model memory context budget (4000→2000) for expanded prompt - Add 27 comprehensive tests covering the full chat-to-task pipeline Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 11:42:42 -05:00
Alexander Whitestone	32ad43a61a	Merge pull request #51 from AlexanderWhitestone/feature/task-queue-and-ui-fixes feat: wire chat-to-task-queue and briefing integration	2026-02-26 11:31:25 -05:00
Alexander Whitestone	13018ea04c	Merge pull request #50 from AlexanderWhitestone/feature/self-coding-phase1-clean feat: Self-Coding Foundation (Phase 1)	2026-02-26 11:30:41 -05:00
Alexander Payne	18bc64b36d	feat: Self-Coding Foundation (Phase 1) Implements the foundational infrastructure for Timmy's self-modification capability: ## New Services 1. GitSafety (src/self_coding/git_safety.py) - Atomic git operations with rollback capability - Snapshot/restore for safe experimentation - Feature branch management (timmy/self-edit/{timestamp}) - Merge to main only after tests pass 2. CodebaseIndexer (src/self_coding/codebase_indexer.py) - AST-based parsing of Python source files - Extracts classes, functions, imports, docstrings - Builds dependency graph for blast radius analysis - SQLite storage with hash-based incremental indexing - get_summary() for LLM context (<4000 tokens) - get_relevant_files() for task-based file discovery 3. ModificationJournal (src/self_coding/modification_journal.py) - Persistent log of all self-modification attempts - Tracks outcomes: success, failure, rollback - find_similar() for learning from past attempts - Success rate metrics and recent failure tracking - Supports vector embeddings (Phase 2) 4. ReflectionService (src/self_coding/reflection.py) - LLM-powered analysis of modification attempts - Generates lessons learned from successes and failures - Fallback templates when LLM unavailable - Supports context from similar past attempts ## Test Coverage - 104 new tests across 7 test files - 95% code coverage on self_coding module - Green path tests: full workflow integration - Red path tests: errors, rollbacks, edge cases - Safety constraint tests: test coverage requirements, protected files ## Usage from self_coding import GitSafety, CodebaseIndexer, ModificationJournal git = GitSafety(repo_path=/path/to/repo) indexer = CodebaseIndexer(repo_path=/path/to/repo) journal = ModificationJournal() Phase 2 will build the Self-Edit MCP Tool that orchestrates these services.	2026-02-26 11:08:05 -05:00
Alexander Payne	bc9089ef96	feat: wire chat-to-task-queue and briefing integration - Chat messages like "add X to the queue" or "create a task" are intercepted and create a task_queue entry with pending_approval status instead of going through to the LLM - Briefing engine now gathers task queue stats (pending, running, completed, failed) and includes them in the morning briefing prompt - 7 new tests covering detection patterns, chat integration, and briefing summary Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 10:33:14 -05:00
Alexander Whitestone	6c6b6f8a54	Merge pull request #49 from AlexanderWhitestone/feature/task-queue-and-ui-fixes feat: task queue + work orders + UI bug fixes	2026-02-26 10:28:35 -05:00
Alexander Payne	5f9bbb8435	feat: add task queue with human-in-the-loop approval + work orders + UI bug fixes Task Queue system: - New /tasks page with three-column layout (Pending/Active/Completed) - Full CRUD API at /api/tasks with approve/veto/modify/pause/cancel/retry - SQLite persistence in task_queue table - WebSocket live updates via ws_manager - Create task modal with agent assignment and priority - Auto-approve rules for low-risk tasks - HTMX polling for real-time column updates - HOME TASK buttons now link to task queue with agent pre-selected - MARKET HIRE buttons link to task queue with agent pre-selected Work Order system: - External submission API for agents/users (POST /work-orders/submit) - Risk scoring and configurable auto-execution thresholds - Dashboard at /work-orders/queue with approve/reject/execute flow - Integration with swarm task system for execution UI & Dashboard bug fixes: - EVENTS: add startup event so page is never empty - LEDGER: fix empty filter params in URL - MISSION CONTROL: LLM backend and model now read from /health - MISSION CONTROL: agent count fallback to /swarm/agents - SWARM: HTMX fallback loads initial data if WebSocket is slow - MEMORY: add edit/delete buttons for personal facts - UPGRADES: add empty state guidance with links - BRIEFING: add regenerate button and POST /briefing/regenerate endpoint Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 10:27:08 -05:00
Alexander Whitestone	4e78f7102e	Merge pull request #48 from AlexanderWhitestone/fix/timmy-startup-and-stability fix: Timmy QA bugs — calculator, markdown, prompt guardrails, briefing	2026-02-26 09:44:33 -05:00
Alexander Payne	6e6b4355bb	fix: calculator tool, markdown rendering, prompt guardrails, briefing notification - Add sandboxed calculator tool to Timmy's toolkit so arithmetic questions get exact answers instead of LLM hallucinations - Update system prompts (lite + full) to instruct Timmy to always use the calculator and never attempt multi-digit math in his head - Add self-contradiction guard to both prompts ("commit to your facts") - Render Timmy's chat responses as markdown via marked.js + DOMPurify instead of raw escaped text - Suppress empty briefing notification on startup when there are 0 pending approval items - Add calculator to session response sanitizer regex - 18 new calculator tests, 2 updated briefing notification tests Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 09:35:59 -05:00
Alexander Whitestone	e6a7db7d80	Merge pull request #47 from AlexanderWhitestone/fix/timmy-startup-and-stability fix: Timmy startup crashes and clean initialization	2026-02-26 09:21:23 -05:00
Alexander Payne	05d4dc997c	fix: chat panel scroll — internal scroll on #chat-log, auto-scroll on new messages - Set overflow:hidden on mc-main to prevent page-level scrolling - Add max-height:100% to sidebar and chat panel to contain within viewport - Use flex-wrap:nowrap on layout row to prevent column stacking on desktop - Move scrollChat() to hx-on::after-settle for reliable post-swap scrolling - Use requestAnimationFrame for smooth scroll-to-bottom timing Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 09:15:40 -05:00
Alexander Payne	f95c9606f1	fix: Timmy startup crashes and clean initialization - Remove show_tool_calls kwarg (not in Agno 2.5.3), which crashed Agent.__init__ - Guard memory_search against top_k=None from model, return formatted string - Skip Telegram/Discord startup silently when no token configured - Replace placeholder MEMORY.md with proper structured hot memory document Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 09:11:48 -05:00
Alexander Whitestone	dccd13df8e	Merge pull request #46 from AlexanderWhitestone/feature/memory-layers-and-conversational-ai feat: Event Log, Ledger, Memory, Cascade Router, Upgrade Queue, Activity Feed	2026-02-26 08:33:32 -05:00
Alexander Payne	06a15bb3f2	test: add missing fixtures for functional tests Add fixtures required by functional test suite: - docker_stack: Docker container test URL (skips if FUNCTIONAL_DOCKER != 1) - serve_client: FastAPI TestClient for timmy-serve app - tdd_runner: Alias for self_tdd_runner Fixes CI errors in test_docker_swarm.py, test_l402_flow.py, test_cli.py	2026-02-26 08:30:04 -05:00
Alexander Payne	96ed82d81e	fix: memory route bug + fast E2E tests under 10 seconds - Fix recall_personal_facts() call - remove unsupported limit parameter - Replace 4 slow E2E test files with single fast test file - All 6 E2E tests complete in ~9 seconds (was 60+ seconds) - Reuse browser session across tests (module-scoped fixture) - Combine related checks into single tests - Add HTTP-only smoke test for speed	2026-02-26 08:08:32 -05:00
Alexander Payne	d8d976aa60	feat: complete Event Log, Ledger, Memory, Cascade Router, Upgrade Queue, Activity Feed This commit implements six major features: 1. Event Log System (src/swarm/event_log.py) - SQLite-based audit trail for all swarm events - Task lifecycle tracking (created, assigned, completed, failed) - Agent lifecycle tracking (joined, left, status changes) - Integrated with coordinator for automatic logging - Dashboard page at /swarm/events 2. Lightning Ledger (src/lightning/ledger.py) - Transaction tracking for Lightning Network payments - Balance calculations (incoming, outgoing, net, available) - Integrated with payment_handler for automatic logging - Dashboard page at /lightning/ledger 3. Semantic Memory / Vector Store (src/memory/vector_store.py) - Embedding-based similarity search for Echo agent - Fallback to keyword matching if sentence-transformers unavailable - Personal facts storage and retrieval - Dashboard page at /memory 4. Cascade Router Integration (src/timmy/cascade_adapter.py) - Automatic LLM failover between providers (Ollama → AirLLM → API) - Circuit breaker pattern for failing providers - Metrics tracking per provider (latency, error rates) - Dashboard status page at /router/status 5. Self-Upgrade Approval Queue (src/upgrades/) - State machine for self-modifications: proposed → approved/rejected → applied/failed - Human approval required before applying changes - Git integration for branch management - Dashboard queue at /self-modify/queue 6. Real-Time Activity Feed (src/events/broadcaster.py) - WebSocket-based live activity streaming - Bridges event_log to dashboard clients - Activity panel on /swarm/live Tests: - 101 unit tests passing - 4 new E2E test files for Selenium testing - Run with: SELENIUM_UI=1 pytest tests/functional/ -v --headed Documentation: - 6 ADRs (017-022) documenting architecture decisions - Implementation summary in docs/IMPLEMENTATION_SUMMARY.md - Architecture diagram in docs/architecture-v2.md	2026-02-26 08:01:01 -05:00
Alexander Whitestone	9d1f2f6b85	Merge pull request #45 from AlexanderWhitestone/fix/xss-vulnerabilities-in-templates	2026-02-26 05:58:12 -05:00
AlexanderWhitestone	930ec9eb80	Security: Fix XSS vulnerabilities in dashboard templates and improve mobile test UI safety	2026-02-26 02:07:54 -05:00
Alexander Payne	8d85f95ee5	Fix router disabled provider check + comprehensive functional tests Fixes: - Router now properly skips disabled providers in complete() method - Fixed avg_latency calculation comment in tests (now correctly documents behavior) New Test Suites: - tests/test_functional_router.py: 10 functional tests for router - tests/test_functional_mcp.py: 15 functional tests for MCP discovery/bootstrap - tests/test_integration_full.py: 14 end-to-end integration tests Total: 39 new functional/integration tests All 144 tests passing (105 router/mcp + 39 functional/integration)	2026-02-25 20:22:51 -05:00
Alexander Whitestone	3792bf16cf	Merge pull request #44 from AlexanderWhitestone/feature/memory-layers-and-conversational-ai Phase 3-4: Cascade LLM Router + Tool Registry Auto-Discovery	2026-02-25 20:04:30 -05:00
Alexander Payne	56437751d3	Phase 4: Tool Registry Auto-Discovery - @mcp_tool decorator for marking functions as tools - ToolDiscovery class for introspecting modules and packages - Automatic JSON schema generation from type hints - AST-based discovery for files (without importing) - Auto-bootstrap on startup (packages=['tools'] by default) - Support for tags, categories, and metadata - Updated registry with register_tool() convenience method - Environment variable MCP_AUTO_BOOTSTRAP to disable - 39 tests with proper isolation and cleanup Files Added: - src/mcp/discovery.py: Tool discovery and introspection - src/mcp/bootstrap.py: Auto-bootstrap functionality - tests/test_mcp_discovery.py: 26 tests - tests/test_mcp_bootstrap.py: 13 tests Files Modified: - src/mcp/registry.py: Added tags, source_module, auto_discovered fields - src/mcp/__init__.py: Export discovery and bootstrap modules - src/dashboard/app.py: Auto-bootstrap on startup	2026-02-25 19:59:42 -05:00

... 12 13 14 15 16 ...

822 Commits