Timmy-time-dashboard

Archived

forked from Rockachopa/Timmy-time-dashboard

Author	SHA1	Message	Date
Alexander Whitestone	bb13052da2	Merge pull request #53 from AlexanderWhitestone/claude/sovereign-biblical-ai-design-0nuHW Add scripture module: ESV text storage, parsing, and meditation	2026-02-26 12:16:56 -05:00
Claude	485b704145	chore: include pre-existing self-modify report artifacts https://claude.ai/code/session_015wv7FM6BFsgZ35Us6WeY7H	2026-02-26 17:07:01 +00:00
Claude	63bbe2a288	feat: add sovereign biblical text integration module (scripture) Implement the core scripture module for local-first ESV text storage, verse retrieval, reference parsing, original language support, cross-referencing, topical mapping, and automated meditation workflows. Architecture: - scripture/constants.py: 66-book Protestant canon with aliases and metadata - scripture/models.py: Pydantic models with integer-encoded verse IDs - scripture/parser.py: Regex-based reference extraction and formatting - scripture/store.py: SQLite-backed verse/xref/topic/Strong's storage - scripture/memory.py: Tripartite memory (working/long-term/associative) - scripture/meditation.py: Sequential/thematic/lectionary meditation scheduler - dashboard/routes/scripture.py: REST endpoints for all scripture operations - config.py: scripture_enabled, translation, meditation settings - 95 comprehensive tests covering all modules and routes https://claude.ai/code/session_015wv7FM6BFsgZ35Us6WeY7H	2026-02-26 17:06:00 +00:00
Alexander Whitestone	166e9f7544	Merge pull request #52 from AlexanderWhitestone/fix/chat-eval-bugs Fix chat evaluation bugs: task pipeline, prompt grounding, markdown rendering	2026-02-26 11:47:51 -05:00
Alexander Payne	431cf3e020	merge: resolve conflicts with main, keep comprehensive chat pipeline Resolved merge conflicts in agents.py and test_task_queue.py: - Keep full chat-to-task pipeline (agent/priority extraction, question filtering, context injection) over simpler main version - Incorporate test_briefing_task_queue_summary from main - All 64 task queue tests pass Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 11:47:34 -05:00
Alexander Payne	3ca8e9f2d6	fix: chat evaluation bugs — task pipeline, prompt grounding, markdown rendering Addresses 14 bugs from 3 rounds of deep chat evaluation: - Add chat-to-task pipeline in agents.py with regex-based intent detection, agent extraction, priority extraction, and title cleaning - Filter meta-questions ("how do I create a task?") from task creation - Inject real-time date/time context into every chat message - Inject live queue state when user asks about tasks - Ground system prompts with agent roster, honesty guardrails, self-knowledge, math delegation template, anti-filler rules, values-conflict guidance - Add CSS for markdown code blocks, inline code, lists, blockquotes in chat - Add highlight.js CDN for syntax highlighting in chat responses - Reduce small-model memory context budget (4000→2000) for expanded prompt - Add 27 comprehensive tests covering the full chat-to-task pipeline Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 11:42:42 -05:00
Alexander Whitestone	32ad43a61a	Merge pull request #51 from AlexanderWhitestone/feature/task-queue-and-ui-fixes feat: wire chat-to-task-queue and briefing integration	2026-02-26 11:31:25 -05:00
Alexander Whitestone	13018ea04c	Merge pull request #50 from AlexanderWhitestone/feature/self-coding-phase1-clean feat: Self-Coding Foundation (Phase 1)	2026-02-26 11:30:41 -05:00
Alexander Payne	18bc64b36d	feat: Self-Coding Foundation (Phase 1) Implements the foundational infrastructure for Timmy's self-modification capability: ## New Services 1. GitSafety (src/self_coding/git_safety.py) - Atomic git operations with rollback capability - Snapshot/restore for safe experimentation - Feature branch management (timmy/self-edit/{timestamp}) - Merge to main only after tests pass 2. CodebaseIndexer (src/self_coding/codebase_indexer.py) - AST-based parsing of Python source files - Extracts classes, functions, imports, docstrings - Builds dependency graph for blast radius analysis - SQLite storage with hash-based incremental indexing - get_summary() for LLM context (<4000 tokens) - get_relevant_files() for task-based file discovery 3. ModificationJournal (src/self_coding/modification_journal.py) - Persistent log of all self-modification attempts - Tracks outcomes: success, failure, rollback - find_similar() for learning from past attempts - Success rate metrics and recent failure tracking - Supports vector embeddings (Phase 2) 4. ReflectionService (src/self_coding/reflection.py) - LLM-powered analysis of modification attempts - Generates lessons learned from successes and failures - Fallback templates when LLM unavailable - Supports context from similar past attempts ## Test Coverage - 104 new tests across 7 test files - 95% code coverage on self_coding module - Green path tests: full workflow integration - Red path tests: errors, rollbacks, edge cases - Safety constraint tests: test coverage requirements, protected files ## Usage from self_coding import GitSafety, CodebaseIndexer, ModificationJournal git = GitSafety(repo_path=/path/to/repo) indexer = CodebaseIndexer(repo_path=/path/to/repo) journal = ModificationJournal() Phase 2 will build the Self-Edit MCP Tool that orchestrates these services.	2026-02-26 11:08:05 -05:00
Alexander Payne	bc9089ef96	feat: wire chat-to-task-queue and briefing integration - Chat messages like "add X to the queue" or "create a task" are intercepted and create a task_queue entry with pending_approval status instead of going through to the LLM - Briefing engine now gathers task queue stats (pending, running, completed, failed) and includes them in the morning briefing prompt - 7 new tests covering detection patterns, chat integration, and briefing summary Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 10:33:14 -05:00
Alexander Whitestone	6c6b6f8a54	Merge pull request #49 from AlexanderWhitestone/feature/task-queue-and-ui-fixes feat: task queue + work orders + UI bug fixes	2026-02-26 10:28:35 -05:00
Alexander Payne	5f9bbb8435	feat: add task queue with human-in-the-loop approval + work orders + UI bug fixes Task Queue system: - New /tasks page with three-column layout (Pending/Active/Completed) - Full CRUD API at /api/tasks with approve/veto/modify/pause/cancel/retry - SQLite persistence in task_queue table - WebSocket live updates via ws_manager - Create task modal with agent assignment and priority - Auto-approve rules for low-risk tasks - HTMX polling for real-time column updates - HOME TASK buttons now link to task queue with agent pre-selected - MARKET HIRE buttons link to task queue with agent pre-selected Work Order system: - External submission API for agents/users (POST /work-orders/submit) - Risk scoring and configurable auto-execution thresholds - Dashboard at /work-orders/queue with approve/reject/execute flow - Integration with swarm task system for execution UI & Dashboard bug fixes: - EVENTS: add startup event so page is never empty - LEDGER: fix empty filter params in URL - MISSION CONTROL: LLM backend and model now read from /health - MISSION CONTROL: agent count fallback to /swarm/agents - SWARM: HTMX fallback loads initial data if WebSocket is slow - MEMORY: add edit/delete buttons for personal facts - UPGRADES: add empty state guidance with links - BRIEFING: add regenerate button and POST /briefing/regenerate endpoint Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 10:27:08 -05:00
Alexander Whitestone	4e78f7102e	Merge pull request #48 from AlexanderWhitestone/fix/timmy-startup-and-stability fix: Timmy QA bugs — calculator, markdown, prompt guardrails, briefing	2026-02-26 09:44:33 -05:00
Alexander Payne	6e6b4355bb	fix: calculator tool, markdown rendering, prompt guardrails, briefing notification - Add sandboxed calculator tool to Timmy's toolkit so arithmetic questions get exact answers instead of LLM hallucinations - Update system prompts (lite + full) to instruct Timmy to always use the calculator and never attempt multi-digit math in his head - Add self-contradiction guard to both prompts ("commit to your facts") - Render Timmy's chat responses as markdown via marked.js + DOMPurify instead of raw escaped text - Suppress empty briefing notification on startup when there are 0 pending approval items - Add calculator to session response sanitizer regex - 18 new calculator tests, 2 updated briefing notification tests Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 09:35:59 -05:00
Alexander Whitestone	e6a7db7d80	Merge pull request #47 from AlexanderWhitestone/fix/timmy-startup-and-stability fix: Timmy startup crashes and clean initialization	2026-02-26 09:21:23 -05:00
Alexander Payne	05d4dc997c	fix: chat panel scroll — internal scroll on #chat-log, auto-scroll on new messages - Set overflow:hidden on mc-main to prevent page-level scrolling - Add max-height:100% to sidebar and chat panel to contain within viewport - Use flex-wrap:nowrap on layout row to prevent column stacking on desktop - Move scrollChat() to hx-on::after-settle for reliable post-swap scrolling - Use requestAnimationFrame for smooth scroll-to-bottom timing Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 09:15:40 -05:00
Alexander Payne	f95c9606f1	fix: Timmy startup crashes and clean initialization - Remove show_tool_calls kwarg (not in Agno 2.5.3), which crashed Agent.__init__ - Guard memory_search against top_k=None from model, return formatted string - Skip Telegram/Discord startup silently when no token configured - Replace placeholder MEMORY.md with proper structured hot memory document Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 09:11:48 -05:00
Alexander Whitestone	dccd13df8e	Merge pull request #46 from AlexanderWhitestone/feature/memory-layers-and-conversational-ai feat: Event Log, Ledger, Memory, Cascade Router, Upgrade Queue, Activity Feed	2026-02-26 08:33:32 -05:00
Alexander Payne	06a15bb3f2	test: add missing fixtures for functional tests Add fixtures required by functional test suite: - docker_stack: Docker container test URL (skips if FUNCTIONAL_DOCKER != 1) - serve_client: FastAPI TestClient for timmy-serve app - tdd_runner: Alias for self_tdd_runner Fixes CI errors in test_docker_swarm.py, test_l402_flow.py, test_cli.py	2026-02-26 08:30:04 -05:00
Alexander Payne	96ed82d81e	fix: memory route bug + fast E2E tests under 10 seconds - Fix recall_personal_facts() call - remove unsupported limit parameter - Replace 4 slow E2E test files with single fast test file - All 6 E2E tests complete in ~9 seconds (was 60+ seconds) - Reuse browser session across tests (module-scoped fixture) - Combine related checks into single tests - Add HTTP-only smoke test for speed	2026-02-26 08:08:32 -05:00
Alexander Payne	d8d976aa60	feat: complete Event Log, Ledger, Memory, Cascade Router, Upgrade Queue, Activity Feed This commit implements six major features: 1. Event Log System (src/swarm/event_log.py) - SQLite-based audit trail for all swarm events - Task lifecycle tracking (created, assigned, completed, failed) - Agent lifecycle tracking (joined, left, status changes) - Integrated with coordinator for automatic logging - Dashboard page at /swarm/events 2. Lightning Ledger (src/lightning/ledger.py) - Transaction tracking for Lightning Network payments - Balance calculations (incoming, outgoing, net, available) - Integrated with payment_handler for automatic logging - Dashboard page at /lightning/ledger 3. Semantic Memory / Vector Store (src/memory/vector_store.py) - Embedding-based similarity search for Echo agent - Fallback to keyword matching if sentence-transformers unavailable - Personal facts storage and retrieval - Dashboard page at /memory 4. Cascade Router Integration (src/timmy/cascade_adapter.py) - Automatic LLM failover between providers (Ollama → AirLLM → API) - Circuit breaker pattern for failing providers - Metrics tracking per provider (latency, error rates) - Dashboard status page at /router/status 5. Self-Upgrade Approval Queue (src/upgrades/) - State machine for self-modifications: proposed → approved/rejected → applied/failed - Human approval required before applying changes - Git integration for branch management - Dashboard queue at /self-modify/queue 6. Real-Time Activity Feed (src/events/broadcaster.py) - WebSocket-based live activity streaming - Bridges event_log to dashboard clients - Activity panel on /swarm/live Tests: - 101 unit tests passing - 4 new E2E test files for Selenium testing - Run with: SELENIUM_UI=1 pytest tests/functional/ -v --headed Documentation: - 6 ADRs (017-022) documenting architecture decisions - Implementation summary in docs/IMPLEMENTATION_SUMMARY.md - Architecture diagram in docs/architecture-v2.md	2026-02-26 08:01:01 -05:00
Alexander Whitestone	9d1f2f6b85	Merge pull request #45 from AlexanderWhitestone/fix/xss-vulnerabilities-in-templates	2026-02-26 05:58:12 -05:00
AlexanderWhitestone	930ec9eb80	Security: Fix XSS vulnerabilities in dashboard templates and improve mobile test UI safety	2026-02-26 02:07:54 -05:00
Alexander Payne	8d85f95ee5	Fix router disabled provider check + comprehensive functional tests Fixes: - Router now properly skips disabled providers in complete() method - Fixed avg_latency calculation comment in tests (now correctly documents behavior) New Test Suites: - tests/test_functional_router.py: 10 functional tests for router - tests/test_functional_mcp.py: 15 functional tests for MCP discovery/bootstrap - tests/test_integration_full.py: 14 end-to-end integration tests Total: 39 new functional/integration tests All 144 tests passing (105 router/mcp + 39 functional/integration)	2026-02-25 20:22:51 -05:00
Alexander Whitestone	3792bf16cf	Merge pull request #44 from AlexanderWhitestone/feature/memory-layers-and-conversational-ai Phase 3-4: Cascade LLM Router + Tool Registry Auto-Discovery	2026-02-25 20:04:30 -05:00
Alexander Payne	56437751d3	Phase 4: Tool Registry Auto-Discovery - @mcp_tool decorator for marking functions as tools - ToolDiscovery class for introspecting modules and packages - Automatic JSON schema generation from type hints - AST-based discovery for files (without importing) - Auto-bootstrap on startup (packages=['tools'] by default) - Support for tags, categories, and metadata - Updated registry with register_tool() convenience method - Environment variable MCP_AUTO_BOOTSTRAP to disable - 39 tests with proper isolation and cleanup Files Added: - src/mcp/discovery.py: Tool discovery and introspection - src/mcp/bootstrap.py: Auto-bootstrap functionality - tests/test_mcp_discovery.py: 26 tests - tests/test_mcp_bootstrap.py: 13 tests Files Modified: - src/mcp/registry.py: Added tags, source_module, auto_discovered fields - src/mcp/__init__.py: Export discovery and bootstrap modules - src/dashboard/app.py: Auto-bootstrap on startup	2026-02-25 19:59:42 -05:00
Alexander Payne	c658ca829c	Phase 3: Cascade LLM Router with automatic failover - YAML-based provider configuration (config/providers.yaml) - Priority-ordered provider routing - Circuit breaker pattern for failing providers - Health check and availability monitoring - Metrics tracking (latency, errors, success rates) - Support for Ollama, OpenAI, Anthropic, AirLLM providers - Automatic failover on rate limits or errors - REST API endpoints for monitoring and control - 41 comprehensive tests API Endpoints: - POST /api/v1/router/complete - Chat completion with failover - GET /api/v1/router/status - Provider health status - GET /api/v1/router/metrics - Detailed metrics - GET /api/v1/router/providers - List all providers - POST /api/v1/router/providers/{name}/control - Enable/disable/reset - POST /api/v1/router/health-check - Run health checks - GET /api/v1/router/config - View configuration	2026-02-25 19:43:43 -05:00
Alexander Payne	a719c7538d	Implement MCP system, Event Bus, and Sub-Agents ## 1. MCP (Model Context Protocol) Implementation ### Registry (src/mcp/registry.py) - Tool registration with JSON schemas - Dynamic tool discovery - Health tracking per tool - Metrics collection (latency, error rates) - @register_tool decorator for easy registration ### Server (src/mcp/server.py) - MCPServer class implementing MCP protocol - MCPHTTPServer for FastAPI integration - Standard endpoints: list_tools, call_tool, get_schema ### Schemas (src/mcp/schemas/base.py) - create_tool_schema() helper - Common parameter types - Standard return types ### Bootstrap (src/mcp/bootstrap.py) - Automatic tool module loading - Status reporting ## 2. MCP-Compliant Tools (src/tools/) \| Tool \| Purpose \| Category \| \|------\|---------\|----------\| \| web_search \| DuckDuckGo search \| research \| \| read_file \| File reading \| files \| \| write_file \| File writing (confirmation) \| files \| \| list_directory \| Directory listing \| files \| \| python \| Python code execution \| code \| \| memory_search \| Vector memory search \| memory \| All tools have proper schemas, error handling, and MCP registration. ## 3. Event Bus (src/events/bus.py) - Async publish/subscribe pattern - Pattern matching with wildcards (agent.task.*) - Event history tracking - Concurrent handler execution - Module-level singleton for system-wide use ## 4. Sub-Agents (src/agents/) All agents inherit from BaseAgent with: - Agno Agent integration - MCP tool registry access - Event bus connectivity - Structured logging ### Agent Roster \| Agent \| Role \| Tools \| Purpose \| \|-------\|------\|-------\|---------\| \| Seer \| Research \| web_search, read_file, memory_search \| Information gathering \| \| Forge \| Code \| python, write_file, read_file \| Code generation \| \| Quill \| Writing \| write_file, read_file, memory_search \| Content creation \| \| Echo \| Memory \| memory_search, read_file, write_file \| Context retrieval \| \| Helm \| Routing \| memory_search \| Task routing decisions \| \| Timmy \| Orchestrator \| All tools \| Coordination & user interface \| ### Timmy Orchestrator - Analyzes user requests - Routes to appropriate sub-agent - Handles direct queries - Manages swarm coordination - create_timmy_swarm() factory function ## 5. Integration All components wired together: - Tools auto-register on import - Agents connect to event bus - MCP server provides HTTP API - Ready for dashboard integration ## Tests - All 973 existing tests pass - New components tested manually - Import verification successful Next steps: Cascade Router, Self-Upgrade Loop, Dashboard integration	2026-02-25 19:26:24 -05:00
Alexander Whitestone	fcaf9260ca	Merge pull request #43 from AlexanderWhitestone/claude/condescending-vaughan Fix Timmy coherence: persistent sessions, model-aware tools, response sanitization	2026-02-25 19:19:43 -05:00
Alexander Payne	26e1691099	Fix Timmy coherence: persistent session, model-aware tools, response sanitization Timmy was exhibiting severe incoherence (no memory between messages, tool call leakage, chain-of-thought narration, random tool invocations) due to creating a brand new agent per HTTP request and giving a 3B model (llama3.2) a 73-line system prompt with complex tool-calling instructions it couldn't follow. Key changes: - Add session.py singleton with stable session_id for conversation continuity - Add _model_supports_tools() to strip tools from small models (< 7B) - Add two-tier prompts: lite (12 lines) for small models, full for capable ones - Add response sanitizer to strip leaked JSON tool calls and CoT narration - Set show_tool_calls=False to prevent raw tool JSON in output - Wire ConversationManager for user name extraction - Deprecate orphaned memory_layers.py (unused 4-layer system) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 19:18:08 -05:00
Alexander Payne	16b65b28e8	Add Tier 3: Semantic Memory (vector search) Completes the three-tier memory architecture: ## Tier 3 — Semantic Search - Vector embeddings over all vault files - Similarity-based retrieval - memory_search tool for agents - Fallback to hash-based embeddings if transformers unavailable ## Implementation - src/timmy/semantic_memory.py — Core semantic memory - Chunking strategy: paragraphs → sentences - SQLite storage for vectors - cosine_similarity for ranking ## Integration - Added memory_search to create_full_toolkit() - Updated prompts with memory_search examples - Tool triggers: past conversations, reminders ## Features - Automatic vault indexing - Source file tracking (re-indexes on change) - Similarity scoring - Context retrieval for queries ## Usage All 973 tests pass.	2026-02-25 18:25:20 -05:00
Alexander Payne	7838df19b0	Implement three-tier memory architecture (Hot/Vault/Handoff) This commit replaces the previous memory_layers.py with a proper three-tier memory system as specified by the user: ## Tier 1 — Hot Memory (MEMORY.md) - Single flat file always loaded into system context - Contains: current status, standing rules, agent roster, key decisions - ~300 lines max, pruned monthly - Managed by HotMemory class ## Tier 2 — Structured Vault (memory/) - Directory with three namespaces: • self/ — identity.md, user_profile.md, methodology.md • notes/ — session logs, AARs, research • aar/ — post-task retrospectives - Markdown format, Obsidian-compatible - Append-only, date-stamped - Managed by VaultMemory class ## Handoff Protocol - last-session-handoff.md written at session end - Contains: summary, key decisions, open items, next steps - Auto-loaded at next session start - Maintains continuity across resets ## Implementation ### New Files: - src/timmy/memory_system.py — Core memory system - MEMORY.md — Hot memory template - memory/self/*.md — Identity, user profile, methodology ### Modified: - src/timmy/agent.py — Integrated with memory system - create_timmy() injects memory context - TimmyWithMemory class with automatic fact extraction - tests/test_agent.py — Updated for memory context ## Key Principles - Hot memory = small and curated - Vault = append-only, never delete - Handoffs = continuity mechanism - Flat files = human-readable, portable ## Usage All 973 tests pass.	2026-02-25 18:17:43 -05:00
Alexander Payne	625806daf5	Fine-tune Timmy's conversational AI with memory layers ## Enhanced System Prompt - Detailed tool usage guidelines with explicit examples - Clear DO and DON'T examples for tool selection - Memory system documentation - Conversation flow guidelines - Context awareness instructions ## Memory Layer System (NEW) Implemented 3-layer memory architecture: 1. WORKING MEMORY (src/timmy/memory_layers.py) - Immediate context (last 20 messages) - Topic tracking - Tool call tracking - Fast, ephemeral 2. SHORT-TERM MEMORY (Agno SQLite) - Recent conversations (100) - Persists across restarts - Managed by Agno Agent 3. LONG-TERM MEMORY (src/timmy/memory_layers.py) - Facts about user (name, preferences) - SQLite storage in data/memory/ - Auto-extraction from conversations - User profile generation ## Memory Manager (NEW) - Central coordinator for all memory layers - Context injection into prompts - Fact extraction and storage - Session management ## TimmyWithMemory Class (NEW) - Wrapper around Agno Agent with explicit memory - Auto-injects user context from LTM - Tracks exchanges across all layers - Simple chat() interface ## Agent Configuration - Increased num_history_runs: 10 -> 20 - Better conversational context retention ## Tests - All 973 tests pass - Fixed test expectations for new config - Fixed module path in test_scary_paths.py ## Files Added/Modified - src/timmy/prompts.py - Enhanced with memory and tool guidance - src/timmy/agent.py - Added TimmyWithMemory class - src/timmy/memory_layers.py - NEW memory system - src/timmy/conversation.py - NEW conversation manager - tests/ - Updated for new config	2026-02-25 18:07:44 -05:00
Alexander Whitestone	c18da7bce8	Merge pull request #37 from AlexanderWhitestone/timmy/self-modify-1772038737 feat: autonomous self-modifying agent	2026-02-25 17:52:13 -05:00
Alexander Whitestone	5e523d4654	Merge pull request #41 from AlexanderWhitestone/claude/peaceful-benz feat: Mission Control dashboard + scary path tests	2026-02-25 17:51:53 -05:00
Alexander Payne	90a93aa070	fix: resolve merge conflict in base.html nav with main Keep Mission Control link from this branch alongside SWARM and SPARK links from main. All 939 tests pass. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 17:51:15 -05:00
Alexander Payne	b282ac19be	fix: resolve merge conflict in config.py with main Keep both L402/privacy settings from security hardening PR and self-modification settings. All 939 tests pass. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 17:48:28 -05:00
Alexander Whitestone	d853e931ec	Merge pull request #40 from AlexanderWhitestone/kimi/phase2-swarm-hardening-v2 Phase 2: Swarm hardening, auto-auction, WebSocket fix	2026-02-25 17:34:13 -05:00
Alexander Payne	fc326421b1	fix: update integration tests for auto-auction behavior The POST /swarm/tasks endpoint now triggers an automatic auction via asyncio.create_task. Tests must allow tasks to be in bidding, assigned, or failed status since the background auction may resolve before the follow-up GET query. All 895 tests pass. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 17:28:41 -05:00
Alexander Whitestone	f161ed3f5a	Merge pull request #39 from AlexanderWhitestone/timmy/self-modify-v2 feat: autonomous self-modifying agent	2026-02-25 17:28:20 -05:00
Alexander Payne	4b12aca090	Swarm hardening: mobile nav, registry cleanup, module path fix ## Workset E: Swarm System Realization - Verified PersonaNode bidding system is properly connected - Coordinator already subscribes personas to task announcements - Auction system works when /tasks/auction endpoint is used ## Workset F: Testing & Reliability - Mobile nav: Add MOBILE link to desktop header (UX-01) - Voice TTS: Verified graceful degradation already implemented - Registry: Add proper connection cleanup with try/finally ## Workset G: Performance & Architecture - Fix module path: websocket.handler -> ws_manager.handler - Registry connections now properly closed after operations All 895 tests pass. Addresses QUALITY_ANALYSIS.md: - UX-01: /mobile route now in desktop nav - PERF-01: Connection cleanup improved (P3) - FUNC-01/02: Verified bidding system operational	2026-02-25 17:26:42 -05:00
Alexander Payne	8fec9c41a5	feat: autonomous self-modifying agent with multi-backend LLM support Adds SelfModifyLoop — an edit→validate→test→commit cycle that can read its own failure reports, diagnose root causes, and restart autonomously. Key capabilities: - Multi-backend LLM: Anthropic Claude API, Ollama, or auto-detect - Syntax validation via compile() before writing to disk - Autonomous self-correction loop with configurable max cycles - XML-based output format to avoid triple-quote delimiter conflicts - Branch creation skipped by default to prevent container restarts - CLI: self-modify run "instruction" --backend auto --autonomous - 939 tests passing, 30 skipped Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 17:23:47 -05:00
Alexander Whitestone	8958cf830a	Merge pull request #36 from AlexanderWhitestone/claude/keen-kalam fix: purge stale bytecache on make dev	2026-02-25 17:21:05 -05:00
Alexander Payne	27fabb765c	feat: autonomous self-modifying agent with multi-backend LLM support Adds SelfModifyLoop — an edit→validate→test→commit cycle that can read its own failure reports, diagnose root causes, and restart autonomously. Key capabilities: - Multi-backend LLM: Anthropic Claude API, Ollama, or auto-detect - Syntax validation via compile() before writing to disk - Autonomous self-correction loop with configurable max cycles - XML-based output format to avoid triple-quote delimiter conflicts - Branch creation skipped by default to prevent container restarts - CLI: self-modify run "instruction" --backend auto --autonomous - 939 tests passing, 30 skipped Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 17:18:58 -05:00
Alexander Payne	53f8d0912e	fix: purge stale bytecache on make dev to prevent old .pyc errors The Agno Toolkit API fix (`1bc2cdc`) wasn't taking effect because Python was loading stale __pycache__/*.pyc files with the old add_tool() calls. Now `make nuke` clears all bytecache, and `make dev` sets PYTHONDONTWRITEBYTECODE=1 to prevent .pyc creation during development. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 17:18:44 -05:00
Alexander Whitestone	273c666e37	Merge pull request #35 from AlexanderWhitestone/fix/security-privacy-hardening Security, Privacy & Agent Intelligence Hardening	2026-02-25 15:39:43 -05:00
Alexander Payne	4961c610f2	Security, privacy, and agent intelligence hardening ## Security (Workset A) - XSS: Verified templates use safe DOM methods (textContent, createElement) - Secrets: Fail-fast in production mode when L402 secrets not set - Environment mode: Add TIMMY_ENV (development\|production) validation ## Privacy (Workset C) - Add telemetry_enabled config (default: False for sovereign AI) - Pass telemetry setting to Agno Agent - Update .env.example with TELEMETRY_ENABLED and TIMMY_ENV docs ## Agent Intelligence (Workset D) - Enhanced TIMMY_SYSTEM_PROMPT with: - Tool usage guidelines (when to use, when not to) - Memory awareness documentation - Operating mode documentation - Help reduce unnecessary tool calls for simple queries All 895 tests pass. Telemetry disabled by default aligns with sovereign AI vision.	2026-02-25 15:32:19 -05:00
Alexander Whitestone	1df5145895	Merge pull request #34 from AlexanderWhitestone/fix/toolkit-api-compatibility fix: Agno Toolkit API compatibility + quality review	2026-02-25 15:24:28 -05:00
Alexander Payne	5571a4d8a0	docs: add quality review report and updated coverage (84.15%)	2026-02-25 15:15:30 -05:00
Alexander Whitestone	e1d88a0f47	Merge pull request #33 from AlexanderWhitestone/fix/toolkit-api-compatibility Fix Agno Toolkit API compatibility issues	2026-02-25 14:13:19 -05:00
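
Commit c658ca829c describes priority-ordered provider routing with a circuit breaker for failing providers. The sketch below illustrates that general pattern only and is not the repository's code. The names `CircuitBreaker` and `route`, the failure threshold, and the cooldown value are all hypothetical choices made for this example.

```python
import time
from dataclasses import dataclass
from typing import Optional


@dataclass
class CircuitBreaker:
    """Hypothetical breaker: opens after `threshold` consecutive failures."""
    threshold: int = 3
    cooldown: float = 30.0  # seconds to wait before allowing a trial call
    failures: int = 0
    opened_at: Optional[float] = None

    def allows(self, now: float) -> bool:
        # Closed breaker: always allow.
        if self.opened_at is None:
            return True
        # Open breaker: allow a trial call once the cooldown has elapsed.
        return now - self.opened_at >= self.cooldown

    def record(self, ok: bool, now: float) -> None:
        if ok:
            # A success closes the breaker and resets the failure count.
            self.failures = 0
            self.opened_at = None
        else:
            self.failures += 1
            if self.failures >= self.threshold:
                self.opened_at = now


def route(providers, prompt, clock=time.monotonic):
    """Try (name, call, breaker) triples in priority order with failover."""
    for name, call, breaker in providers:
        if not breaker.allows(clock()):
            continue  # breaker is open: skip this provider
        try:
            result = call(prompt)
        except Exception:
            breaker.record(False, clock())
            continue  # fall through to the next provider
        breaker.record(True, clock())
        return name, result
    raise RuntimeError("all providers failed or are unavailable")
```

Passing the clock as a parameter keeps the breaker testable without sleeping. A real router would likely also distinguish rate limits from hard errors, as the commit message suggests.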


148 Commits
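
Commit a719c7538d mentions an async publish/subscribe event bus with wildcard pattern matching (agent.task.*), event history, and concurrent handler execution. Below is a minimal sketch of that idea, assuming a hypothetical `EventBus` class. It uses the standard library's `fnmatchcase` for the wildcard match, which is an assumption of this example and not necessarily what src/events/bus.py does.

```python
import asyncio
from fnmatch import fnmatchcase


class EventBus:
    """Hypothetical in-process bus; not the repository's implementation."""

    def __init__(self):
        self._subs = []    # list of (pattern, async handler) pairs
        self.history = []  # every published (topic, payload), in order

    def subscribe(self, pattern, handler):
        self._subs.append((pattern, handler))

    async def publish(self, topic, payload):
        self.history.append((topic, payload))
        # Select handlers whose pattern matches the topic.
        handlers = [h for p, h in self._subs if fnmatchcase(topic, p)]
        # Run all matching handlers concurrently.
        await asyncio.gather(*(h(topic, payload) for h in handlers))
        return len(handlers)
```

Note that `fnmatchcase` lets `*` match across dots, so `agent.task.*` also matches `agent.task.a.b`. A bus that wants one-segment wildcards would need its own matcher.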