Alexander Whitestone 79e8a6894a Microservices Refactoring & CI/CD Optimization (#89)

* feat: microservices refactoring with TDD and Docker optimization

## Summary
Complete refactoring of Timmy Time from monolithic architecture to microservices
using Test-Driven Development (TDD) and optimized Docker builds.

## Changes

### Core Improvements
- Optimized dashboard startup: moved blocking tasks to async background processes
- Fixed model fallback logic in agent configuration
- Enhanced test fixtures with comprehensive conftest.py
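
The startup change in the first bullet can be pictured with a minimal asyncio sketch. The names `warm_up` and `startup` and the simulated delay are hypothetical stand-ins, not code from src/dashboard/app.py; the only point is that startup schedules slow work as a background task and returns right away.

```python
import asyncio


async def warm_up(results: list) -> None:
    """Hypothetical slow task (e.g. a model check), simulated with a short sleep."""
    await asyncio.sleep(0.05)
    results.append("warm")


async def startup(results: list) -> asyncio.Task:
    """Schedule the slow work in the background instead of awaiting it inline."""
    # Keep a reference to the task so it is not garbage-collected early.
    return asyncio.create_task(warm_up(results))


async def main():
    results = []
    task = await startup(results)   # returns immediately
    seen_at_startup = list(results) # background work has not finished yet
    await task                      # later, the background work completes
    return seen_at_startup, results


if __name__ == "__main__":
    print(asyncio.run(main()))
```

Awaiting `warm_up` directly inside `startup` would instead hold up the caller for the full delay, which is the blocking behaviour the bullet describes removing.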

### Microservices Architecture
- Created separate Dockerfiles for dashboard, Ollama, and agent services
- Implemented docker-compose.microservices.yml for service orchestration
- Added health checks and non-root user execution for security
- Multi-stage Docker builds for lean, fast images

### Testing
- Added E2E tests for dashboard responsiveness
- Added E2E tests for Ollama integration
- Added E2E tests for microservices architecture validation
- 36 tests passing, 8 skipped (environment-specific)

### Documentation
- Created comprehensive final report
- Generated issue resolution plan
- Added interview transcript demonstrating core agent functionality

### New Modules
- skill_absorption.py: Dynamic skill loading and integration system for Timmy

## Test Results
✅ 36 passed, 8 skipped, 6 warnings
✅ All microservices tests passing
✅ Dashboard responsiveness verified
✅ Ollama integration validated

## Files Added/Modified
- docker/: Multi-stage Dockerfiles for all services
- tests/e2e/: Comprehensive E2E test suite
- src/timmy/skill_absorption.py: Skill absorption system
- src/dashboard/app.py: Optimized startup logic
- tests/conftest.py: Enhanced test fixtures
- docker-compose.microservices.yml: Service orchestration

## Breaking Changes
None - all changes are backward compatible

## Next Steps
- Integrate skill absorption system into agent workflow
- Test with microservices-tdd-refactor skill
- Deploy to production with docker-compose orchestration

* CI/CD Optimization: Guard Rails, Black Linting, and Pre-commit Hooks

- Fixed all test collection errors (Selenium imports, fixture paths, syntax)
- Implemented pre-commit hooks with Black formatting and isort
- Created comprehensive Makefile with test targets (unit, integration, functional, e2e)
- Added pytest.ini with marker definitions for test categorization
- Established guard rails to prevent future collection errors
- Wrapped optional dependencies (Selenium, MoviePy) in try-except blocks
- Added conftest_markers for automatic test categorization
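
Wrapping optional dependencies so that a missing package does not break test collection might look roughly like the sketch below. The helper `optional_import` and the `HAS_*` flags are illustrative names, not taken from the repository.

```python
import importlib
from types import ModuleType
from typing import Optional


def optional_import(name: str) -> Optional[ModuleType]:
    """Return the module if it is installed, otherwise None."""
    try:
        return importlib.import_module(name)
    except ImportError:
        return None


# Hypothetical flags that tests or fixtures could check before using the package.
selenium = optional_import("selenium")
moviepy = optional_import("moviepy")
HAS_SELENIUM = selenium is not None
HAS_MOVIEPY = moviepy is not None
```

Tests that need one of these packages can then be skipped when the matching flag is false, rather than failing at import time.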

This keeps the development workflow smooth, with:
- Fast feedback loops (pre-commit checks before push)
- Consistent code formatting (Black)
- Reliable CI/CD (no collection errors, proper test isolation)
- Clear test organization (unit, integration, functional, E2E)

* Fix CI/CD test failures:
- Export templates from dashboard.app
- Fix model name assertion in test_agent.py
- Fix platform-agnostic path resolution in test_path_resolution.py
- Skip Docker tests in test_docker_deployment.py if docker not available
- Fix test_model_fallback_chain logic in test_ollama_integration.py

* Add preventative pre-commit checks and Docker test skipif decorators:
- Create pre_commit_checks.py script for common CI failures
- Add skipif decorators to Docker tests
- Improve test robustness for CI environments
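
A dependency-free sketch of the skip-when-Docker-is-missing idea follows. The commit uses pytest `skipif` decorators; this version swaps in the standard library's `unittest.skipUnless` so it runs without pytest installed. The class and test names are hypothetical. The pytest form of the same condition would be `pytest.mark.skipif(shutil.which("docker") is None, reason="...")`.

```python
import shutil
import unittest

# True only when a `docker` executable is found on PATH.
DOCKER_AVAILABLE = shutil.which("docker") is not None


class DockerDeploymentTests(unittest.TestCase):
    @unittest.skipUnless(DOCKER_AVAILABLE, "docker CLI not found on PATH")
    def test_docker_cli_is_on_path(self):
        # Runs only where Docker is installed; reported as skipped elsewhere.
        self.assertIsNotNone(shutil.which("docker"))


if __name__ == "__main__":
    unittest.main()
```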

2026-02-28 12:18:01 -05:00

.github/workflows

ci: mobile-friendly test results via GitHub Actions

2026-02-19 19:41:01 +00:00

.handoff

fix: resolve merge conflict in base.html nav with main

2026-02-25 17:51:15 -05:00

Fix router disabled provider check + comprehensive functional tests

2026-02-25 20:22:51 -05:00

config

feat: Multi-modal support with automatic model fallback

2026-02-26 22:29:44 -05:00

data/self_modify_reports

feat: add default thinking thread — Timmy always ponders (#75 )

2026-02-27 01:00:11 -05:00

deploy

feat: one-click cloud deployment — Caddy HTTPS, Ollama, systemd, cloud-init

2026-02-24 21:22:56 +00:00

docker

feat: microservices refactoring with TDD and Docker optimization (#88 )

2026-02-28 11:07:19 -05:00

docs

docs: add integration ROADMAP and Ascension manifesto (#83 )

2026-02-27 21:09:49 -05:00

hands

feat: Phase 5 Additional Hands (Scout, Scribe, Ledger, Weaver)

2026-02-26 13:07:43 -05:00

memory/self

feat: task queue system with startup drain and backlogging (#76 )

2026-02-27 01:52:42 -05:00

mobile-app

feat: wire mobile app to real Timmy backend via JSON REST API (#73 )

2026-02-26 23:58:53 -05:00

scripts

CI/CD Optimization: Guard Rails, Pre-commit Checks, and Test Fixes (#90 )

2026-02-28 11:36:50 -05:00

src

CI/CD Optimization: Guard Rails, Pre-commit Checks, and Test Fixes (#90 )

2026-02-28 11:36:50 -05:00

static

feat: add in-browser local model support for iPhone via WebLLM

2026-02-27 00:03:05 +00:00

tests

CI/CD Optimization: Guard Rails, Pre-commit Checks, and Test Fixes (#90 )

2026-02-28 11:36:50 -05:00

.dockerignore

feat: one-click cloud deployment — Caddy HTTPS, Ollama, systemd, cloud-init

2026-02-24 21:22:56 +00:00

.env.example

feat: add Grok (xAI) as opt-in premium backend with monetization

2026-02-27 01:12:51 +00:00

.gitignore

feat: automatic error feedback loop with bug report tracker (#80 )

2026-02-27 19:51:37 -05:00

.pre-commit-config.yaml

CI/CD Optimization: Guard Rails, Pre-commit Checks, and Test Fixes (#90 )

2026-02-28 11:36:50 -05:00

AGENTS.md

feat: Timmy fixes and improvements (#72 )

2026-02-26 23:39:13 -05:00

CLAUDE.md

refactor: Phase 2b — consolidate 28 modules into 14 packages

2026-02-26 22:07:41 +00:00

coverage.xml

docs: add quality review report and updated coverage (84.15%)

2026-02-25 15:15:30 -05:00

DECISIONS.md

feat: Timmy fixes and improvements (#72 )

2026-02-26 23:39:13 -05:00

docker-compose.dev.yml

feat: single-command Docker startup, fix UI bugs, add Selenium tests

2026-02-25 07:20:56 -05:00

docker-compose.enhanced.yml

feat: microservices refactoring with TDD and Docker optimization (#88 )

2026-02-28 11:07:19 -05:00

docker-compose.microservices.yml

feat: microservices refactoring with TDD and Docker optimization (#88 )

2026-02-28 11:07:19 -05:00

docker-compose.prod.yml

feat: one-click cloud deployment — Caddy HTTPS, Ollama, systemd, cloud-init

2026-02-24 21:22:56 +00:00

docker-compose.test.yml

feat: add functional Ollama chat tests with containerised LLM

2026-02-25 02:44:36 +00:00

docker-compose.yml

fix: register task_request handler and fix Docker 403 errors on macOS (#86 )

2026-02-28 07:39:31 -05:00

Dockerfile

fix: register task_request handler and fix Docker 403 errors on macOS (#86 )

2026-02-28 07:39:31 -05:00

Dockerfile.ollama

feat: microservices refactoring with TDD and Docker optimization (#88 )

2026-02-28 11:07:19 -05:00

final_report.md

feat: microservices refactoring with TDD and Docker optimization (#88 )

2026-02-28 11:07:19 -05:00

interview_timmy.py

feat: microservices refactoring with TDD and Docker optimization (#88 )

2026-02-28 11:07:19 -05:00

interview_transcript.txt

feat: microservices refactoring with TDD and Docker optimization (#88 )

2026-02-28 11:07:19 -05:00

issue_resolution_plan.md

feat: microservices refactoring with TDD and Docker optimization (#88 )

2026-02-28 11:07:19 -05:00

LICENSE

fix: serve_chat endpoint bug, stale docs, and license mismatch

2026-02-24 17:18:29 +00:00

Makefile

CI/CD Optimization: Guard Rails, Pre-commit Checks, and Test Fixes (#90 )

2026-02-28 11:36:50 -05:00

MEMORY.md

feat: task queue system with startup drain and backlogging (#76 )

2026-02-27 01:52:42 -05:00

message_to_alexander.txt

feat: task queue system with startup drain and backlogging (#76 )

2026-02-27 01:52:42 -05:00

opencode.json

config: add opencode.json with local Ollama provider (#71 )

2026-02-26 23:21:43 -05:00

pyproject.toml

test: remove hardcoded sleeps, add pytest-timeout (#69 )

2026-02-26 22:52:36 -05:00

pytest.ini

CI/CD Optimization: Guard Rails, Pre-commit Checks, and Test Fixes (#90 )

2026-02-28 11:36:50 -05:00

README.md

Merge main into feature/model-upgrade-llama3.1 with conflict resolution

2026-02-26 22:19:44 -05:00

REFACTORING_PLAN.md

refactor: Phase 2b — consolidate 28 modules into 14 packages

2026-02-26 22:07:41 +00:00

ROADMAP.md

docs: add integration ROADMAP and Ascension manifesto (#83 )

2026-02-27 21:09:49 -05:00

run_e2e_tests.sh

feat: complete Event Log, Ledger, Memory, Cascade Router, Upgrade Queue, Activity Feed

2026-02-26 08:01:01 -05:00

SECURITY.md

Fix dashboard tests and add SECURITY.md audit report (#84 )

2026-02-28 06:59:15 -05:00

thought_stream.txt

feat: task queue system with startup drain and backlogging (#76 )

2026-02-27 01:52:42 -05:00

README.md

Timmy Time — Mission Control

A local-first, sovereign AI agent system. Talk to Timmy, watch his swarm, gate API access with Bitcoin Lightning — all from a browser, no cloud AI required.

Live Docs →

Quick Start

git clone https://github.com/AlexanderWhitestone/Timmy-time-dashboard.git
cd Timmy-time-dashboard
make install              # create venv + install deps
cp .env.example .env      # configure environment

ollama serve              # separate terminal
ollama pull llama3.1:8b-instruct  # Required for reliable tool calling

make dev                  # http://localhost:8000
make test                 # no Ollama needed

Note: llama3.1:8b-instruct is used instead of llama3.2 because it is specifically fine-tuned for reliable tool/function calling. llama3.2 (3B) was found to hallucinate tool output consistently in testing. Fallback: qwen2.5:14b if llama3.1:8b-instruct is not available.
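
The fallback order described in the note can be sketched as a small selection function. This is an illustration only: `pick_model` is a hypothetical name, and how the project actually discovers installed models is not shown here.

```python
from typing import Iterable, Optional

PREFERRED_MODEL = "llama3.1:8b-instruct"
FALLBACK_MODELS = ("qwen2.5:14b",)


def pick_model(
    available: Iterable[str],
    preferred: str = PREFERRED_MODEL,
    fallbacks: Iterable[str] = FALLBACK_MODELS,
) -> Optional[str]:
    """Return the preferred model if installed, else the first installed fallback."""
    installed = set(available)
    for candidate in (preferred, *fallbacks):
        if candidate in installed:
            return candidate
    return None  # nothing suitable is installed


if __name__ == "__main__":
    print(pick_model(["llama3.2", "qwen2.5:14b"]))  # qwen2.5:14b
```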

What's Here

Subsystem	Description
Timmy Agent	Agno-powered agent (Ollama default, AirLLM optional for 70B/405B)
Mission Control	FastAPI + HTMX dashboard — chat, health, swarm, marketplace
Swarm	Multi-agent coordinator — spawn agents, post tasks, Lightning auctions
L402 / Lightning	Bitcoin Lightning payment gating for API access
Spark	Event capture, predictions, memory consolidation, advisory
Creative Studio	Multi-persona pipeline — image, music, video generation
Hands	6 autonomous scheduled agents — Oracle, Sentinel, Scout, Scribe, Ledger, Weaver
Self-Coding	Codebase-aware self-modification with git safety
Integrations	Telegram bridge, Siri Shortcuts, voice NLU, mobile layout

Commands

make dev            # start dashboard (http://localhost:8000)
make test           # run all tests
make test-cov       # tests + coverage report
make lint           # run ruff/flake8
make docker-up      # start via Docker
make help           # see all commands

CLI tools: timmy, timmy-serve, self-tdd, self-modify

Documentation

Document	Purpose
CLAUDE.md	AI assistant development guide
AGENTS.md	Multi-agent development standards
.env.example	Configuration reference
docs/	Architecture docs, ADRs, audits

Configuration

cp .env.example .env

Variable	Default	Purpose
`OLLAMA_URL`	`http://localhost:11434`	Ollama host
`OLLAMA_MODEL`	`llama3.1:8b-instruct`	Model for tool calling. Use llama3.1:8b-instruct for reliable tool use; fall back to qwen2.5:14b
`DEBUG`	`false`	Enable `/docs` and `/redoc`
`TIMMY_MODEL_BACKEND`	`ollama`	`ollama` | `airllm` | `auto`
`AIRLLM_MODEL_SIZE`	`70b`	`8b` | `70b` | `405b`
`L402_HMAC_SECRET`	(default — change in prod)	HMAC signing key for macaroons
`L402_MACAROON_SECRET`	(default — change in prod)	Macaroon secret
`LIGHTNING_BACKEND`	`mock`	`mock` (production-ready) | `lnd` (scaffolded, not yet functional)
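
To make the defaults in the table concrete, here is a standard-library sketch that reads a few of these variables. The project itself keeps its settings in src/config.py using pydantic-settings; the `Settings` dataclass, `load_settings`, and the way `DEBUG` is parsed below are assumptions made for illustration.

```python
import os
from dataclasses import dataclass
from typing import Mapping

VALID_BACKENDS = {"ollama", "airllm", "auto"}


@dataclass(frozen=True)
class Settings:
    ollama_url: str
    ollama_model: str
    debug: bool
    timmy_model_backend: str
    airllm_model_size: str
    lightning_backend: str


def load_settings(env: Mapping[str, str] = os.environ) -> Settings:
    """Build settings from an environment mapping, using the documented defaults."""
    backend = env.get("TIMMY_MODEL_BACKEND", "ollama")
    if backend not in VALID_BACKENDS:
        raise ValueError(f"TIMMY_MODEL_BACKEND must be one of {sorted(VALID_BACKENDS)}")
    return Settings(
        ollama_url=env.get("OLLAMA_URL", "http://localhost:11434"),
        ollama_model=env.get("OLLAMA_MODEL", "llama3.1:8b-instruct"),
        # Assumption: common truthy spellings enable debug mode.
        debug=env.get("DEBUG", "false").strip().lower() in {"1", "true", "yes"},
        timmy_model_backend=backend,
        airllm_model_size=env.get("AIRLLM_MODEL_SIZE", "70b"),
        lightning_backend=env.get("LIGHTNING_BACKEND", "mock"),
    )


if __name__ == "__main__":
    print(load_settings({}))
```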

Architecture

Browser / Phone
      │ HTTP + HTMX + WebSocket
      ▼
┌─────────────────────────────────────────┐
│             FastAPI (dashboard.app)      │
│  routes: agents, health, swarm,          │
│          marketplace, voice, mobile      │
└───┬─────────────┬──────────┬────────────┘
    │             │          │
    ▼             ▼          ▼
Jinja2        Timmy       Swarm
Templates     Agent       Coordinator
(HTMX)        │           ├─ Registry (SQLite)
              ├─ Ollama   ├─ AuctionManager (L402 bids)
              └─ AirLLM   ├─ SwarmComms (Redis / in-memory)
                          └─ SwarmManager (subprocess)
    │
    ├── Voice NLU + TTS (pyttsx3, local)
    ├── WebSocket live feed (ws_manager)
    ├── L402 Lightning proxy (macaroon + invoice)
    ├── Push notifications (local + macOS native)
    └── Siri Shortcuts API endpoints

Persistence: timmy.db (Agno memory), data/swarm.db (registry + tasks)
External:    Ollama :11434, optional Redis, optional LND gRPC

Project Layout

src/
  config.py           # pydantic-settings — all env vars live here
  timmy/              # Core agent (agent.py, backends.py, cli.py, prompts.py)
  hands/              # Autonomous scheduled agents (registry, scheduler, runner)
  dashboard/          # FastAPI app, routes, Jinja2 templates
  swarm/              # Multi-agent: coordinator, registry, bidder, tasks, comms
  timmy_serve/        # L402 proxy, payment handler, TTS, serve CLI
  spark/              # Intelligence engine — events, predictions, advisory
  creative/           # Creative director + video assembler pipeline
  tools/              # Git, image, music, video tools for persona agents
  lightning/          # Lightning backend abstraction (mock + LND)
  agent_core/         # Substrate-agnostic agent interface
  voice/              # NLU intent detection
  ws_manager/         # WebSocket connection manager
  notifications/      # Push notification store
  shortcuts/          # Siri Shortcuts endpoints
  telegram_bot/       # Telegram bridge
  self_tdd/           # Continuous test watchdog
hands/                # Hand manifests — oracle/, sentinel/, etc.
tests/                # one test file per module, all mocked
static/style.css      # Dark mission-control theme (JetBrains Mono)
docs/                 # GitHub Pages landing page
AGENTS.md             # AI agent development standards ← read this
.env.example          # Environment variable reference
Makefile              # Common dev commands

Mobile Access

The dashboard is fully mobile-optimized (iOS safe area, 44px touch targets, 16px input to prevent zoom, momentum scroll).

# Bind to your local network
uvicorn dashboard.app:app --host 0.0.0.0 --port 8000 --reload

# Find your IP
ipconfig getifaddr en0    # Wi-Fi on macOS

Open http://<your-ip>:8000 on your phone (same Wi-Fi network).
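
The `ipconfig` command above is macOS-specific. A portable way to look up the LAN address from Python is sketched below; `local_ip` is a hypothetical helper, and the probe address is an arbitrary private address that is never actually contacted (connecting a UDP socket only selects a route).

```python
import socket


def local_ip(probe_host: str = "10.255.255.255", probe_port: int = 1) -> str:
    """Best-effort guess at this machine's LAN IPv4 address."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    try:
        # No packet is sent for a UDP connect; the OS just picks the outgoing interface.
        sock.connect((probe_host, probe_port))
        return sock.getsockname()[0]
    except OSError:
        return "127.0.0.1"  # no usable network route was found
    finally:
        sock.close()


if __name__ == "__main__":
    print(f"http://{local_ip()}:8000")
```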

Mobile-specific routes:

/mobile — single-column optimized layout
/mobile-test — 21-scenario HITL test harness (layout, touch, scroll, notch)

Hands — Autonomous Agents

Hands are autonomous agents that run on cron schedules. Each Hand has a HAND.toml manifest, a SYSTEM.md prompt, and an optional skills/ directory.

Built-in Hands:

Hand	Schedule	Purpose
Oracle	7am, 7pm UTC	Bitcoin intelligence — price, on-chain, macro analysis
Sentinel	Every 15 min	System health — dashboard, agents, database, resources
Scout	Every hour	OSINT monitoring — HN, Reddit, RSS for Bitcoin/sovereign AI
Scribe	Daily 9am	Content production — blog posts, docs, changelog
Ledger	Every 6 hours	Treasury tracking — Bitcoin/Lightning balances, payment audit
Weaver	Sunday 10am	Creative pipeline — orchestrates Pixel+Lyra+Reel for video

Dashboard: /hands — manage, trigger, approve actions

Example HAND.toml:

[hand]
name = "oracle"
schedule = "0 7,19 * * *"  # Twice daily
enabled = true

[tools]
required = ["mempool_fetch", "price_fetch"]

[approval_gates]
broadcast = { action = "broadcast", description = "Post to dashboard" }

[output]
dashboard = true
channel = "telegram"

AirLLM — Big Brain Backend

Run 70B or 405B models locally with no GPU, using AirLLM's layer-by-layer loading. Apple Silicon uses MLX automatically.

pip install ".[bigbrain]"
pip install "airllm[mlx]"   # Apple Silicon only

timmy chat "Explain self-custody" --backend airllm --model-size 70b

Or set once in .env:

TIMMY_MODEL_BACKEND=auto
AIRLLM_MODEL_SIZE=70b

Flag	Parameters	RAM needed
`8b`	8 billion	~16 GB
`70b`	70 billion	~140 GB
`405b`	405 billion	~810 GB
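
The figures in the table are consistent with roughly 2 bytes per parameter, which is what 16-bit weights would take. That precision is an inference from the numbers, not something the README states. A quick check of the arithmetic:

```python
BYTES_PER_PARAM = 2  # assumption: 16-bit weights, inferred from the table


def approx_ram_gb(params_billions: float, bytes_per_param: int = BYTES_PER_PARAM) -> float:
    """Estimate weight size in decimal GB: 1 billion params x N bytes = N GB."""
    return float(params_billions * bytes_per_param)


if __name__ == "__main__":
    for flag, billions in {"8b": 8, "70b": 70, "405b": 405}.items():
        print(f"{flag}: ~{approx_ram_gb(billions):.0f} GB")
```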

CLI

timmy chat "What is sovereignty?"
timmy think "Bitcoin and self-custody"
timmy status

timmy-serve start          # L402-gated API server (port 8402)
timmy-serve invoice        # generate a Lightning invoice
timmy-serve status

Or use the bootstrap script (sets up the venv, tests, watchdog, and server in one shot):

bash scripts/activate_self_tdd.sh
bash scripts/activate_self_tdd.sh --big-brain   # also installs AirLLM

Troubleshooting

ollama: command not found — brew install ollama or ollama.com
connection refused — run ollama serve first
ModuleNotFoundError — source .venv/bin/activate && make install
Health panel shows DOWN — Ollama isn't running; chat returns offline message
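
For the "connection refused" and "DOWN" cases, a quick way to confirm whether anything is listening on the Ollama port is a plain TCP check. `ollama_reachable` is a hypothetical helper; it only tests that the port accepts connections, not that the service behind it is Ollama or that a model is loaded.

```python
import socket


def ollama_reachable(host: str = "localhost", port: int = 11434, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and name resolution failures.
        return False


if __name__ == "__main__":
    if ollama_reachable():
        print("Port 11434 is accepting connections")
    else:
        print("Nothing is listening on port 11434; try running: ollama serve")
```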

Roadmap

Version	Name	Status
1.0	Genesis	Complete — Agno + Ollama + SQLite + Dashboard
2.0	Exodus	In progress — Swarm + L402 + Voice + Marketplace + Hands
3.0	Revelation	Planned — Lightning treasury + single `.app` bundle

Languages

Python 86.9%

HTML 6.8%

CSS 2.7%

Shell 1.6%

TypeScript 1%

Other 1%