Update _quota_allows_cloud() to use select_model("high") instead of
should_use_cloud(), so the cascade router uses the same metabolic tier
logic as the standalone quota API:
- BURST tier (5h < 50%): cloud providers allowed
- ACTIVE tier (5h >= 50%): cloud providers skipped, local 14B used
- RESTING tier (7d >= 80%): cloud providers skipped, local 8B used
Adds log message showing tier name and suggested local model when
downshifting, so Alexander can verify the protocol is working.
Adds TestMetabolicProtocol test class (5 tests) covering:
- BURST tier allows cloud
- ACTIVE tier skips cloud
- RESTING tier skips cloud
- Local (ollama) providers always attempted regardless of quota
- No quota monitor → cloud allowed (graceful degradation)
Files src/infrastructure/claude_quota.py and scripts/claude_quota_check.sh
were already in place from prior work.
Fixes #1106
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Timmy Time — Mission Control
A local-first, sovereign AI agent system. Talk to Timmy, watch his swarm, gate API access with Bitcoin Lightning — all from a browser, no cloud AI required.
Quick Start
git clone http://localhost:3000/rockachopa/Timmy-time-dashboard.git
cd Timmy-time-dashboard
make install # create venv + install deps
cp .env.example .env # configure environment
ollama serve # separate terminal
ollama pull qwen3:30b # Required for reliable tool calling
make dev # http://localhost:8000
make test # no Ollama needed
Note: qwen3:30b is the primary model — better reasoning and tool calling than llama3.1:8b-instruct while still running locally on modest hardware. Fallback: llama3.1:8b-instruct if qwen3:30b is not available. llama3.2 (3B) was found to hallucinate tool output consistently in testing.
What's Here
| Subsystem | Description |
|---|---|
| Timmy Agent | Agno-powered agent (Ollama default, AirLLM optional for 70B/405B) |
| Mission Control | FastAPI + HTMX dashboard — chat, health, swarm, marketplace |
| Spark | Event capture, predictions, memory consolidation, advisory |
| Infrastructure | WebSocket manager, notifications, events bus, LLM cascade router |
| Integrations | Telegram bridge, Siri Shortcuts, voice NLU, mobile layout |
| Brain | Identity system, memory interface |
Commands
make dev # start dashboard (http://localhost:8000)
make test # run all tests
make test-cov # tests + coverage report
make lint # run ruff/flake8
make docker-up # start via Docker
make help # see all commands
CLI tools: timmy, timmy-serve
Documentation
| Document | Purpose |
|---|---|
| CLAUDE.md | AI assistant development guide |
| AGENTS.md | Multi-agent development standards |
| .env.example | Configuration reference |
| docs/ | Architecture, ADRs, security audit, roadmap |
Configuration
cp .env.example .env
| Variable | Default | Purpose |
|---|---|---|
OLLAMA_URL |
http://localhost:11434 |
Ollama host |
OLLAMA_MODEL |
qwen3:30b |
Primary model for reasoning and tool calling. Fallback: llama3.1:8b-instruct |
DEBUG |
false |
Enable /docs and /redoc |
TIMMY_MODEL_BACKEND |
ollama |
ollama | airllm | auto |
AIRLLM_MODEL_SIZE |
70b |
8b | 70b | 405b |
L402_HMAC_SECRET |
(default — change in prod) | HMAC signing key for macaroons |
L402_MACAROON_SECRET |
(default — change in prod) | Macaroon secret |
LIGHTNING_BACKEND |
mock |
mock (production-ready) | lnd (scaffolded, not yet functional) |
Architecture
Browser / Phone
│ HTTP + HTMX + WebSocket
▼
┌─────────────────────────────────────────┐
│ FastAPI (dashboard.app) │
│ routes: agents, health, swarm, │
│ marketplace, voice, mobile │
└───┬─────────────┬──────────┬────────────┘
│ │ │
▼ ▼ ▼
Jinja2 Timmy Infrastructure
Templates Agent ├─ LLM Router (cascade)
(HTMX) │ ├─ WebSocket manager
├─ Ollama ├─ Notifications
└─ AirLLM └─ Events bus
│
├── Integrations (voice NLU, Telegram, Siri Shortcuts)
├── WebSocket live feed (ws_manager)
├── Push notifications (local + macOS native)
└── Spark (events, predictions, advisory)
Persistence: timmy.db (Agno memory), data/swarm.db (registry + tasks)
External: Ollama :11434, optional Redis, optional LND gRPC
Project Layout
src/
config.py # pydantic-settings — all env vars live here
timmy/ # Core agent, personas, agent interface, semantic memory
dashboard/ # FastAPI app, routes, Jinja2 templates
infrastructure/ # WebSocket, notifications, events, LLM router
integrations/ # Discord, Telegram, Siri Shortcuts, voice NLU
spark/ # Intelligence engine — events, predictions, advisory
brain/ # Identity system, memory interface
timmy_serve/ # API server, TTS, inter-agent communication
tests/ # one test file per module, all mocked
static/style.css # Dark mission-control theme (JetBrains Mono)
docs/ # GitHub Pages landing page
AGENTS.md # AI agent development standards ← read this
.env.example # Environment variable reference
Makefile # Common dev commands
Mobile Access
The dashboard is fully mobile-optimized (iOS safe area, 44px touch targets, 16px input to prevent zoom, momentum scroll).
# Bind to your local network
uvicorn dashboard.app:app --host 0.0.0.0 --port 8000 --reload
# Find your IP
ipconfig getifaddr en0 # Wi-Fi on macOS
Open http://<your-ip>:8000 on your phone (same Wi-Fi network).
Mobile-specific routes:
/mobile— single-column optimized layout/mobile-test— 21-scenario HITL test harness (layout, touch, scroll, notch)
AirLLM — Big Brain Backend
Run 70B or 405B models locally with no GPU, using AirLLM's layer-by-layer loading. Apple Silicon uses MLX automatically.
pip install ".[bigbrain]"
pip install "airllm[mlx]" # Apple Silicon only
timmy chat "Explain self-custody" --backend airllm --model-size 70b
Or set once in .env:
TIMMY_MODEL_BACKEND=auto
AIRLLM_MODEL_SIZE=70b
| Flag | Parameters | RAM needed |
|---|---|---|
8b |
8 billion | ~16 GB |
70b |
70 billion | ~140 GB |
405b |
405 billion | ~810 GB |
CLI
timmy chat "What is sovereignty?"
timmy think "Bitcoin and self-custody"
timmy status
timmy-serve start # L402-gated API server (port 8402)
timmy-serve invoice # generate a Lightning invoice
timmy-serve status
Or with the bootstrap script (creates venv, tests, watchdog, server in one shot):
bash scripts/activate_self_tdd.sh
bash scripts/activate_self_tdd.sh --big-brain # also installs AirLLM
Troubleshooting
ollama: command not found—brew install ollamaor ollama.comconnection refused— runollama servefirstModuleNotFoundError—source .venv/bin/activate && make install- Health panel shows DOWN — Ollama isn't running; chat returns offline message
Roadmap
| Version | Name | Status |
|---|---|---|
| 1.0 | Genesis | Complete — Agno + Ollama + SQLite + Dashboard |
| 2.0 | Exodus | In progress — Voice + Marketplace + Integrations |
| 3.0 | Revelation | Planned — Lightning treasury + single .app bundle |