Sovereign telemetry stack: Prometheus + Grafana + exporter

- Docker Compose with 3 containers (prometheus, grafana, exporter)
- Custom Python exporter reads Gitea API, heartbeat JSONL, inference metrics, model health
- Sovereignty score: 7 dimensions, weighted composite (0-100)
- Top 5 common sense metrics: commit velocity, issues/PRs, heartbeat, inference, model health
- Grafana auto-provisioned with 'Timmy Sovereign Command' dashboard
- Grafana on :3033, Prometheus on :9090, Exporter on :9101

Refs: #590, #594
Author: perplexity · 2026-03-27 01:19:13 +00:00 · commit 7969842132
9 changed files with 1190 additions and 0 deletions

README.md (new file, 83 lines)
@@ -0,0 +1,83 @@
# Timmy Telemetry — Sovereign Monitoring Stack
Prometheus + Grafana + custom exporter for the Timmy Foundation infrastructure.
Zero cloud dependencies. Runs entirely on Hermes.
## What You Get
### Sovereignty Index (0-100)
Composite score across 7 dimensions:

| Dimension | What it measures | Weight |
|---|---|---|
| **Inference** | % of model calls going local vs cloud | 25% |
| **Compute** | Local GPU/CPU vs rented cloud | 20% |
| **Data** | Local storage (Gitea/SQLite/JSONL) vs cloud | 15% |
| **Infrastructure** | Self-hosted services vs SaaS | 15% |
| **Tools** | Local tools vs cloud-dependent ones | 15% |
| **Identity** | Self-managed keys vs platform accounts | 5% |
| **Financial** | Lightning/Cashu vs fiat rails | 5% |
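
The weighted composite from the table above can be sketched as a few lines of Python (a minimal illustration, not the exporter's exact code; the dimension scores passed in are hypothetical):

```python
# Weights mirror the table above; they sum to 1.0.
WEIGHTS = {
    "inference": 0.25, "compute": 0.20, "data": 0.15,
    "infrastructure": 0.15, "tools": 0.15, "identity": 0.05,
    "financial": 0.05,
}

def composite(dimension_scores: dict) -> int:
    """Weighted average of per-dimension scores (each 0-100); missing dimensions count as 0."""
    return round(sum(dimension_scores.get(d, 0) * w for d, w in WEIGHTS.items()))

# Illustrative scores: strong local inference, identity/financial still weak
print(composite({"inference": 90, "compute": 70, "data": 75,
                 "infrastructure": 70, "tools": 63, "identity": 30,
                 "financial": 10}))  # → 70
```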
### Top 5 Common Sense Metrics
1. **Commit Velocity** — commits/24h per repo (from Gitea API)
2. **Issues & PRs** — open counts, burn rate (from Gitea API)
3. **Heartbeat Health** — tick rate, consecutive OK, last tick age (from `~/.timmy/heartbeat/`)
4. **Local Inference** — calls today, success rate, avg response length (from `~/.timmy/metrics/`)
5. **Model Health** — Ollama up/down, API responding, inference OK, models loaded (from `~/.hermes/model_health.json`)
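
Metrics 3-5 come from JSONL/JSON files on disk. A minimal sketch of the per-model aggregation for metric 4, assuming records with the `model`, `success`, and `response_len` fields the exporter reads:

```python
import json

def aggregate(lines):
    """Fold inference JSONL records into per-model call/success/length stats."""
    stats = {}
    for line in lines:
        if not line.strip():
            continue
        r = json.loads(line)
        s = stats.setdefault(r.get("model", "unknown"),
                             {"calls": 0, "successes": 0, "resp_len": 0})
        s["calls"] += 1
        if r.get("success"):
            s["successes"] += 1
        s["resp_len"] += r.get("response_len", 0)
    return stats

sample = [
    '{"model": "llama3", "success": true, "response_len": 120}',
    '{"model": "llama3", "success": false, "response_len": 0}',
]
print(aggregate(sample))  # llama3: 2 calls, 1 success
```

Success rate is then `successes / calls`, and average response length divides total length by successful calls.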
### Bonus
- Agent activity (commits per agent: claude, gemini, kimi, grok, perplexity, Timmy)
- DPO training pairs staged
- Sovereignty trend over time
## Deploy on Hermes
```bash
# Clone to Hermes
cd ~/.timmy
git clone <this-repo> timmy-telemetry
cd timmy-telemetry
# Set your Gitea token (or it uses the default)
export GITEA_TOKEN=130c3811bf4bae84a724cc165d677573d0f89636
# Launch
docker compose up -d
```
## Access
| Service | URL | Credentials |
|---|---|---|
| Grafana | http://localhost:3033 | admin / timmy |
| Prometheus | http://localhost:9090 | — |
| Exporter | http://localhost:9101/metrics | — |
Grafana auto-loads the "Timmy Sovereign Command" dashboard on first boot.
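
To spot-check the exporter, you can pull a gauge out of the Prometheus text exposition format by hand (a sketch; in practice the text would come from `requests.get("http://localhost:9101/metrics").text`):

```python
def read_gauge(metrics_text: str, name: str):
    """Pull a single gauge value out of Prometheus text exposition output."""
    for line in metrics_text.splitlines():
        # Match either an unlabeled sample or a labeled one
        if line.startswith(name + " ") or line.startswith(name + "{"):
            return float(line.rsplit(" ", 1)[-1])
    return None

sample = ("# HELP timmy_sovereignty_score Composite sovereignty index (0-100)\n"
          "# TYPE timmy_sovereignty_score gauge\n"
          "timmy_sovereignty_score 47.0\n")
print(read_gauge(sample, "timmy_sovereignty_score"))  # → 47.0
```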
## Architecture
```
┌──────────────┐ scrape/30s ┌──────────────┐
│ Prometheus │◄────────────────── │ Exporter │
│ :9090 │ │ :9101 │
└──────┬───────┘ └──────┬───────┘
│ │ reads
│ ┌───────┴────────┐
┌──────▼───────┐ │ ~/.timmy/ │
│ Grafana │ │ heartbeat/ │
│ :3033 │ │ metrics/ │
│ │ │ ~/.hermes/ │
│ Dashboards │ │ model_health │
└──────────────┘ │ Gitea API │
└────────────────┘
```
## Sovereignty Score Logic
The score starts pessimistic and improves as you migrate off cloud:
- **Every local inference call** pushes the inference dimension up
- **Deploying Nostr identity** (#13) will unlock the identity dimension
- **Lightning/Cashu** (#554, #555) will unlock the financial dimension
- **Replacing cloud agent workers** with local models pushes compute + tools up
Target: 100 = fully sovereign, $0 cloud bill, no external dependencies.
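
For v1, the inference dimension is an estimate: cloud calls aren't logged yet, so the exporter assumes a fixed ~30% cloud overhead on top of the logged local calls. Sketched:

```python
def inference_dimension(local_calls: int, cloud_overhead: float = 0.3) -> int:
    """v1 estimate: cloud calls aren't logged, so assume fixed overhead on local calls."""
    if local_calls <= 0:
        return 0
    estimated_total = max(local_calls * (1 + cloud_overhead), 1)
    return min(100, round(local_calls / estimated_total * 100))

# e.g. one heartbeat tick every 10 min ≈ 144 local calls/day
print(inference_dimension(144))  # → 77
```

With a fixed overhead the score caps around 77; logging cloud calls directly (so the ratio is measured, not assumed) is what lets it approach 100.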

docker-compose.yml (new file, 57 lines)
@@ -0,0 +1,57 @@
# Timmy Telemetry Stack — Sovereign Monitoring
# Deploy on Hermes (Mac M3 Max) alongside Ollama + Huey
#
# cd ~/.timmy/timmy-telemetry && docker compose up -d
#
# Grafana: http://localhost:3033 (admin / timmy)
# Prometheus: http://localhost:9090
# Exporter: http://localhost:9101/metrics
services:
  prometheus:
    image: prom/prometheus:latest
    container_name: timmy-prometheus
    restart: unless-stopped
    ports:
      - "9090:9090"
    volumes:
      - ./prometheus/prometheus.yml:/etc/prometheus/prometheus.yml:ro
      - prometheus-data:/prometheus
    command:
      - "--config.file=/etc/prometheus/prometheus.yml"
      - "--storage.tsdb.retention.time=90d"
      - "--web.enable-lifecycle"

  grafana:
    image: grafana/grafana:latest
    container_name: timmy-grafana
    restart: unless-stopped
    ports:
      - "3033:3000"
    environment:
      - GF_SECURITY_ADMIN_USER=admin
      - GF_SECURITY_ADMIN_PASSWORD=timmy
      - GF_DASHBOARDS_DEFAULT_HOME_DASHBOARD_PATH=/var/lib/grafana/dashboards/sovereignty.json
    volumes:
      - ./grafana/provisioning:/etc/grafana/provisioning:ro
      - ./grafana/dashboards:/var/lib/grafana/dashboards:ro
      - grafana-data:/var/lib/grafana

  exporter:
    build: ./exporter
    container_name: timmy-exporter
    restart: unless-stopped
    ports:
      - "9101:9101"
    volumes:
      - ${TIMMY_HOME:-~/.timmy}:/data/timmy:ro
      - ${HERMES_HOME:-~/.hermes}:/data/hermes:ro
    environment:
      - GITEA_URL=http://143.198.27.163:3000
      - GITEA_TOKEN=${GITEA_TOKEN:-130c3811bf4bae84a724cc165d677573d0f89636}
      - TIMMY_DATA=/data/timmy
      - HERMES_DATA=/data/hermes

volumes:
  prometheus-data:
  grafana-data:

exporter/Dockerfile (new file, 7 lines)
@@ -0,0 +1,7 @@
FROM python:3.12-slim
WORKDIR /app
COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt
COPY exporter.py .
EXPOSE 9101
CMD ["python", "exporter.py"]

exporter/exporter.py (new file, 529 lines)
@@ -0,0 +1,529 @@
"""Timmy Telemetry Exporter — Prometheus metrics from sovereign infrastructure.
Reads existing data sources on Hermes and exposes them as /metrics:
- Gitea API → commit velocity, PR throughput, issue burn
- Heartbeat ticks → uptime, perception health
- Local inference JSONL → model call rates, latency proxy
- Model health JSON → Ollama status
- Sovereignty score → computed composite
Runs as a sidecar container, volumes mounted read-only.
"""
import json
import os
import time
import threading
from datetime import datetime, timezone, timedelta
from pathlib import Path

import requests
from prometheus_client import start_http_server, Gauge
# ── Config ───────────────────────────────────────────────────────────
GITEA_URL = os.environ.get("GITEA_URL", "http://143.198.27.163:3000")
GITEA_TOKEN = os.environ.get("GITEA_TOKEN", "")
TIMMY_DATA = Path(os.environ.get("TIMMY_DATA", "/data/timmy"))
HERMES_DATA = Path(os.environ.get("HERMES_DATA", "/data/hermes"))
REPOS = [
    "Timmy_Foundation/the-nexus",
    "Timmy_Foundation/timmy-config",
]
SCRAPE_INTERVAL = 30 # seconds
# ── Prometheus Metrics ───────────────────────────────────────────────
# == TOP 5 COMMON SENSE METRICS ==

# 1. Commit velocity (commits in last 24h per repo)
commits_24h = Gauge("timmy_commits_24h", "Commits in the last 24 hours", ["repo"])

# 2. Open issues / open PRs
open_issues = Gauge("timmy_open_issues", "Open issue count", ["repo"])
open_prs = Gauge("timmy_open_prs", "Open PR count", ["repo"])

# 3. Heartbeat health (ticks in last hour, consecutive healthy ticks)
heartbeat_ticks_1h = Gauge("timmy_heartbeat_ticks_1h", "Heartbeat ticks recorded in the last hour")
heartbeat_consecutive_healthy = Gauge("timmy_heartbeat_consecutive_healthy", "Consecutive heartbeat ticks with severity=ok")
heartbeat_last_tick_age_seconds = Gauge("timmy_heartbeat_last_tick_age_seconds", "Seconds since last heartbeat tick")

# 4. Local inference stats (calls today, success rate)
inference_calls_today = Gauge("timmy_inference_calls_today", "Local model inference calls today", ["model"])
inference_success_rate = Gauge("timmy_inference_success_rate", "Local inference success rate (0-1) today", ["model"])
inference_avg_response_len = Gauge("timmy_inference_avg_response_len", "Average response length (chars) today", ["model"])

# 5. Model health (is Ollama up, is inference working)
ollama_up = Gauge("timmy_ollama_up", "Whether Ollama process is running (1=yes, 0=no)")
ollama_api_up = Gauge("timmy_ollama_api_up", "Whether Ollama API is responding (1=yes, 0=no)")
ollama_inference_ok = Gauge("timmy_ollama_inference_ok", "Whether local inference smoke test passed (1=yes, 0=no)")
models_loaded_count = Gauge("timmy_models_loaded", "Number of models loaded in Ollama")

# == SOVEREIGNTY SCORE ==
sovereignty_score = Gauge("timmy_sovereignty_score", "Composite sovereignty index (0-100)")
sovereignty_dimension = Gauge("timmy_sovereignty_dimension", "Individual sovereignty dimension score (0-100)", ["dimension"])

# == BONUS: Gitea agent activity ==
agent_commits_24h = Gauge("timmy_agent_commits_24h", "Commits by agent in last 24h", ["agent"])
agent_prs_open = Gauge("timmy_agent_prs_open", "Open PRs by agent", ["agent"])
dpo_pairs_staged = Gauge("timmy_dpo_pairs_staged", "DPO training pair files exported")
alerts_today = Gauge("timmy_alerts_today", "Heartbeat alerts logged today")
gitea_up_gauge = Gauge("timmy_gitea_up", "Whether Gitea API is reachable (1=yes, 0=no)")
# ── Data Collection Functions ────────────────────────────────────────
def gitea_get(path, params=None):
    """GET from Gitea API with token auth; returns parsed JSON or None on failure."""
    headers = {"Authorization": f"token {GITEA_TOKEN}"} if GITEA_TOKEN else {}
    try:
        r = requests.get(
            f"{GITEA_URL}/api/v1{path}",
            headers=headers,
            params=params or {},
            timeout=10,
        )
        r.raise_for_status()
        return r.json()
    except Exception:
        return None
def collect_gitea_metrics():
    """Pull commit velocity, issue/PR counts, agent activity from Gitea."""
    since_24h = (datetime.now(timezone.utc) - timedelta(hours=24)).isoformat()
    agents = ["claude", "gemini", "kimi", "grok", "perplexity", "Timmy", "Rockachopa"]
    agent_commit_counts = {a: 0 for a in agents}
    agent_pr_counts = {a: 0 for a in agents}
    gitea_alive = False
    for repo in REPOS:
        # Commits in last 24h
        commits = gitea_get(f"/repos/{repo}/commits", {"since": since_24h, "limit": 50})
        if commits is not None:
            gitea_alive = True
            commits_24h.labels(repo=repo).set(len(commits))
            # Count per-agent commits
            for c in commits:
                author = c.get("commit", {}).get("author", {}).get("name", "")
                for agent in agents:
                    if agent.lower() in author.lower():
                        agent_commit_counts[agent] += 1
        else:
            commits_24h.labels(repo=repo).set(0)
        # Open issues (first page only — counts above 50 are truncated)
        issues = gitea_get(f"/repos/{repo}/issues", {"state": "open", "type": "issues", "limit": 50})
        if issues is not None:
            open_issues.labels(repo=repo).set(len(issues))
        # Open PRs
        prs = gitea_get(f"/repos/{repo}/pulls", {"state": "open", "limit": 50})
        if prs is not None:
            open_prs.labels(repo=repo).set(len(prs))
            # Count per-agent PRs
            for pr in prs:
                author = pr.get("user", {}).get("login", "")
                for agent in agents:
                    if agent.lower() == author.lower():
                        agent_pr_counts[agent] += 1
    # Set agent gauges from the accumulated counts
    for agent, count in agent_commit_counts.items():
        agent_commits_24h.labels(agent=agent).set(count)
    for agent, count in agent_pr_counts.items():
        agent_prs_open.labels(agent=agent).set(count)
    gitea_up_gauge.set(1 if gitea_alive else 0)
    return gitea_alive
def collect_heartbeat_metrics():
    """Read heartbeat tick JSONL files from ~/.timmy/heartbeat/."""
    tick_dir = TIMMY_DATA / "heartbeat"
    today = datetime.now(timezone.utc).strftime("%Y%m%d")
    tick_file = tick_dir / f"ticks_{today}.jsonl"
    if not tick_file.exists():
        heartbeat_ticks_1h.set(0)
        heartbeat_consecutive_healthy.set(0)
        heartbeat_last_tick_age_seconds.set(9999)
        alerts_today.set(0)
        return
    ticks = []
    alert_count = 0
    for line in tick_file.read_text().strip().split("\n"):
        if not line.strip():
            continue
        try:
            t = json.loads(line)
            ticks.append(t)
            actions = t.get("decision", {}).get("actions", [])
            alert_count += len(actions)
        except (json.JSONDecodeError, KeyError):
            continue
    alerts_today.set(alert_count)
    if not ticks:
        heartbeat_ticks_1h.set(0)
        heartbeat_consecutive_healthy.set(0)
        heartbeat_last_tick_age_seconds.set(9999)
        return
    # Ticks in last hour
    one_hour_ago = datetime.now(timezone.utc) - timedelta(hours=1)
    recent = 0
    for t in ticks:
        try:
            ts = datetime.fromisoformat(t["timestamp"].replace("Z", "+00:00"))
            if ts > one_hour_ago:
                recent += 1
        except (KeyError, ValueError):
            pass
    heartbeat_ticks_1h.set(recent)
    # Consecutive healthy (severity == "ok", counted back from the most recent tick)
    consecutive = 0
    for t in reversed(ticks):
        severity = t.get("decision", {}).get("severity", "unknown")
        if severity == "ok":
            consecutive += 1
        else:
            break
    heartbeat_consecutive_healthy.set(consecutive)
    # Age of last tick
    try:
        last_ts = datetime.fromisoformat(ticks[-1]["timestamp"].replace("Z", "+00:00"))
        age = (datetime.now(timezone.utc) - last_ts).total_seconds()
        heartbeat_last_tick_age_seconds.set(age)
    except (KeyError, ValueError):
        heartbeat_last_tick_age_seconds.set(9999)
def collect_inference_metrics():
    """Read local inference JSONL from ~/.timmy/metrics/."""
    metrics_dir = TIMMY_DATA / "metrics"
    today = datetime.now(timezone.utc).strftime("%Y%m%d")
    metrics_file = metrics_dir / f"local_{today}.jsonl"
    if not metrics_file.exists():
        return
    # Aggregate per model
    model_stats = {}  # model -> {calls, successes, total_resp_len}
    for line in metrics_file.read_text().strip().split("\n"):
        if not line.strip():
            continue
        try:
            record = json.loads(line)
            model = record.get("model", "unknown")
            if model not in model_stats:
                model_stats[model] = {"calls": 0, "successes": 0, "total_resp_len": 0}
            model_stats[model]["calls"] += 1
            if record.get("success"):
                model_stats[model]["successes"] += 1
            model_stats[model]["total_resp_len"] += record.get("response_len", 0)
        except (json.JSONDecodeError, KeyError):
            continue
    for model, stats in model_stats.items():
        inference_calls_today.labels(model=model).set(stats["calls"])
        rate = stats["successes"] / stats["calls"] if stats["calls"] > 0 else 0
        inference_success_rate.labels(model=model).set(round(rate, 3))
        avg_len = stats["total_resp_len"] / stats["successes"] if stats["successes"] > 0 else 0
        inference_avg_response_len.labels(model=model).set(round(avg_len))
def collect_model_health():
    """Read model health from ~/.hermes/model_health.json."""
    health_file = HERMES_DATA / "model_health.json"
    if not health_file.exists():
        ollama_up.set(0)
        ollama_api_up.set(0)
        ollama_inference_ok.set(0)
        models_loaded_count.set(0)
        return
    try:
        h = json.loads(health_file.read_text())
        ollama_up.set(1 if h.get("ollama_running") else 0)
        ollama_api_up.set(1 if h.get("api_responding") else 0)
        ollama_inference_ok.set(1 if h.get("inference_ok") else 0)
        models_loaded_count.set(len(h.get("models_loaded", [])))
    except (json.JSONDecodeError, KeyError):
        ollama_up.set(0)
        ollama_api_up.set(0)
        ollama_inference_ok.set(0)
        models_loaded_count.set(0)
def collect_dpo_metrics():
    """Count DPO training pairs staged."""
    dpo_dir = TIMMY_DATA / "training-data" / "dpo-pairs"
    if dpo_dir.exists():
        dpo_pairs_staged.set(len(list(dpo_dir.glob("*.json"))))
    else:
        dpo_pairs_staged.set(0)
# ── Sovereignty Score ────────────────────────────────────────────────
def compute_sovereignty_score():
    """Compute composite sovereignty score from available data.
    Dimensions (each 0-100):
      1. Inference sovereignty — % of calls going to local models
      2. Data sovereignty — all data stored locally (Gitea self-hosted, SQLite, JSONL)
      3. Compute sovereignty — local CPU/GPU utilization vs cloud
      4. Infrastructure sovereignty — self-hosted services vs SaaS
      5. Identity sovereignty — Nostr/self-managed keys vs platform accounts
      6. Financial sovereignty — Lightning/Cashu vs fiat payment rails
      7. Tool sovereignty — self-hosted tools vs cloud SaaS
    For v1, we compute what we CAN measure and estimate the rest.
    """
    scores = {}

    # 1. INFERENCE SOVEREIGNTY
    # Read today's inference log. Cloud calls aren't logged yet, so we
    # estimate the local-vs-cloud ratio from known usage.
    metrics_dir = TIMMY_DATA / "metrics"
    today = datetime.now(timezone.utc).strftime("%Y%m%d")
    metrics_file = metrics_dir / f"local_{today}.jsonl"
    local_calls = 0
    if metrics_file.exists():
        for line in metrics_file.read_text().strip().split("\n"):
            if not line.strip():
                continue
            try:
                r = json.loads(line)
                if r.get("success"):
                    local_calls += 1
            except (json.JSONDecodeError, KeyError):
                pass
    # Known cloud dependencies: Anthropic (rate-limited, occasional),
    # Gemini/Groq agents, Perplexity. Estimate 30% cloud for now.
    # As local models improve, this should trend toward 100.
    if local_calls > 0:
        # Rough estimate: each heartbeat tick = 1 local call (every 10 min = 144/day).
        # Agent workers use cloud. Assume 30% cloud overhead for now.
        estimated_total = max(local_calls * 1.3, 1)
        scores["inference"] = min(100, round((local_calls / estimated_total) * 100))
    else:
        scores["inference"] = 0

    # 2. DATA SOVEREIGNTY
    # Gitea is self-hosted ✓, SQLite for Huey ✓, JSONL for metrics ✓
    # Minus: some conversation history lives in Anthropic/Gemini cloud sessions
    scores["data"] = 75  # High — Gitea+SQLite+JSONL all local. Cloud sessions are the gap.

    # 3. COMPUTE SOVEREIGNTY
    # Mac M3 Max runs Ollama locally ✓, heartbeat uses a local model ✓,
    # but agent workers (gemini, grok) still call cloud APIs.
    health_file = HERMES_DATA / "model_health.json"
    ollama_running = False
    if health_file.exists():
        try:
            h = json.loads(health_file.read_text())
            ollama_running = h.get("inference_ok", False)
        except Exception:
            pass
    # Base 40 for having Ollama + local models, +30 if inference is working.
    # Cloud agents are still in use, so this tops out at 70 until they are
    # replaced with local models.
    scores["compute"] = 40 + (30 if ollama_running else 0)

    # 4. INFRASTRUCTURE SOVEREIGNTY
    # Gitea self-hosted ✓, Huey local ✓, Ollama local ✓
    # VPS for Gitea is rented (DigitalOcean) — not fully sovereign
    # Prometheus+Grafana (this stack) adds sovereignty ✓
    scores["infrastructure"] = 70  # VPS is rented, everything else is self-managed

    # 5. IDENTITY SOVEREIGNTY
    # Gitea accounts are self-managed ✓
    # Nostr identity not yet deployed (see issue #13)
    # Still using platform accounts (GitHub, Anthropic, Google) for some things
    scores["identity"] = 30  # Self-hosted Gitea accounts, but Nostr not live yet

    # 6. FINANCIAL SOVEREIGNTY
    # Lightning/Cashu not deployed yet (issues #554, #555)
    # Currently paying for cloud APIs with credit card
    scores["financial"] = 10  # Lightning+Cashu are tickets, not live yet

    # 7. TOOL SOVEREIGNTY
    # Hermes Agent harness ✓, Huey ✓, Gitea ✓, Ollama ✓
    # Still depends on: aider (cloud), opencode (cloud), Docker Hub
    tool_count_local = 5  # hermes-agent, huey, gitea, ollama, prometheus/grafana
    tool_count_cloud = 3  # aider (gemini), opencode (grok), docker hub
    total_tools = tool_count_local + tool_count_cloud
    scores["tools"] = round((tool_count_local / total_tools) * 100) if total_tools > 0 else 0

    # Set individual dimension gauges
    for dim, score in scores.items():
        sovereignty_dimension.labels(dimension=dim).set(score)

    # Composite: weighted average
    weights = {
        "inference": 0.25,  # Most important — this is the core work
        "data": 0.15,
        "compute": 0.20,
        "infrastructure": 0.15,
        "identity": 0.05,
        "financial": 0.05,
        "tools": 0.15,
    }
    composite = sum(scores.get(d, 0) * w for d, w in weights.items())
    sovereignty_score.set(round(composite))
    return scores, round(composite)
# ── Main Collection Loop ─────────────────────────────────────────────
def collect_all():
    """Run all collectors, isolating failures so one bad source can't stall the rest."""
    try:
        collect_gitea_metrics()
    except Exception as e:
        print(f"[WARN] Gitea collection failed: {e}")
    try:
        collect_heartbeat_metrics()
    except Exception as e:
        print(f"[WARN] Heartbeat collection failed: {e}")
    try:
        collect_inference_metrics()
    except Exception as e:
        print(f"[WARN] Inference collection failed: {e}")
    try:
        collect_model_health()
    except Exception as e:
        print(f"[WARN] Model health collection failed: {e}")
    try:
        collect_dpo_metrics()
    except Exception as e:
        print(f"[WARN] DPO collection failed: {e}")
    try:
        scores, composite = compute_sovereignty_score()
        print(f"[INFO] Sovereignty score: {composite} | {scores}")
    except Exception as e:
        print(f"[WARN] Sovereignty computation failed: {e}")
def collection_loop():
    """Background thread that refreshes metrics every SCRAPE_INTERVAL."""
    while True:
        collect_all()
        time.sleep(SCRAPE_INTERVAL)

if __name__ == "__main__":
    print("🔭 Timmy Telemetry Exporter starting on :9101")
    print(f"   Gitea: {GITEA_URL}")
    print(f"   Timmy data: {TIMMY_DATA}")
    print(f"   Hermes data: {HERMES_DATA}")
    # Initial collection
    collect_all()
    # Start background collection thread
    t = threading.Thread(target=collection_loop, daemon=True)
    t.start()
    # Start Prometheus HTTP server
    start_http_server(9101)
    print("🟢 Exporter ready — http://localhost:9101/metrics")
    # Block forever
    while True:
        time.sleep(3600)

exporter/requirements.txt (new file, 2 lines)
@@ -0,0 +1,2 @@
prometheus_client==0.21.1
requests==2.32.3

grafana/dashboards/sovereignty.json (new file, 470 lines)
@@ -0,0 +1,470 @@
{
"annotations": { "list": [] },
"editable": true,
"fiscalYearStartMonth": 0,
"graphTooltip": 1,
"id": null,
"links": [],
"panels": [
{
"id": 1,
"title": "⚡ SOVEREIGNTY INDEX",
"description": "Composite sovereignty score across all dimensions (0-100)",
"type": "gauge",
"gridPos": { "h": 8, "w": 6, "x": 0, "y": 0 },
"targets": [
{
"expr": "timmy_sovereignty_score",
"legendFormat": "Sovereignty",
"refId": "A"
}
],
"fieldConfig": {
"defaults": {
"color": { "mode": "thresholds" },
"thresholds": {
"mode": "absolute",
"steps": [
{ "color": "#ff0000", "value": null },
{ "color": "#ff4500", "value": 20 },
{ "color": "#ff8c00", "value": 40 },
{ "color": "#ffd700", "value": 60 },
{ "color": "#32cd32", "value": 80 },
{ "color": "#00ff00", "value": 95 }
]
},
"min": 0,
"max": 100,
"unit": "none"
},
"overrides": []
},
"options": {
"reduceOptions": { "calcs": ["lastNotNull"] },
"showThresholdLabels": false,
"showThresholdMarkers": true,
"orientation": "auto"
}
},
{
"id": 2,
"title": "Sovereignty Dimensions",
"description": "Individual sovereignty scores per dimension",
"type": "bargauge",
"gridPos": { "h": 8, "w": 10, "x": 6, "y": 0 },
"targets": [
{
"expr": "timmy_sovereignty_dimension",
"legendFormat": "{{dimension}}",
"refId": "A"
}
],
"fieldConfig": {
"defaults": {
"color": { "mode": "thresholds" },
"thresholds": {
"mode": "absolute",
"steps": [
{ "color": "#ff4500", "value": null },
{ "color": "#ff8c00", "value": 30 },
{ "color": "#ffd700", "value": 50 },
{ "color": "#32cd32", "value": 70 },
{ "color": "#00ff00", "value": 90 }
]
},
"min": 0,
"max": 100,
"unit": "none"
},
"overrides": []
},
"options": {
"reduceOptions": { "calcs": ["lastNotNull"] },
"displayMode": "gradient",
"orientation": "horizontal",
"showUnfilled": true,
"minVizWidth": 0,
"minVizHeight": 10,
"valueMode": "color"
}
},
{
"id": 3,
"title": "System Status",
"description": "Core infrastructure health at a glance",
"type": "stat",
"gridPos": { "h": 8, "w": 8, "x": 16, "y": 0 },
"targets": [
{
"expr": "timmy_ollama_up",
"legendFormat": "Ollama",
"refId": "A"
},
{
"expr": "timmy_ollama_inference_ok",
"legendFormat": "Inference",
"refId": "B"
},
{
"expr": "timmy_gitea_up",
"legendFormat": "Gitea",
"refId": "C"
},
{
"expr": "timmy_heartbeat_ticks_1h",
"legendFormat": "Heartbeat (1h)",
"refId": "D"
}
],
"fieldConfig": {
"defaults": {
"mappings": [
{ "options": { "0": { "text": "DOWN", "color": "red" }, "1": { "text": "UP", "color": "green" } }, "type": "value" }
],
"thresholds": {
"mode": "absolute",
"steps": [
{ "color": "red", "value": null },
{ "color": "green", "value": 1 }
]
}
},
"overrides": [
{
"matcher": { "id": "byName", "options": "Heartbeat (1h)" },
"properties": [
{ "id": "mappings", "value": [] },
{ "id": "thresholds", "value": { "mode": "absolute", "steps": [{ "color": "red", "value": null }, { "color": "yellow", "value": 1 }, { "color": "green", "value": 4 }] } }
]
}
]
},
"options": {
"reduceOptions": { "calcs": ["lastNotNull"] },
"colorMode": "background",
"graphMode": "none",
"justifyMode": "auto",
"textMode": "auto",
"wideLayout": true,
"showPercentChange": false
}
},
{
"id": 10,
"title": "① Commit Velocity (24h)",
"description": "Commits across repos in the last 24 hours",
"type": "timeseries",
"gridPos": { "h": 7, "w": 12, "x": 0, "y": 8 },
"targets": [
{
"expr": "timmy_commits_24h",
"legendFormat": "{{repo}}",
"refId": "A"
}
],
"fieldConfig": {
"defaults": {
"color": { "mode": "palette-classic" },
"custom": {
"drawStyle": "line",
"lineWidth": 2,
"fillOpacity": 20,
"pointSize": 5,
"showPoints": "auto",
"spanNulls": false
},
"unit": "short"
},
"overrides": []
},
"options": {
"legend": { "displayMode": "list", "placement": "bottom" },
"tooltip": { "mode": "multi" }
}
},
{
"id": 11,
"title": "② Issues & PRs",
"description": "Open issues and PRs per repo",
"type": "bargauge",
"gridPos": { "h": 7, "w": 12, "x": 12, "y": 8 },
"targets": [
{
"expr": "timmy_open_issues",
"legendFormat": "Issues: {{repo}}",
"refId": "A"
},
{
"expr": "timmy_open_prs",
"legendFormat": "PRs: {{repo}}",
"refId": "B"
}
],
"fieldConfig": {
"defaults": {
"color": { "mode": "palette-classic" },
"min": 0,
"unit": "short"
},
"overrides": []
},
"options": {
"reduceOptions": { "calcs": ["lastNotNull"] },
"displayMode": "gradient",
"orientation": "horizontal",
"showUnfilled": true
}
},
{
"id": 12,
"title": "③ Heartbeat Health",
"description": "Heartbeat tick rate and last tick age",
"type": "stat",
"gridPos": { "h": 7, "w": 8, "x": 0, "y": 15 },
"targets": [
{
"expr": "timmy_heartbeat_ticks_1h",
"legendFormat": "Ticks (1h)",
"refId": "A"
},
{
"expr": "timmy_heartbeat_consecutive_healthy",
"legendFormat": "Consecutive OK",
"refId": "B"
},
{
"expr": "timmy_heartbeat_last_tick_age_seconds",
"legendFormat": "Last Tick Age",
"refId": "C"
},
{
"expr": "timmy_alerts_today",
"legendFormat": "Alerts Today",
"refId": "D"
}
],
"fieldConfig": {
"defaults": {
"thresholds": {
"mode": "absolute",
"steps": [
{ "color": "red", "value": null },
{ "color": "green", "value": 1 }
]
}
},
"overrides": [
{
"matcher": { "id": "byName", "options": "Last Tick Age" },
"properties": [
{ "id": "unit", "value": "s" },
{ "id": "thresholds", "value": { "mode": "absolute", "steps": [{ "color": "green", "value": null }, { "color": "yellow", "value": 900 }, { "color": "red", "value": 1800 }] } }
]
},
{
"matcher": { "id": "byName", "options": "Alerts Today" },
"properties": [
{ "id": "thresholds", "value": { "mode": "absolute", "steps": [{ "color": "green", "value": null }, { "color": "yellow", "value": 1 }, { "color": "red", "value": 5 }] } }
]
}
]
},
"options": {
"reduceOptions": { "calcs": ["lastNotNull"] },
"colorMode": "background",
"graphMode": "none",
"textMode": "auto"
}
},
{
"id": 13,
"title": "④ Local Inference",
"description": "Local model call volume and success rate today",
"type": "timeseries",
"gridPos": { "h": 7, "w": 8, "x": 8, "y": 15 },
"targets": [
{
"expr": "timmy_inference_calls_today",
"legendFormat": "Calls: {{model}}",
"refId": "A"
},
{
"expr": "timmy_inference_success_rate * 100",
"legendFormat": "Success %: {{model}}",
"refId": "B"
}
],
"fieldConfig": {
"defaults": {
"color": { "mode": "palette-classic" },
"custom": {
"drawStyle": "line",
"lineWidth": 2,
"fillOpacity": 10,
"pointSize": 5,
"showPoints": "auto"
},
"unit": "short"
},
"overrides": []
},
"options": {
"legend": { "displayMode": "list", "placement": "bottom" },
"tooltip": { "mode": "multi" }
}
},
{
"id": 14,
"title": "⑤ Model Health",
"description": "Ollama status, models loaded, inference health",
"type": "stat",
"gridPos": { "h": 7, "w": 8, "x": 16, "y": 15 },
"targets": [
{
"expr": "timmy_ollama_up",
"legendFormat": "Ollama Process",
"refId": "A"
},
{
"expr": "timmy_ollama_api_up",
"legendFormat": "API Responding",
"refId": "B"
},
{
"expr": "timmy_ollama_inference_ok",
"legendFormat": "Inference OK",
"refId": "C"
},
{
"expr": "timmy_models_loaded",
"legendFormat": "Models Loaded",
"refId": "D"
},
{
"expr": "timmy_dpo_pairs_staged",
"legendFormat": "DPO Pairs Staged",
"refId": "E"
}
],
"fieldConfig": {
"defaults": {
"mappings": [
{ "options": { "0": { "text": "DOWN", "color": "red" }, "1": { "text": "UP", "color": "green" } }, "type": "value" }
],
"thresholds": {
"mode": "absolute",
"steps": [
{ "color": "red", "value": null },
{ "color": "green", "value": 1 }
]
}
},
"overrides": [
{
"matcher": { "id": "byName", "options": "Models Loaded" },
"properties": [
{ "id": "mappings", "value": [] },
{ "id": "thresholds", "value": { "mode": "absolute", "steps": [{ "color": "red", "value": null }, { "color": "green", "value": 1 }] } }
]
},
{
"matcher": { "id": "byName", "options": "DPO Pairs Staged" },
"properties": [
{ "id": "mappings", "value": [] },
{ "id": "thresholds", "value": { "mode": "absolute", "steps": [{ "color": "blue", "value": null }] } }
]
}
]
},
"options": {
"reduceOptions": { "calcs": ["lastNotNull"] },
"colorMode": "background",
"graphMode": "none",
"textMode": "auto"
}
},
{
"id": 20,
"title": "Agent Activity (24h Commits)",
"description": "Commits per agent in the last 24 hours",
"type": "bargauge",
"gridPos": { "h": 6, "w": 12, "x": 0, "y": 22 },
"targets": [
{
"expr": "timmy_agent_commits_24h",
"legendFormat": "{{agent}}",
"refId": "A"
}
],
"fieldConfig": {
"defaults": {
"color": { "mode": "palette-classic" },
"min": 0,
"unit": "short"
},
"overrides": []
},
"options": {
"reduceOptions": { "calcs": ["lastNotNull"] },
"displayMode": "gradient",
"orientation": "horizontal",
"showUnfilled": true
}
},
{
"id": 21,
"title": "Sovereignty Over Time",
"description": "Track the composite sovereignty score trending over time",
"type": "timeseries",
"gridPos": { "h": 6, "w": 12, "x": 12, "y": 22 },
"targets": [
{
"expr": "timmy_sovereignty_score",
"legendFormat": "Sovereignty Index",
"refId": "A"
}
],
"fieldConfig": {
"defaults": {
"color": { "fixedColor": "#7b68ee", "mode": "fixed" },
"custom": {
"drawStyle": "line",
"lineWidth": 3,
"fillOpacity": 30,
"gradientMode": "scheme",
"pointSize": 5,
"showPoints": "auto",
"spanNulls": true
},
"min": 0,
"max": 100,
"unit": "none",
"thresholds": {
"mode": "absolute",
"steps": [
{ "color": "red", "value": null },
{ "color": "orange", "value": 30 },
{ "color": "yellow", "value": 50 },
{ "color": "green", "value": 70 }
]
}
},
"overrides": []
},
"options": {
"legend": { "displayMode": "list", "placement": "bottom" },
"tooltip": { "mode": "single" }
}
}
],
"refresh": "30s",
"schemaVersion": 39,
"tags": ["timmy", "sovereignty"],
"templating": { "list": [] },
"time": { "from": "now-24h", "to": "now" },
"timepicker": {},
"timezone": "browser",
"title": "Timmy Sovereign Command",
"uid": "timmy-sovereign",
"version": 1
}

grafana/provisioning: dashboard provider config (new file, 11 lines)
@@ -0,0 +1,11 @@
apiVersion: 1

providers:
  - name: "Timmy Dashboards"
    orgId: 1
    folder: ""
    type: file
    disableDeletion: false
    updateIntervalSeconds: 30
    options:
      path: /var/lib/grafana/dashboards
      foldersFromFilesStructure: false

grafana/provisioning: Prometheus datasource config (new file, 8 lines)
@@ -0,0 +1,8 @@
apiVersion: 1

datasources:
  - name: Prometheus
    type: prometheus
    access: proxy
    url: http://prometheus:9090
    isDefault: true
    editable: true

prometheus/prometheus.yml (new file, 23 lines)
@@ -0,0 +1,23 @@
# Timmy Prometheus — Sovereign Metrics Collection
global:
  scrape_interval: 30s
  evaluation_interval: 30s

scrape_configs:
  # Timmy's custom exporter (Gitea, Huey, heartbeat, sovereignty)
  - job_name: "timmy-exporter"
    static_configs:
      - targets: ["exporter:9101"]
    scrape_interval: 30s

  # Prometheus self-monitoring
  - job_name: "prometheus"
    static_configs:
      - targets: ["localhost:9090"]

  # Ollama metrics (if exposed — Ollama serves on the host network)
  # Uncomment once Ollama is confirmed running with metrics
  # - job_name: "ollama"
  #   static_configs:
  #     - targets: ["host.docker.internal:11434"]
  #   metrics_path: /metrics