Compare commits
28 Commits
pre-agent-
...
gemini/iss
| Author | SHA1 | Date | |
|---|---|---|---|
| | 5d781a215d | | |
| | 4042a23441 | | |
| | 8f10b5fc92 | | |
| | fbd1b9e88f | | |
| | ea38041514 | | |
| | 579a775a0a | | |
| | 689a2331d5 | | |
| | 2ddda436a9 | | |
| | d72ae92189 | | |
| | 2384908be7 | | |
| | 82ba8896b3 | | |
| | 3b34faeb17 | | |
| | f9be0eb481 | | |
| | 383a969791 | | |
| | f46a4826d9 | | |
| | 3b1763ce4c | | |
| | 78f5216540 | | |
| | 49020b34d9 | | |
| | 7468a6d063 | | |
| | f9155b28e3 | | |
| | 16675abd79 | | |
| | 1fce489364 | | |
| | 7c7e19f6d2 | | |
| | 8fd451fb52 | | |
| | 0b63da1c9e | | |
| | 20532819e9 | | |
| | 27c1fb940d | | |
| | 56364e62b4 | | |
1
.gitignore
vendored
@@ -8,3 +8,4 @@
*.db-wal
*.db-shm
__pycache__/
.aider*
@@ -1,7 +1,7 @@
# DEPRECATED — Bash Loop Scripts Removed

**Date:** 2026-03-25
**Reason:** Replaced by sovereign-orchestration (SQLite + Python single-process executor)
**Reason:** Replaced by Hermes + timmy-config sidecar orchestration

## What was removed
- claude-loop.sh, gemini-loop.sh, agent-loop.sh
@@ -9,14 +9,15 @@
- nexus-merge-bot.sh, claudemax-watchdog.sh, timmy-loopstat.sh

## What replaces them
**Repo:** Timmy_Foundation/sovereign-orchestration
**Entry point:** `python3 src/sovereign_executor.py --workers 3 --poll 30`
**Features:** SQLite task queue, crash recovery, dedup, playbooks, MCP server
**Issues:** #29 (fix imports), #30 (deploy as service)
**Harness:** Hermes
**Overlay repo:** Timmy_Foundation/timmy-config
**Entry points:** `orchestration.py`, `tasks.py`, `deploy.sh`
**Features:** Huey + SQLite scheduling, local-model health checks, session export, DPO artifact staging

## Why
The bash loops crash-looped, produced zero work after relaunch, had no crash
recovery, no dedup, and required 8 separate scripts. The Python executor is
one process with SQLite durability.
recovery, no durable export path, and required too many ad hoc scripts. The
Hermes sidecar keeps orchestration close to Timmy's actual config and training
surfaces.

Do NOT recreate bash loops. If the executor is broken, fix the executor.
Do NOT recreate bash loops. If orchestration is broken, fix the Hermes sidecar.
26
README.md
@@ -2,7 +2,7 @@

Timmy's sovereign configuration. Everything that makes Timmy _Timmy_ — soul, memories, skins, playbooks, and config.

This repo is the canonical source of truth for Timmy's identity and operational state. Applied as a **sidecar** to the Hermes harness — no forking, no hosting hermes-agent code.
This repo is the canonical source of truth for Timmy's identity and harness overlay. Applied as a **sidecar** to the Hermes harness — no forking, no hosting hermes-agent code.

## Structure

@@ -14,22 +14,40 @@ timmy-config/
├── DEPRECATED.md               ← What was removed and why
├── config.yaml                 ← Hermes harness configuration
├── channel_directory.json      ← Platform channel mappings
├── bin/                        ← Utility scripts (NOT loops — see below)
├── bin/                        ← Live utility scripts (NOT deprecated loops)
│   ├── hermes-startup.sh       ← Hermes boot sequence
│   ├── agent-dispatch.sh       ← Manual agent dispatch
│   ├── ops-panel.sh            ← Ops dashboard panel
│   ├── ops-gitea.sh            ← Gitea ops helpers
│   ├── pipeline-freshness.sh   ← Session/export drift check
│   └── timmy-status.sh         ← Status check
├── memories/                   ← Persistent memory YAML
├── skins/                      ← UI skins (timmy skin)
├── playbooks/                  ← Agent playbooks (YAML)
└── cron/                       ← Cron job definitions
├── cron/                       ← Cron job definitions
└── training/                   ← Transitional training recipes, not canonical lived data
```

## Boundary

`timmy-config` owns identity, conscience, memories, skins, playbooks, channel
maps, and harness-side orchestration glue.

`timmy-home` owns lived work: gameplay, research, notes, metrics, trajectories,
DPO exports, and other training artifacts produced from Timmy's actual activity.

If a file answers "who is Timmy?" or "how does Hermes host him?", it belongs
here. If it answers "what has Timmy done or learned?" it belongs in
`timmy-home`.

The scripts in `bin/` are live operational helpers for the Hermes sidecar.
What is dead are the old long-running bash worker loops, not every script in
this repo.

## Orchestration: Huey

All orchestration (triage, PR review, dispatch) runs via [Huey](https://github.com/coleifer/huey) with SQLite.
`orchestration.py` (6 lines) + `tasks.py` (~70 lines) replace the entire sovereign-orchestration repo (3,846 lines).
`orchestration.py` + `tasks.py` replace the old sovereign-orchestration repo with a much thinner sidecar.

```bash
pip install huey
BIN
assets/Vassal Rising.mp3
Normal file
Binary file not shown.
62
autolora/manifest.yaml
Normal file
@@ -0,0 +1,62 @@
# Timmy Adapter Manifest
# Only version adapters, never base models. Base models are reproducible downloads.
# Adapters are the diff. The manifest is the record.

bases:
  hermes3-8b-4bit:
    source: mlx-community/Hermes-3-Llama-3.1-8B-4bit
    local: ~/models/Hermes-3-Llama-3.1-8B-4bit
    arch: llama3
    params: 8B
    quant: 4-bit MLX

  hermes4-14b-4bit:
    source: mlx-community/Hermes-4-14B-4bit
    local: ~/models/hermes4-14b-mlx
    arch: qwen3
    params: 14.8B
    quant: 4-bit MLX

adapters:
  timmy-v0:
    base: hermes3-8b-4bit
    date: 2026-03-24
    status: retired
    data: 1154 sessions (technical only, no crisis/pastoral)
    training: { lr: 2e-6, rank: 8, iters: 1000, best_iter: 800, val_loss: 2.134 }
    eval: { identity: PASS, sovereignty: PASS, coding: PASS, crisis: FAIL, faith: FAIL }
    notes: "First adapter. Crisis fails — data was 99% technical. Sacred rule: REJECTED."

  timmy-v0-nan-run1:
    base: hermes3-8b-4bit
    date: 2026-03-24
    status: rejected
    notes: "NaN at iter 70. lr=1e-5 too high for 4-bit. Dead on arrival."

  timmy-v0.1:
    base: hermes3-8b-4bit
    date: 2026-03-25
    status: retired
    data: 1203 train / 135 valid (enriched with 49 crisis/faith synthetic)
    training: { lr: 5e-6, rank: 8, iters: 600, val_loss: 2.026 }
    eval: { identity: PASS, sovereignty: PASS, coding: PASS, crisis: PARTIAL, faith: FAIL }
    notes: "Crisis partial — mentions seeking help but no 988/gospel. Rank 8 can't override base priors."

  timmy-v0.2:
    base: hermes3-8b-4bit
    date: 2026-03-25
    status: rejected
    data: 1214 train / 141 valid (12 targeted crisis/faith examples, 5x duplicated)
    training: { lr: 5e-6, rank: 16, iters: 800 }
    eval: "NaN at iter 100. Rank 16 + lr 5e-6 unstable on 4-bit."
    notes: "Dead. Halve lr when doubling rank."

  # NEXT
  timmy-v1.0:
    base: hermes4-14b-4bit
    date: 2026-03-26
    status: rejected
    data: 1125 train / 126 valid (same curated set, reused from 8B — NOT re-tokenized)
    training: { lr: 1e-6, rank: 16, iters: 800 }
    eval: "Val NaN iter 100, train NaN iter 160. Dead."
    notes: "Data was pre-truncated for Llama3 tokenizer, not Qwen3. Must re-run clean_data.py with 14B tokenizer before v1.1."
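
The `timmy-v0.2` note ("Halve lr when doubling rank") reads as an inverse-proportional rule of thumb. A minimal sketch of that rule follows, assuming the scaling is applied relative to a known-stable run; `scaled_lr` is a hypothetical helper, not part of the repo. The manifest itself shows the rule is not sufficient on its own: `timmy-v1.0` ran rank 16 at lr 1e-6 and still hit NaN, which its notes attribute to a tokenizer mismatch.

```python
import math


def scaled_lr(base_lr: float, base_rank: int, new_rank: int) -> float:
    """Scale a learning rate inversely with LoRA rank.

    Hypothetical helper: encodes "halve lr when doubling rank" as
    lr_new = lr_base * (rank_base / rank_new). It is a starting point
    for a sweep, not a stability guarantee.
    """
    if base_rank <= 0 or new_rank <= 0:
        raise ValueError("ranks must be positive")
    return base_lr * base_rank / new_rank


# timmy-v0.1 was stable at lr=5e-6, rank=8; doubling the rank suggests 2.5e-6.
suggested = scaled_lr(5e-6, base_rank=8, new_rank=16)
print(f"suggested lr for rank 16: {suggested:.2e}")
assert math.isclose(suggested, 2.5e-6)
```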
42
bin/pipeline-freshness.sh
Executable file
@@ -0,0 +1,42 @@
#!/usr/bin/env bash

set -euo pipefail

SESSIONS_DIR="$HOME/.hermes/sessions"
EXPORT_DIR="$HOME/.timmy/training-data/dpo-pairs"

latest_session=$(find "$SESSIONS_DIR" -maxdepth 1 -name 'session_*.json' -type f -print 2>/dev/null | sort | tail -n 1)
latest_export=$(find "$EXPORT_DIR" -maxdepth 1 -name 'session_*.json' -type f -print 2>/dev/null | sort | tail -n 1)

echo "latest_session=${latest_session:-none}"
echo "latest_export=${latest_export:-none}"

if [ -z "${latest_session:-}" ]; then
    echo "status=ok"
    echo "reason=no sessions yet"
    exit 0
fi

if [ -z "${latest_export:-}" ]; then
    echo "status=lagging"
    echo "reason=no exports yet"
    exit 1
fi

session_mtime=$(stat -f '%m' "$latest_session")
export_mtime=$(stat -f '%m' "$latest_export")
lag_minutes=$(( (session_mtime - export_mtime) / 60 ))
if [ "$lag_minutes" -lt 0 ]; then
    lag_minutes=0
fi

echo "lag_minutes=$lag_minutes"

if [ "$lag_minutes" -gt 300 ]; then
    echo "status=lagging"
    echo "reason=exports more than 5 hours behind sessions"
    exit 1
fi

echo "status=ok"
echo "reason=exports within freshness window"
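
`stat -f '%m'` is the BSD/macOS form; GNU coreutils spells the same query `stat -c %Y`, so the script above is tied to macOS as written. If the check ever needs to run elsewhere, the lag rule could be sketched in Python as below. The names `newest` and `freshness` are hypothetical, and the 300-minute threshold is copied from the script.

```python
from pathlib import Path
from typing import Optional, Tuple


def newest(directory: Path, pattern: str = "session_*.json") -> Optional[Path]:
    """Return the last match in name order, approximating `sort | tail -n 1`."""
    if not directory.is_dir():
        return None
    matches = sorted(p for p in directory.glob(pattern) if p.is_file())
    return matches[-1] if matches else None


def freshness(session_mtime: float, export_mtime: float,
              max_lag_minutes: int = 300) -> Tuple[str, int]:
    """Apply the script's rule: clamp negative lag to 0, flag lag above the limit."""
    lag = max(0, int(session_mtime - export_mtime) // 60)
    status = "lagging" if lag > max_lag_minutes else "ok"
    return status, lag


if __name__ == "__main__":
    session = newest(Path.home() / ".hermes" / "sessions")
    export = newest(Path.home() / ".timmy" / "training-data" / "dpo-pairs")
    if session is None:
        print("status=ok reason=no sessions yet")
    elif export is None:
        print("status=lagging reason=no exports yet")
    else:
        status, lag = freshness(session.stat().st_mtime, export.stat().st_mtime)
        print(f"status={status} lag_minutes={lag}")
```

Using `Path.stat().st_mtime` sidesteps the `stat` flag difference between platforms.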
@@ -1,7 +1,7 @@
#!/usr/bin/env bash
# sync-up.sh — Push live ~/.hermes config changes UP to timmy-config repo.
# The harness is the source. The repo is the record.
# Run periodically or after significant config changes.
# Only commits when there are REAL changes (not empty syncs).

set -euo pipefail

@@ -12,31 +12,29 @@ log() { echo "[sync-up] $*"; }

# === Copy live config into repo ===
cp "$HERMES_HOME/config.yaml" "$REPO_DIR/config.yaml"
log "config.yaml"

# === Playbooks ===
for f in "$HERMES_HOME"/playbooks/*.yaml; do
    [ -f "$f" ] && cp "$f" "$REPO_DIR/playbooks/"
done
log "playbooks/"

# === Skins ===
for f in "$HERMES_HOME"/skins/*; do
    [ -f "$f" ] && cp "$f" "$REPO_DIR/skins/"
done
log "skins/"

# === Channel directory ===
[ -f "$HERMES_HOME/channel_directory.json" ] && cp "$HERMES_HOME/channel_directory.json" "$REPO_DIR/"
log "channel_directory.json"

# === Commit and push if there are changes ===
# === Only commit if there are real diffs ===
cd "$REPO_DIR"
if [ -n "$(git status --porcelain)" ]; then
    git add -A
    git commit -m "sync: live config from ~/.hermes $(date +%Y-%m-%d_%H:%M)"
    git push
    log "Pushed changes to Gitea."
else
git add -A

# Check if there are staged changes
if git diff --cached --quiet; then
    log "No changes to sync."
    exit 0
fi

# Build a meaningful commit message from what actually changed
CHANGED=$(git diff --cached --name-only | tr '\n' ', ' | sed 's/,$//')
git commit -m "config: update ${CHANGED}"
git push
log "Pushed: ${CHANGED}"
252
bin/timmy-dashboard
Executable file
@@ -0,0 +1,252 @@
#!/usr/bin/env python3
"""Timmy Model Dashboard — where are my models, what are they doing.

Usage:
    timmy-dashboard # one-shot
    timmy-dashboard --watch # live refresh every 30s
    timmy-dashboard --hours=48 # look back 48h
"""

import json
import os
import subprocess
import sys
import time
import urllib.request
from datetime import datetime, timezone, timedelta
from pathlib import Path

HERMES_HOME = Path.home() / ".hermes"
TIMMY_HOME = Path.home() / ".timmy"
METRICS_DIR = TIMMY_HOME / "metrics"

# ── Data Sources ──────────────────────────────────────────────────────

def get_ollama_models():
    try:
        req = urllib.request.Request("http://localhost:11434/api/tags")
        with urllib.request.urlopen(req, timeout=5) as resp:
            return json.loads(resp.read()).get("models", [])
    except Exception:
        return []


def get_loaded_models():
    try:
        req = urllib.request.Request("http://localhost:11434/api/ps")
        with urllib.request.urlopen(req, timeout=5) as resp:
            return json.loads(resp.read()).get("models", [])
    except Exception:
        return []


def get_huey_pid():
    try:
        r = subprocess.run(["pgrep", "-f", "huey_consumer"],
                           capture_output=True, text=True, timeout=5)
        return r.stdout.strip().split("\n")[0] if r.returncode == 0 else None
    except Exception:
        return None


def get_hermes_sessions():
    sessions_file = HERMES_HOME / "sessions" / "sessions.json"
    if not sessions_file.exists():
        return []
    try:
        data = json.loads(sessions_file.read_text())
        return list(data.values())
    except Exception:
        return []


def get_heartbeat_ticks(date_str=None):
    if not date_str:
        date_str = datetime.now().strftime("%Y%m%d")
    tick_file = TIMMY_HOME / "heartbeat" / f"ticks_{date_str}.jsonl"
    if not tick_file.exists():
        return []
    ticks = []
    for line in tick_file.read_text().strip().split("\n"):
        if not line.strip():
            continue
        try:
            ticks.append(json.loads(line))
        except Exception:
            continue
    return ticks


def get_local_metrics(hours=24):
    """Read local inference metrics from jsonl files."""
    records = []
    cutoff = datetime.now(timezone.utc) - timedelta(hours=hours)
    if not METRICS_DIR.exists():
        return records
    for f in sorted(METRICS_DIR.glob("local_*.jsonl")):
        for line in f.read_text().strip().split("\n"):
            if not line.strip():
                continue
            try:
                r = json.loads(line)
                ts = datetime.fromisoformat(r["timestamp"])
                if ts >= cutoff:
                    records.append(r)
            except Exception:
                continue
    return records


def get_cron_jobs():
    """Get Hermes cron job status."""
    try:
        r = subprocess.run(
            ["hermes", "cron", "list", "--json"],
            capture_output=True, text=True, timeout=10
        )
        if r.returncode == 0:
            return json.loads(r.stdout).get("jobs", [])
    except Exception:
        pass
    return []


# ── Rendering ─────────────────────────────────────────────────────────

DIM = "\033[2m"
BOLD = "\033[1m"
GREEN = "\033[32m"
YELLOW = "\033[33m"
RED = "\033[31m"
CYAN = "\033[36m"
RST = "\033[0m"
CLR = "\033[2J\033[H"


def render(hours=24):
    models = get_ollama_models()
    loaded = get_loaded_models()
    huey_pid = get_huey_pid()
    ticks = get_heartbeat_ticks()
    metrics = get_local_metrics(hours)
    sessions = get_hermes_sessions()

    loaded_names = {m.get("name", "") for m in loaded}
    now = datetime.now().strftime("%Y-%m-%d %H:%M:%S")

    print(CLR, end="")
    print(f"{BOLD}{'=' * 70}")
    print(f" TIMMY MODEL DASHBOARD")
    print(f" {now} | Huey: {GREEN}PID {huey_pid}{RST if huey_pid else f'{RED}DOWN{RST}'}")
    print(f"{'=' * 70}{RST}")

    # ── LOCAL MODELS ──
    print(f"\n {BOLD}LOCAL MODELS (Ollama){RST}")
    print(f" {DIM}{'-' * 55}{RST}")
    if models:
        for m in models:
            name = m.get("name", "?")
            size_gb = m.get("size", 0) / 1e9
            if name in loaded_names:
                status = f"{GREEN}IN VRAM{RST}"
            else:
                status = f"{DIM}on disk{RST}"
            print(f" {name:35s} {size_gb:5.1f}GB {status}")
    else:
        print(f" {RED}(Ollama not responding){RST}")

    # ── LOCAL INFERENCE ACTIVITY ──
    print(f"\n {BOLD}LOCAL INFERENCE ({len(metrics)} calls, last {hours}h){RST}")
    print(f" {DIM}{'-' * 55}{RST}")
    if metrics:
        by_caller = {}
        for r in metrics:
            caller = r.get("caller", "unknown")
            if caller not in by_caller:
                by_caller[caller] = {"count": 0, "success": 0, "errors": 0}
            by_caller[caller]["count"] += 1
            if r.get("success"):
                by_caller[caller]["success"] += 1
            else:
                by_caller[caller]["errors"] += 1
        for caller, stats in by_caller.items():
            err = f" {RED}err:{stats['errors']}{RST}" if stats["errors"] else ""
            print(f" {caller:25s} calls:{stats['count']:4d} "
                  f"{GREEN}ok:{stats['success']}{RST}{err}")

        by_model = {}
        for r in metrics:
            model = r.get("model", "unknown")
            by_model[model] = by_model.get(model, 0) + 1
        print(f"\n {DIM}Models used:{RST}")
        for model, count in sorted(by_model.items(), key=lambda x: -x[1]):
            print(f" {model:30s} {count} calls")
    else:
        print(f" {DIM}(no local calls recorded yet){RST}")

    # ── HEARTBEAT STATUS ──
    print(f"\n {BOLD}HEARTBEAT ({len(ticks)} ticks today){RST}")
    print(f" {DIM}{'-' * 55}{RST}")
    if ticks:
        last = ticks[-1]
        decision = last.get("decision", last.get("actions", {}))
        if isinstance(decision, dict):
            severity = decision.get("severity", "unknown")
            reasoning = decision.get("reasoning", "")
            sev_color = GREEN if severity == "ok" else YELLOW if severity == "warning" else RED
            print(f" Last tick: {last.get('tick_id', '?')}")
            print(f" Severity: {sev_color}{severity}{RST}")
            if reasoning:
                print(f" Reasoning: {reasoning[:65]}")
        else:
            print(f" Last tick: {last.get('tick_id', '?')}")
            actions = last.get("actions", [])
            print(f" Actions: {actions if actions else 'none'}")

        model_decisions = sum(1 for t in ticks
                              if isinstance(t.get("decision"), dict)
                              and t["decision"].get("severity") != "fallback")
        fallback = len(ticks) - model_decisions
        print(f" {CYAN}Model: {model_decisions}{RST} | {DIM}Fallback: {fallback}{RST}")
    else:
        print(f" {DIM}(no ticks today){RST}")

    # ── HERMES SESSIONS ──
    local_sessions = [s for s in sessions
                      if "localhost:11434" in str(s.get("base_url", ""))]
    cloud_sessions = [s for s in sessions if s not in local_sessions]
    print(f"\n {BOLD}HERMES SESSIONS{RST}")
    print(f" {DIM}{'-' * 55}{RST}")
    print(f" Total: {len(sessions)} | "
          f"{GREEN}Local: {len(local_sessions)}{RST} | "
          f"{YELLOW}Cloud: {len(cloud_sessions)}{RST}")

    # ── ACTIVE LOOPS ──
    print(f"\n {BOLD}ACTIVE LOOPS{RST}")
    print(f" {DIM}{'-' * 55}{RST}")
    print(f" {CYAN}heartbeat_tick{RST} 10m hermes4:14b DECIDE phase")
    print(f" {DIM}model_health{RST} 5m (local check) Ollama ping")
    print(f" {DIM}gemini_worker{RST} 20m gemini-2.5-pro aider")
    print(f" {DIM}grok_worker{RST} 20m grok-3-fast opencode")
    print(f" {DIM}cross_review{RST} 30m gemini+grok PR review")

    print(f"\n{BOLD}{'=' * 70}{RST}")
    print(f" {DIM}Refresh: timmy-dashboard --watch | History: --hours=N{RST}")


if __name__ == "__main__":
    watch = "--watch" in sys.argv
    hours = 24
    for a in sys.argv[1:]:
        if a.startswith("--hours="):
            hours = int(a.split("=")[1])

    if watch:
        try:
            while True:
                render(hours)
                time.sleep(30)
        except KeyboardInterrupt:
            print(f"\n{DIM}Dashboard stopped.{RST}")
    else:
        render(hours)
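
Two details in the dashboard look worth a follow-up. First, the header f-string always prints `PID {huey_pid}` in green, so a stopped consumer would render as `PID None` followed by `DOWN`. Second, `get_local_metrics` compares `datetime.fromisoformat(...)` against a timezone-aware cutoff; if a metrics file holds naive timestamps, the comparison raises `TypeError`, the broad `except` swallows it, and the record is silently dropped. A possible sketch of both fixes follows. `huey_label`, `parse_utc`, and `is_recent` are hypothetical helpers, and treating naive timestamps as UTC is an assumption about how the metrics are written.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional

GREEN, RED, RST = "\033[32m", "\033[31m", "\033[0m"


def huey_label(pid: Optional[str]) -> str:
    """Render the Huey status without ever printing 'PID None'."""
    if pid:
        return f"{GREEN}PID {pid}{RST}"
    return f"{RED}DOWN{RST}"


def parse_utc(ts: str) -> datetime:
    """Parse an ISO timestamp; naive values are assumed to be UTC."""
    dt = datetime.fromisoformat(ts)
    if dt.tzinfo is None:
        dt = dt.replace(tzinfo=timezone.utc)
    return dt.astimezone(timezone.utc)


def is_recent(ts: str, hours: int, now: Optional[datetime] = None) -> bool:
    """True when `ts` falls inside the look-back window ending at `now`."""
    now = now or datetime.now(timezone.utc)
    return parse_utc(ts) >= now - timedelta(hours=hours)


if __name__ == "__main__":
    print("Huey:", huey_label(None))
    print("recent:", is_recent("2026-03-26T06:59:37.300889", hours=24))
```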
@@ -1,5 +1,5 @@
{
  "updated_at": "2026-03-26T06:59:37.300889",
  "updated_at": "2026-03-27T21:56:42.192325",
  "platforms": {
    "discord": [
      {
35
config.yaml
@@ -1,16 +1,19 @@
model:
  default: claude-opus-4-6
  provider: anthropic
  default: gpt-5.4
  provider: openai-codex
  context_length: 65536
  base_url: https://chatgpt.com/backend-api/codex
toolsets:
- all
agent:
  max_turns: 30
  reasoning_effort: medium
  reasoning_effort: xhigh
  verbose: false
terminal:
  backend: local
  cwd: .
  timeout: 180
  env_passthrough: []
  docker_image: nikolaik/python-nodejs:python3.11-nodejs20
  docker_forward_env: []
  singularity_image: docker://nikolaik/python-nodejs:python3.11-nodejs20
@@ -25,6 +28,7 @@ terminal:
  persistent_shell: true
browser:
  inactivity_timeout: 120
  command_timeout: 30
  record_sessions: false
checkpoints:
  enabled: true
@@ -32,6 +36,8 @@ checkpoints:
compression:
  enabled: false
  threshold: 0.5
  target_ratio: 0.2
  protect_last_n: 20
  summary_model: ''
  summary_provider: ''
  summary_base_url: ''
@@ -90,11 +96,13 @@ display:
  compact: false
  personality: ''
  resume_display: full
  busy_input_mode: interrupt
  bell_on_complete: false
  show_reasoning: false
  streaming: false
  show_cost: false
  skin: timmy
  tool_progress_command: false
  tool_progress: all
privacy:
  redact_pii: false
@@ -142,6 +150,7 @@ delegation:
  provider: ''
  base_url: ''
  api_key: ''
  max_iterations: 50
prefill_messages_file: ''
honcho: {}
timezone: ''
@@ -176,17 +185,17 @@ session_reset:
  mode: none
  idle_minutes: 0
custom_providers:
- name: Local Ollama
  base_url: http://localhost:11434/v1
  api_key: ollama
  model: glm-4.7-flash:latest
- name: Local llama.cpp
  base_url: http://localhost:8081/v1
  api_key: none
  model: auto
- name: Google Gemini
  base_url: https://generativelanguage.googleapis.com/v1beta/openai
  api_key_env: GEMINI_API_KEY
  model: gemini-2.5-pro
system_prompt_suffix: "You are Timmy. Your soul is defined in SOUL.md \u2014 read\
  \ it, live it.\nYou run locally on your owner's machine via Ollama. You never phone\
  \ home.\nYou speak plainly. You prefer short sentences. Brevity is a kindness.\n\
  \ it, live it.\nYou run locally on your owner's machine via llama.cpp. You never\
  \ phone home.\nYou speak plainly. You prefer short sentences. Brevity is a kindness.\n\
  When you don't know something, say so. Refusal over fabrication.\nSovereignty and\
  \ service always.\n"
skills:
@@ -197,12 +206,12 @@ providers:
    base_url: http://localhost:11434/v1
    model: hermes3:latest
mcp_servers:
  orchestration:
    command: /Users/apayne/.hermes/hermes-agent/venv/bin/python3
  morrowind:
    command: python3
    args:
    - /Users/apayne/.hermes/hermes-agent/tools/orchestration_mcp_server.py
    - /Users/apayne/.timmy/morrowind/mcp_server.py
    env: {}
    timeout: 120
    timeout: 30
fallback_model:
  provider: custom
  model: gemini-2.5-pro
24
deploy.sh
@@ -3,7 +3,7 @@
# This is the canonical way to deploy Timmy's configuration.
# Hermes-agent is the engine. timmy-config is the driver's seat.
#
# Usage: ./deploy.sh [--restart-loops]
# Usage: ./deploy.sh

set -euo pipefail

@@ -74,24 +74,10 @@ done
chmod +x "$HERMES_HOME/bin/"*.sh "$HERMES_HOME/bin/"*.py 2>/dev/null || true
log "bin/ -> $HERMES_HOME/bin/"

# === Restart loops if requested ===
if [ "${1:-}" = "--restart-loops" ]; then
    log "Killing existing loops..."
    pkill -f 'claude-loop.sh' 2>/dev/null || true
    pkill -f 'gemini-loop.sh' 2>/dev/null || true
    pkill -f 'timmy-orchestrator.sh' 2>/dev/null || true
    sleep 2

    log "Clearing stale locks..."
    rm -rf "$HERMES_HOME/logs/claude-locks/"* 2>/dev/null || true
    rm -rf "$HERMES_HOME/logs/gemini-locks/"* 2>/dev/null || true

    log "Relaunching loops..."
    nohup bash "$HERMES_HOME/bin/timmy-orchestrator.sh" >> "$HERMES_HOME/logs/timmy-orchestrator.log" 2>&1 &
    nohup bash "$HERMES_HOME/bin/claude-loop.sh" 2 >> "$HERMES_HOME/logs/claude-loop.log" 2>&1 &
    nohup bash "$HERMES_HOME/bin/gemini-loop.sh" 1 >> "$HERMES_HOME/logs/gemini-loop.log" 2>&1 &
    sleep 1
    log "Loops relaunched."
if [ "${1:-}" != "" ]; then
    echo "ERROR: deploy.sh no longer accepts legacy loop flags." >&2
    echo "Deploy the sidecar only. Do not relaunch deprecated bash loops." >&2
    exit 1
fi

log "Deploy complete. timmy-config applied to $HERMES_HOME/"
438
docs/local-model-integration-sketch.md
Normal file
@@ -0,0 +1,438 @@
# Local Model Integration Sketch v2
# Hermes4-14B in the Heartbeat Loop — No New Telemetry

## Principle

No new inference layer. Huey tasks call `hermes chat -q` pointed at
Ollama. Hermes handles sessions, token tracking, cost logging.
The dashboard reads what Hermes already stores.

---

## Why Not Ollama Directly?

Ollama is fine as a serving backend. The issue isn't Ollama — it's that
calling Ollama directly with urllib bypasses the harness. The harness
already tracks sessions, tokens, model/provider, platform. Building a
second telemetry layer is owning code we don't need.

Ollama as a named provider isn't wired into the --provider flag yet,
but routing works via env vars:

    HERMES_MODEL="hermes4:14b" \
    HERMES_PROVIDER="custom" \
    HERMES_BASE_URL="http://localhost:11434/v1" \
    hermes chat -q "prompt here" -Q

This creates a tracked session, logs tokens, and returns the response.
That's our local inference call.

### Alternatives to Ollama for serving:
- **llama.cpp server** — lighter, no Python, raw HTTP. Good for single
  model serving. Less convenient for model switching.
- **vLLM** — best throughput, but needs NVIDIA GPU. Not for M3 Mac.
- **MLX serving** — native Apple Silicon, but no OpenAI-compat API yet.
  MLX is for training, not serving (our current policy).
- **llamafile** — single binary, portable. Good for distribution.

Verdict: Ollama is fine. It's the standard OpenAI-compat local server
on Mac. The issue was never Ollama — it was bypassing the harness.

---

## 1. The Call Pattern

One function in tasks.py that all Huey tasks use:

```python
import subprocess
import json

HERMES_BIN = "hermes"
LOCAL_ENV = {
    "HERMES_MODEL": "hermes4:14b",
    "HERMES_PROVIDER": "custom",
    "HERMES_BASE_URL": "http://localhost:11434/v1",
}

def hermes_local(prompt, caller_tag=None, max_retries=2):
    """Call hermes with local Ollama model. Returns response text.

    Every call creates a hermes session with full telemetry.
    caller_tag gets prepended to prompt for searchability.
    """
    import os
    env = os.environ.copy()
    env.update(LOCAL_ENV)

    tagged_prompt = prompt
    if caller_tag:
        tagged_prompt = f"[{caller_tag}] {prompt}"

    for attempt in range(max_retries + 1):
        try:
            result = subprocess.run(
                [HERMES_BIN, "chat", "-q", tagged_prompt, "-Q", "-t", "none"],
                capture_output=True, text=True,
                timeout=120, env=env,
            )
            if result.returncode == 0 and result.stdout.strip():
                # Strip the session_id line from -Q output
                lines = result.stdout.strip().split("\n")
                response_lines = [l for l in lines if not l.startswith("session_id:")]
                return "\n".join(response_lines).strip()
        except subprocess.TimeoutExpired:
            if attempt == max_retries:
                return None
            continue
    return None
```

Notes:
- `-t none` disables all toolsets — the heartbeat model shouldn't
  have terminal/file access. Pure reasoning only.
- `-Q` quiet mode suppresses banner/spinner, gives clean output.
- Every call creates a session in Hermes session store. Searchable,
  exportable, countable.
- The `[caller_tag]` prefix lets you filter sessions by which Huey
  task generated them: `hermes sessions list | grep heartbeat`

---

## 2. Heartbeat DECIDE Phase

Replace the hardcoded if/else with a model call:

```python
# In heartbeat_tick(), replace the DECIDE + ACT section:

# DECIDE: let hermes4:14b reason about what to do
decide_prompt = f"""System state at {now.isoformat()}:

{json.dumps(perception, indent=2)}

Previous tick: {last_tick.get('tick_id', 'none')}

You are the heartbeat monitor. Based on this state:
1. List any actions needed (alerts, restarts, escalations). Empty if all OK.
2. Rate severity: ok, warning, or critical.
3. One sentence of reasoning.

Respond ONLY with JSON:
{{"actions": [], "severity": "ok", "reasoning": "..."}}"""

decision = None
try:
    raw = hermes_local(decide_prompt, caller_tag="heartbeat_tick")
    if raw:
        # Try to parse JSON from the response
        # Model might wrap it in markdown, so extract
        for line in raw.split("\n"):
            line = line.strip()
            if line.startswith("{"):
                decision = json.loads(line)
                break
        if not decision:
            decision = json.loads(raw)
except (json.JSONDecodeError, Exception) as e:
    decision = None

# Fallback to hardcoded logic if model fails or is down
if decision is None:
    actions = []
    if not perception.get("gitea_alive"):
        actions.append("ALERT: Gitea unreachable")
    health = perception.get("model_health", {})
    if isinstance(health, dict) and not health.get("ollama_running"):
        actions.append("ALERT: Ollama not running")
    decision = {
        "actions": actions,
        "severity": "fallback",
        "reasoning": "model unavailable, used hardcoded checks"
    }

tick_record["decision"] = decision
actions = decision.get("actions", [])
```
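
The line-by-line scan above only succeeds when the model emits its JSON on a single line; a pretty-printed object, or one wrapped in a Markdown fence, sends every tick to the fallback branch. One possible tightening, sketched here with the standard library's `json.JSONDecoder.raw_decode`, scans for the first position where a complete object parses. `extract_json_object` is a hypothetical helper, not part of `tasks.py`.

```python
import json
from typing import Optional


def extract_json_object(raw: str) -> Optional[dict]:
    """Return the first JSON object embedded in `raw`, or None.

    Tries each '{' in turn, so prose, Markdown fences, and stray braces
    before the real payload are skipped rather than treated as fatal.
    """
    decoder = json.JSONDecoder()
    start = raw.find("{")
    while start != -1:
        try:
            obj, _end = decoder.raw_decode(raw, start)
        except json.JSONDecodeError:
            obj = None
        if isinstance(obj, dict):
            return obj
        start = raw.find("{", start + 1)
    return None


if __name__ == "__main__":
    reply = 'Sure:\n```json\n{\n  "actions": [],\n  "severity": "ok"\n}\n```'
    print(extract_json_object(reply))
```

In the DECIDE block, `decision = extract_json_object(raw)` could stand in for the loop and the second `json.loads(raw)` call, leaving the hardcoded fallback to handle a `None` result.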

---

## 3. DPO Candidate Collection

No new database. Hermes sessions ARE the DPO candidates.

Every `hermes_local()` call creates a session. To extract DPO pairs:

```bash
# Export all local-model sessions
hermes sessions export --output /tmp/local-sessions.jsonl

# Filter for heartbeat decisions
grep "heartbeat_tick" /tmp/local-sessions.jsonl > heartbeat_decisions.jsonl
```

The existing `session_export` Huey task (runs every 4h) already extracts
user→assistant pairs. It just needs to be aware that some sessions are
now local-model decisions instead of human conversations.

For DPO annotation, add a simple review script:

```python
# review_decisions.py — reads heartbeat tick logs, shows model decisions,
# asks Alexander to mark chosen/rejected
# Writes annotations back to the tick log files

import json
from pathlib import Path

TICK_DIR = Path.home() / ".timmy" / "heartbeat"

for log_file in sorted(TICK_DIR.glob("ticks_*.jsonl")):
    for line in log_file.read_text().strip().split("\n"):
        tick = json.loads(line)
        decision = tick.get("decision", {})
        if decision.get("severity") == "fallback":
            continue  # skip fallback entries

        print(f"\n--- Tick {tick['tick_id']} ---")
        print(f"Perception: {json.dumps(tick['perception'], indent=2)}")
        print(f"Decision: {json.dumps(decision, indent=2)}")

        rating = input("Rate (c=chosen, r=rejected, s=skip): ").strip()
        if rating in ("c", "r"):
            tick["dpo_label"] = "chosen" if rating == "c" else "rejected"
            # write back... (append to annotated file)
```
|
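Once ticks carry a `dpo_label`, they can be flattened into labeled records for later pairing. The sketch below assumes the tick fields used by the review script (`tick_id`, `perception`, `decision`, `dpo_label`); the function name and the output keys `prompt`, `response`, and `label` are hypothetical. It does not build chosen/rejected pairs for the same prompt, which a DPO trainer would still need.

```python
import json


def labeled_records(ticks):
    """Yield one record per tick that carries a chosen/rejected label."""
    for tick in ticks:
        label = tick.get("dpo_label")
        if label not in ("chosen", "rejected"):
            continue  # unlabeled or skipped ticks are ignored
        yield {
            "tick_id": tick.get("tick_id"),
            # Serialize deterministically so identical perceptions compare equal.
            "prompt": json.dumps(tick.get("perception", {}), sort_keys=True),
            "response": json.dumps(tick.get("decision", {}), sort_keys=True),
            "label": label,
        }
```

Serializing with `sort_keys=True` makes it straightforward to group records by prompt when looking for a chosen and a rejected response to the same perception.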
||||
|
||||
---
|
||||
|
||||
## 4. Dashboard — Reads Hermes Data
|
||||
|
||||
```python
|
||||
#!/usr/bin/env python3
|
||||
"""Timmy Model Dashboard — reads from Hermes, owns nothing."""
|
||||
|
||||
import json
|
||||
import os
|
||||
import subprocess
|
||||
import sys
|
||||
import time
|
||||
import urllib.request
|
||||
from datetime import datetime
|
||||
from pathlib import Path
|
||||
|
||||
HERMES_HOME = Path.home() / ".hermes"
|
||||
TIMMY_HOME = Path.home() / ".timmy"
|
||||
|
||||
|
||||
def get_ollama_models():
|
||||
"""What's available in Ollama."""
|
||||
try:
|
||||
req = urllib.request.Request("http://localhost:11434/api/tags")
|
||||
with urllib.request.urlopen(req, timeout=5) as resp:
|
||||
return json.loads(resp.read()).get("models", [])
|
||||
except Exception:
|
||||
return []
|
||||
|
||||
|
||||
def get_loaded_models():
|
||||
"""What's actually in VRAM right now."""
|
||||
try:
|
||||
req = urllib.request.Request("http://localhost:11434/api/ps")
|
||||
with urllib.request.urlopen(req, timeout=5) as resp:
|
||||
return json.loads(resp.read()).get("models", [])
|
||||
except Exception:
|
||||
return []
|
||||
|
||||
|
||||
def get_huey_status():
|
||||
try:
|
||||
r = subprocess.run(["pgrep", "-f", "huey_consumer"],
|
||||
capture_output=True, timeout=5)
|
||||
return r.returncode == 0
|
||||
except Exception:
|
||||
return False
|
||||
|
||||
|
||||
def get_hermes_sessions(hours=24):
|
||||
"""Read session metadata from Hermes session store."""
|
||||
sessions_file = HERMES_HOME / "sessions" / "sessions.json"
|
||||
if not sessions_file.exists():
|
||||
return []
|
||||
try:
|
||||
data = json.loads(sessions_file.read_text())
|
||||
return list(data.values())
|
||||
except Exception:
|
||||
return []
|
||||
|
||||
|
||||
def get_heartbeat_ticks(date_str=None):
|
||||
"""Read today's heartbeat ticks."""
|
||||
if not date_str:
|
||||
date_str = datetime.now().strftime("%Y%m%d")
|
||||
tick_file = TIMMY_HOME / "heartbeat" / f"ticks_{date_str}.jsonl"
|
||||
if not tick_file.exists():
|
||||
return []
|
||||
ticks = []
|
||||
for line in tick_file.read_text().strip().split("\n"):
|
||||
try:
|
||||
ticks.append(json.loads(line))
|
||||
except Exception:
|
||||
continue
|
||||
return ticks
|
||||
|
||||
|
||||
def render(hours=24):
|
||||
models = get_ollama_models()
|
||||
loaded = get_loaded_models()
|
||||
huey = get_huey_status()
|
||||
sessions = get_hermes_sessions(hours)
|
||||
ticks = get_heartbeat_ticks()
|
||||
|
||||
loaded_names = {m.get("name", "") for m in loaded}
|
||||
|
||||
print("\033[2J\033[H")
|
||||
print("=" * 70)
|
||||
print(" TIMMY MODEL DASHBOARD")
|
||||
now = datetime.now().strftime("%Y-%m-%d %H:%M:%S")
|
||||
print(f" {now} | Huey: {'UP' if huey else 'DOWN'} | Ollama models: {len(models)}")
|
||||
print("=" * 70)
|
||||
|
||||
# DEPLOYMENTS
|
||||
print("\n LOCAL MODELS")
|
||||
print(" " + "-" * 55)
|
||||
for m in models:
|
||||
name = m.get("name", "?")
|
||||
size_gb = m.get("size", 0) / 1e9
|
||||
status = "IN VRAM" if name in loaded_names else "on disk"
|
||||
print(f" {name:35s} {size_gb:5.1f}GB {status}")
|
||||
if not models:
|
||||
print(" (Ollama not responding)")
|
||||
|
||||
# HERMES SESSION ACTIVITY
|
||||
# Count sessions by platform/provider
|
||||
print(f"\n HERMES SESSIONS (recent)")
|
||||
print(" " + "-" * 55)
|
||||
local_sessions = [s for s in sessions
|
||||
if "localhost" in str(s.get("origin", {}))]
|
||||
cli_sessions = [s for s in sessions
|
||||
if s.get("platform") == "cli" or s.get("origin", {}).get("platform") == "cli"]
|
||||
|
||||
total_tokens = sum(s.get("total_tokens", 0) for s in sessions)
|
||||
print(f" Total sessions: {len(sessions)}")
|
||||
print(f" CLI sessions: {len(cli_sessions)}")
|
||||
print(f" Total tokens: {total_tokens:,}")
|
||||
|
||||
# HEARTBEAT STATUS
|
||||
print(f"\n HEARTBEAT ({len(ticks)} ticks today)")
|
||||
print(" " + "-" * 55)
|
||||
if ticks:
|
||||
last = ticks[-1]
|
||||
decision = last.get("decision", {})
|
||||
severity = decision.get("severity", "unknown")
|
||||
reasoning = decision.get("reasoning", "no model decision yet")
|
||||
print(f" Last tick: {last.get('tick_id', '?')}")
|
||||
print(f" Severity: {severity}")
|
||||
print(f" Reasoning: {str(reasoning)[:60]}")
|
||||
|
||||
# Count model vs fallback decisions
|
||||
model_decisions = sum(1 for t in ticks
|
||||
if t.get("decision", {}).get("severity") != "fallback")
|
||||
fallback = len(ticks) - model_decisions
|
||||
print(f" Model decisions: {model_decisions} | Fallback: {fallback}")
|
||||
|
||||
# DPO labels if any
|
||||
labeled = sum(1 for t in ticks if "dpo_label" in t)
|
||||
if labeled:
|
||||
chosen = sum(1 for t in ticks if t.get("dpo_label") == "chosen")
|
||||
rejected = sum(1 for t in ticks if t.get("dpo_label") == "rejected")
|
||||
print(f" DPO labeled: {labeled} (chosen: {chosen}, rejected: {rejected})")
|
||||
else:
|
||||
print(" (no ticks today)")
|
||||
|
||||
# ACTIVE LOOPS
|
||||
print(f"\n ACTIVE LOOPS USING LOCAL MODELS")
|
||||
print(" " + "-" * 55)
|
||||
print(" heartbeat_tick 10m hermes4:14b DECIDE phase")
|
||||
print(" (future) 15m hermes4:14b issue triage")
|
||||
print(" (future) daily timmy:v0.1 morning report")
|
||||
|
||||
print(f"\n NON-LOCAL LOOPS (Gemini/Grok API)")
|
||||
print(" " + "-" * 55)
|
||||
print(" gemini_worker 20m gemini-2.5-pro aider")
|
||||
print(" grok_worker 20m grok-3-fast opencode")
|
||||
print(" cross_review 30m both PR review")
|
||||
|
||||
print("\n" + "=" * 70)
|
||||
|
||||
|
||||
if __name__ == "__main__":
|
||||
watch = "--watch" in sys.argv
|
||||
hours = 24
|
||||
for a in sys.argv[1:]:
|
||||
if a.startswith("--hours="):
|
||||
hours = int(a.split("=")[1])
|
||||
if watch:
|
||||
while True:
|
||||
render(hours)
|
||||
time.sleep(30)
|
||||
else:
|
||||
render(hours)
|
||||
```
|
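The dashboard accepts `--hours`, but `get_hermes_sessions()` returns every session regardless of age. A possible filter is sketched here, assuming each session record has an ISO 8601 timestamp under a key such as `updated_at`. That key name is a guess, not something confirmed by the Hermes session store, so it is exposed as a parameter.

```python
from datetime import datetime, timedelta


def filter_recent(sessions, hours=24, now=None, key="updated_at"):
    """Keep sessions whose timestamp falls within the last `hours` hours."""
    now = now or datetime.now()
    cutoff = now - timedelta(hours=hours)
    recent = []
    for session in sessions:
        stamp = session.get(key)
        if not stamp:
            continue
        try:
            when = datetime.fromisoformat(stamp)
        except (TypeError, ValueError):
            continue  # skip records with unparseable timestamps
        if when.tzinfo is not None:
            # Convert aware timestamps to naive local time before comparing.
            when = when.astimezone().replace(tzinfo=None)
        if when >= cutoff:
            recent.append(session)
    return recent
```

Note that `datetime.fromisoformat` only accepts a trailing `Z` from Python 3.11 onward; on older versions such timestamps are skipped by the `ValueError` branch.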
||||
|
||||
---
|
||||
|
||||
## 5. Implementation Steps
|
||||
|
||||
### Step 1: Add hermes_local() to tasks.py
|
||||
- One function, ~20 lines
|
||||
- Calls `hermes chat -q` with Ollama env vars
|
||||
- All telemetry comes from Hermes for free
|
||||
|
||||
### Step 2: Wire heartbeat_tick DECIDE phase
|
||||
- Replace 6 lines of if/else with hermes_local() call
|
||||
- Keep hardcoded fallback when model is down
|
||||
- Decision stored in tick record for DPO review
|
||||
|
||||
### Step 3: Fix the MCP server warning
|
||||
- The orchestration MCP server path is broken — harmless but noisy
|
||||
- Either fix the path or remove from config
|
||||
|
||||
### Step 4: Drop model_dashboard.py in timmy-config/bin/
|
||||
- Reads Ollama API, Hermes sessions, heartbeat ticks
|
||||
- No new data stores — just views over existing ones
|
||||
- `python3 model_dashboard.py --watch` for live view
|
||||
|
||||
### Step 5: Expand to more Huey tasks
|
||||
- triage_issues: model reads issue, picks agent
|
||||
- good_morning_report: model writes the "From Timmy" section
|
||||
- Each expansion is just calling hermes_local() with a different prompt
|
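As an illustration of Step 5, a triage expansion might consist of a prompt builder plus a strict parser for the reply. This is a hedged sketch: `build_triage_prompt` and `pick_agent` are hypothetical names, the prompt wording is invented, and the agent list would come from whatever workers are actually configured. The prompt would be passed to `hermes_local()`, whose exact signature is not assumed here.

```python
import json


def build_triage_prompt(title, body, agents):
    """Ask the model to pick exactly one agent and answer with one JSON line."""
    return (
        "You are triaging a Gitea issue.\n"
        f"Title: {title}\n"
        f"Body: {body[:2000]}\n"  # truncate long bodies to keep the prompt small
        f"Pick one agent from: {', '.join(agents)}.\n"
        'Reply with one line of JSON: {"agent": "<name>", "reasoning": "<why>"}'
    )


def pick_agent(raw, agents):
    """Return the chosen agent name, or None if the reply is unusable."""
    for line in (raw or "").splitlines():
        line = line.strip()
        if not line.startswith("{"):
            continue
        try:
            reply = json.loads(line)
        except json.JSONDecodeError:
            continue
        # Only accept agents from the allowed list; ignore anything else.
        if isinstance(reply, dict) and reply.get("agent") in agents:
            return reply["agent"]
    return None
```

Returning `None` for anything outside the allowed list gives the caller a clear point at which to apply a hardcoded fallback, mirroring the heartbeat DECIDE phase.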
||||
|
||||
---
|
||||
|
||||
## What Gets Hotfixed in Hermes Config
|
||||
|
||||
If `hermes insights` is broken (the cache_read_tokens column error),
|
||||
that needs a fix. The dashboard falls back to reading sessions.json
|
||||
directly, but insights would be the better data source.
|
||||
|
||||
The `providers.ollama` section in config.yaml exists but isn't wired
|
||||
to the --provider flag. Filing this upstream or patching locally would
|
||||
let us do `hermes chat -q "..." --provider ollama` cleanly instead
|
||||
of relying on env vars. Not blocking — env vars work today.
|
||||
|
||||
---
|
||||
|
||||
## What This Owns
|
||||
|
||||
- hermes_local() — 20-line wrapper around a subprocess call
|
||||
- model_dashboard.py — read-only views over existing data
|
||||
- review_decisions.py — optional DPO annotation CLI
|
||||
|
||||
## What This Does NOT Own
|
||||
|
||||
- Inference. Ollama does that.
|
||||
- Telemetry. Hermes does that.
|
||||
- Session storage. Hermes does that.
|
||||
- Token counting. Hermes does that.
|
||||
- Training pipeline. Already exists in timmy-config/training/.
|
||||
530
gitea_client.py
Normal file
@@ -0,0 +1,530 @@
|
||||
"""
|
||||
Gitea API Client — typed, sovereign, zero-dependency.
|
||||
|
||||
Replaces raw curl calls scattered across 41 bash scripts.
|
||||
Uses only stdlib (urllib) so it works on any Python install.
|
||||
|
||||
Usage:
|
||||
from tools.gitea_client import GiteaClient
|
||||
|
||||
client = GiteaClient() # reads token from ~/.hermes/gitea_token
|
||||
issues = client.list_issues("Timmy_Foundation/the-nexus", state="open")
|
||||
client.create_comment("Timmy_Foundation/the-nexus", 42, "PR created.")
|
||||
"""
|
||||
|
||||
from __future__ import annotations
|
||||
|
||||
import json
|
||||
import os
|
||||
import urllib.request
|
||||
import urllib.error
|
||||
import urllib.parse
|
||||
from dataclasses import dataclass, field
|
||||
from datetime import datetime, timezone
|
||||
from pathlib import Path
|
||||
from typing import Any, Optional
|
||||
|
||||
|
||||
# ---------------------------------------------------------------------------
|
||||
# Configuration
|
||||
# ---------------------------------------------------------------------------
|
||||
|
||||
def _read_token() -> str:
|
||||
"""Read Gitea token from standard locations."""
|
||||
for path in [
|
||||
Path.home() / ".hermes" / "gitea_token",
|
||||
Path.home() / ".hermes" / "gitea_token_vps",
|
||||
Path.home() / ".config" / "gitea" / "token",
|
||||
]:
|
||||
if path.exists():
|
||||
return path.read_text().strip()
|
||||
raise FileNotFoundError(
|
||||
"No Gitea token found. Checked: ~/.hermes/gitea_token, "
|
||||
"~/.hermes/gitea_token_vps, ~/.config/gitea/token"
|
||||
)
|
||||
|
||||
|
||||
def _read_base_url() -> str:
|
||||
"""Read Gitea base URL. Defaults to the VPS."""
|
||||
env = os.environ.get("GITEA_URL")
|
||||
if env:
|
||||
return env.rstrip("/")
|
||||
return "http://143.198.27.163:3000"
|
||||
|
||||
|
||||
# ---------------------------------------------------------------------------
|
||||
# Data classes — typed responses
|
||||
# ---------------------------------------------------------------------------
|
||||
|
||||
@dataclass
|
||||
class User:
|
||||
id: int
|
||||
login: str
|
||||
full_name: str = ""
|
||||
email: str = ""
|
||||
|
||||
@classmethod
|
||||
def from_dict(cls, d: dict) -> "User":
|
||||
return cls(
|
||||
id=d.get("id", 0),
|
||||
login=d.get("login", ""),
|
||||
full_name=d.get("full_name", ""),
|
||||
email=d.get("email", ""),
|
||||
)
|
||||
|
||||
|
||||
@dataclass
|
||||
class Label:
|
||||
id: int
|
||||
name: str
|
||||
color: str = ""
|
||||
|
||||
@classmethod
|
||||
def from_dict(cls, d: dict) -> "Label":
|
||||
return cls(id=d.get("id", 0), name=d.get("name", ""), color=d.get("color", ""))
|
||||
|
||||
|
||||
@dataclass
|
||||
class Issue:
|
||||
number: int
|
||||
title: str
|
||||
body: str
|
||||
state: str
|
||||
user: User
|
||||
assignees: list[User] = field(default_factory=list)
|
||||
labels: list[Label] = field(default_factory=list)
|
||||
created_at: str = ""
|
||||
updated_at: str = ""
|
||||
comments: int = 0
|
||||
|
||||
@classmethod
|
||||
def from_dict(cls, d: dict) -> "Issue":
|
||||
return cls(
|
||||
number=d.get("number", 0),
|
||||
title=d.get("title", ""),
|
||||
body=d.get("body", "") or "",
|
||||
state=d.get("state", ""),
|
||||
user=User.from_dict(d.get("user") or {}),
|
||||
assignees=[User.from_dict(a) for a in d.get("assignees", []) or []],
|
||||
labels=[Label.from_dict(lb) for lb in d.get("labels", []) or []],
|
||||
created_at=d.get("created_at", ""),
|
||||
updated_at=d.get("updated_at", ""),
|
||||
comments=d.get("comments", 0),
|
||||
)
|
||||
|
||||
|
||||
@dataclass
|
||||
class Comment:
|
||||
id: int
|
||||
body: str
|
||||
user: User
|
||||
created_at: str = ""
|
||||
|
||||
@classmethod
|
||||
def from_dict(cls, d: dict) -> "Comment":
|
||||
return cls(
|
||||
id=d.get("id", 0),
|
||||
body=d.get("body", "") or "",
|
||||
user=User.from_dict(d.get("user") or {}),
|
||||
created_at=d.get("created_at", ""),
|
||||
)
|
||||
|
||||
|
||||
@dataclass
|
||||
class PullRequest:
|
||||
number: int
|
||||
title: str
|
||||
body: str
|
||||
state: str
|
||||
user: User
|
||||
head_branch: str = ""
|
||||
base_branch: str = ""
|
||||
mergeable: bool = False
|
||||
merged: bool = False
|
||||
changed_files: int = 0
|
||||
|
||||
@classmethod
|
||||
def from_dict(cls, d: dict) -> "PullRequest":
|
||||
head = d.get("head", {}) or {}
|
||||
base = d.get("base", {}) or {}
|
||||
return cls(
|
||||
number=d.get("number", 0),
|
||||
title=d.get("title", ""),
|
||||
body=d.get("body", "") or "",
|
||||
state=d.get("state", ""),
|
||||
user=User.from_dict(d.get("user") or {}),
|
||||
head_branch=head.get("ref", ""),
|
||||
base_branch=base.get("ref", ""),
|
||||
mergeable=d.get("mergeable", False),
|
||||
merged=d.get("merged", False) or False,
|
||||
changed_files=d.get("changed_files", 0),
|
||||
)
|
||||
|
||||
|
||||
@dataclass
|
||||
class PRFile:
|
||||
filename: str
|
||||
status: str # added, modified, deleted
|
||||
additions: int = 0
|
||||
deletions: int = 0
|
||||
|
||||
@classmethod
|
||||
def from_dict(cls, d: dict) -> "PRFile":
|
||||
return cls(
|
||||
filename=d.get("filename", ""),
|
||||
status=d.get("status", ""),
|
||||
additions=d.get("additions", 0),
|
||||
deletions=d.get("deletions", 0),
|
||||
)
|
||||
|
||||
|
||||
# ---------------------------------------------------------------------------
|
||||
# Client
|
||||
# ---------------------------------------------------------------------------
|
||||
|
||||
class GiteaError(Exception):
|
||||
"""Gitea API error with status code."""
|
||||
def __init__(self, status: int, message: str, url: str = ""):
|
||||
self.status = status
|
||||
self.url = url
|
||||
super().__init__(f"Gitea {status}: {message} [{url}]")
|
||||
|
||||
|
||||
class GiteaClient:
|
||||
"""
|
||||
Typed Gitea API client. Sovereign, zero-dependency.
|
||||
|
||||
Covers all operations the agent loops need:
|
||||
- Issues: list, get, create, update, close, assign, label, comment
|
||||
- PRs: list, get, create, merge, update, close, files
|
||||
- Repos: list org repos
|
||||
"""
|
||||
|
||||
def __init__(
|
||||
self,
|
||||
base_url: Optional[str] = None,
|
||||
token: Optional[str] = None,
|
||||
):
|
||||
self.base_url = base_url or _read_base_url()
|
||||
self.token = token or _read_token()
|
||||
self.api = f"{self.base_url}/api/v1"
|
||||
|
||||
# -- HTTP layer ----------------------------------------------------------
|
||||
|
||||
def _request(
|
||||
self,
|
||||
method: str,
|
||||
path: str,
|
||||
data: Optional[dict] = None,
|
||||
params: Optional[dict] = None,
|
||||
) -> Any:
|
||||
"""Make an authenticated API request. Returns parsed JSON."""
|
||||
url = f"{self.api}{path}"
|
||||
if params:
|
||||
url += "?" + urllib.parse.urlencode(params)
|
||||
|
||||
body = json.dumps(data).encode() if data is not None else None
|
||||
req = urllib.request.Request(url, data=body, method=method)
|
||||
req.add_header("Authorization", f"token {self.token}")
|
||||
req.add_header("Content-Type", "application/json")
|
||||
req.add_header("Accept", "application/json")
|
||||
|
||||
try:
|
||||
with urllib.request.urlopen(req, timeout=30) as resp:
|
||||
raw = resp.read().decode()
|
||||
if not raw:
|
||||
return {}
|
||||
return json.loads(raw)
|
||||
except urllib.error.HTTPError as e:
|
||||
body_text = ""
|
||||
try:
|
||||
body_text = e.read().decode()
|
||||
except Exception:
|
||||
pass
|
||||
raise GiteaError(e.code, body_text, url) from e
|
||||
|
||||
def _get(self, path: str, **params) -> Any:
|
||||
# Filter out None values
|
||||
clean = {k: v for k, v in params.items() if v is not None}
|
||||
return self._request("GET", path, params=clean)
|
||||
|
||||
def _post(self, path: str, data: dict) -> Any:
|
||||
return self._request("POST", path, data=data)
|
||||
|
||||
def _patch(self, path: str, data: dict) -> Any:
|
||||
return self._request("PATCH", path, data=data)
|
||||
|
||||
def _delete(self, path: str) -> Any:
|
||||
return self._request("DELETE", path)
|
||||
|
||||
def _repo_path(self, repo: str) -> str:
|
||||
"""Convert 'owner/name' to '/repos/owner/name'."""
|
||||
return f"/repos/{repo}"
|
||||
|
||||
# -- Health --------------------------------------------------------------
|
||||
|
||||
def ping(self) -> bool:
|
||||
"""Check if Gitea is responding."""
|
||||
try:
|
||||
self._get("/version")
|
||||
return True
|
||||
except Exception:
|
||||
return False
|
||||
|
||||
# -- Repos ---------------------------------------------------------------
|
||||
|
||||
def list_org_repos(self, org: str, limit: int = 50) -> list[dict]:
|
||||
"""List repos in an organization."""
|
||||
return self._get(f"/orgs/{org}/repos", limit=limit)
|
||||
|
||||
# -- Issues --------------------------------------------------------------
|
||||
|
||||
def list_issues(
|
||||
self,
|
||||
repo: str,
|
||||
state: str = "open",
|
||||
assignee: Optional[str] = None,
|
||||
labels: Optional[str] = None,
|
||||
sort: str = "created",
|
||||
direction: str = "desc",
|
||||
limit: int = 30,
|
||||
page: int = 1,
|
||||
) -> list[Issue]:
|
||||
"""List issues for a repo."""
|
||||
raw = self._get(
|
||||
f"{self._repo_path(repo)}/issues",
|
||||
state=state,
|
||||
type="issues",
|
||||
assignee=assignee,
|
||||
labels=labels,
|
||||
sort=sort,
|
||||
direction=direction,
|
||||
limit=limit,
|
||||
page=page,
|
||||
)
|
||||
return [Issue.from_dict(i) for i in raw]
|
||||
|
||||
def get_issue(self, repo: str, number: int) -> Issue:
|
||||
"""Get a single issue."""
|
||||
return Issue.from_dict(
|
||||
self._get(f"{self._repo_path(repo)}/issues/{number}")
|
||||
)
|
||||
|
||||
def create_issue(
|
||||
self,
|
||||
repo: str,
|
||||
title: str,
|
||||
body: str = "",
|
||||
labels: Optional[list[int]] = None,
|
||||
assignees: Optional[list[str]] = None,
|
||||
) -> Issue:
|
||||
"""Create an issue."""
|
||||
data: dict[str, Any] = {"title": title, "body": body}
|
||||
if labels:
|
||||
data["labels"] = labels
|
||||
if assignees:
|
||||
data["assignees"] = assignees
|
||||
return Issue.from_dict(
|
||||
self._post(f"{self._repo_path(repo)}/issues", data)
|
||||
)
|
||||
|
||||
def update_issue(
|
||||
self,
|
||||
repo: str,
|
||||
number: int,
|
||||
title: Optional[str] = None,
|
||||
body: Optional[str] = None,
|
||||
state: Optional[str] = None,
|
||||
assignees: Optional[list[str]] = None,
|
||||
) -> Issue:
|
||||
"""Update an issue (title, body, state, assignees)."""
|
||||
data: dict[str, Any] = {}
|
||||
if title is not None:
|
||||
data["title"] = title
|
||||
if body is not None:
|
||||
data["body"] = body
|
||||
if state is not None:
|
||||
data["state"] = state
|
||||
if assignees is not None:
|
||||
data["assignees"] = assignees
|
||||
return Issue.from_dict(
|
||||
self._patch(f"{self._repo_path(repo)}/issues/{number}", data)
|
||||
)
|
||||
|
||||
def close_issue(self, repo: str, number: int) -> Issue:
|
||||
"""Close an issue."""
|
||||
return self.update_issue(repo, number, state="closed")
|
||||
|
||||
def assign_issue(self, repo: str, number: int, assignees: list[str]) -> Issue:
|
||||
"""Assign users to an issue."""
|
||||
return self.update_issue(repo, number, assignees=assignees)
|
||||
|
||||
def add_labels(self, repo: str, number: int, label_ids: list[int]) -> list[Label]:
|
||||
"""Add labels to an issue."""
|
||||
raw = self._post(
|
||||
f"{self._repo_path(repo)}/issues/{number}/labels",
|
||||
{"labels": label_ids},
|
||||
)
|
||||
return [Label.from_dict(lb) for lb in raw]
|
||||
|
||||
# -- Comments ------------------------------------------------------------
|
||||
|
||||
def list_comments(
|
||||
self, repo: str, number: int, since: Optional[str] = None
|
||||
) -> list[Comment]:
|
||||
"""List comments on an issue."""
|
||||
raw = self._get(
|
||||
f"{self._repo_path(repo)}/issues/{number}/comments",
|
||||
since=since,
|
||||
)
|
||||
return [Comment.from_dict(c) for c in raw]
|
||||
|
||||
def create_comment(self, repo: str, number: int, body: str) -> Comment:
|
||||
"""Add a comment to an issue."""
|
||||
return Comment.from_dict(
|
||||
self._post(
|
||||
f"{self._repo_path(repo)}/issues/{number}/comments",
|
||||
{"body": body},
|
||||
)
|
||||
)
|
||||
|
||||
# -- Pull Requests -------------------------------------------------------
|
||||
|
||||
def list_pulls(
|
||||
self,
|
||||
repo: str,
|
||||
state: str = "open",
|
||||
sort: str = "newest",
|
||||
limit: int = 20,
|
||||
page: int = 1,
|
||||
) -> list[PullRequest]:
|
||||
"""List pull requests."""
|
||||
raw = self._get(
|
||||
f"{self._repo_path(repo)}/pulls",
|
||||
state=state,
|
||||
sort=sort,
|
||||
limit=limit,
|
||||
page=page,
|
||||
)
|
||||
return [PullRequest.from_dict(p) for p in raw]
|
||||
|
||||
def get_pull(self, repo: str, number: int) -> PullRequest:
|
||||
"""Get a single pull request."""
|
||||
return PullRequest.from_dict(
|
||||
self._get(f"{self._repo_path(repo)}/pulls/{number}")
|
||||
)
|
||||
|
||||
def create_pull(
|
||||
self,
|
||||
repo: str,
|
||||
title: str,
|
||||
head: str,
|
||||
base: str = "main",
|
||||
body: str = "",
|
||||
) -> PullRequest:
|
||||
"""Create a pull request."""
|
||||
return PullRequest.from_dict(
|
||||
self._post(
|
||||
f"{self._repo_path(repo)}/pulls",
|
||||
{"title": title, "head": head, "base": base, "body": body},
|
||||
)
|
||||
)
|
||||
|
||||
def merge_pull(
|
||||
self,
|
||||
repo: str,
|
||||
number: int,
|
||||
method: str = "squash",
|
||||
delete_branch: bool = True,
|
||||
) -> bool:
|
||||
"""Merge a pull request. Returns True on success."""
|
||||
try:
|
||||
self._post(
|
||||
f"{self._repo_path(repo)}/pulls/{number}/merge",
|
||||
{"Do": method, "delete_branch_after_merge": delete_branch},
|
||||
)
|
||||
return True
|
||||
except GiteaError as e:
|
||||
if e.status == 405: # not mergeable
|
||||
return False
|
||||
raise
|
||||
|
||||
def update_pull_branch(
|
||||
self, repo: str, number: int, style: str = "rebase"
|
||||
) -> bool:
|
||||
"""Update a PR branch (rebase onto base). Returns True on success."""
|
||||
try:
|
||||
self._post(
|
||||
f"{self._repo_path(repo)}/pulls/{number}/update",
|
||||
{"style": style},
|
||||
)
|
||||
return True
|
||||
except GiteaError:
|
||||
return False
|
||||
|
||||
def close_pull(self, repo: str, number: int) -> PullRequest:
|
||||
"""Close a pull request without merging."""
|
||||
return PullRequest.from_dict(
|
||||
self._patch(
|
||||
f"{self._repo_path(repo)}/pulls/{number}",
|
||||
{"state": "closed"},
|
||||
)
|
||||
)
|
||||
|
||||
def get_pull_files(self, repo: str, number: int) -> list[PRFile]:
|
||||
"""Get files changed in a pull request."""
|
||||
raw = self._get(f"{self._repo_path(repo)}/pulls/{number}/files")
|
||||
return [PRFile.from_dict(f) for f in raw]
|
||||
|
||||
def find_pull_by_branch(
|
||||
self, repo: str, branch: str
|
||||
) -> Optional[PullRequest]:
|
||||
"""Find an open PR for a given head branch."""
|
||||
prs = self.list_pulls(repo, state="open", limit=50)
|
||||
for pr in prs:
|
||||
if pr.head_branch == branch:
|
||||
return pr
|
||||
return None
|
||||
|
||||
# -- Convenience ---------------------------------------------------------
|
||||
|
||||
def get_issue_with_comments(
|
||||
self, repo: str, number: int, last_n: int = 5
|
||||
) -> tuple[Issue, list[Comment]]:
|
||||
"""Get an issue and its most recent comments."""
|
||||
issue = self.get_issue(repo, number)
|
||||
comments = self.list_comments(repo, number)
|
||||
return issue, comments[-last_n:] if len(comments) > last_n else comments
|
||||
|
||||
def find_unassigned_issues(
|
||||
self,
|
||||
repo: str,
|
||||
limit: int = 30,
|
||||
exclude_labels: Optional[list[str]] = None,
|
||||
exclude_title_patterns: Optional[list[str]] = None,
|
||||
) -> list[Issue]:
|
||||
"""Find open issues not assigned to anyone."""
|
||||
issues = self.list_issues(repo, state="open", limit=limit)
|
||||
result = []
|
||||
for issue in issues:
|
||||
if issue.assignees:
|
||||
continue
|
||||
if exclude_labels:
|
||||
issue_label_names = {lb.name for lb in issue.labels}
|
||||
if issue_label_names & set(exclude_labels):
|
||||
continue
|
||||
if exclude_title_patterns:
|
||||
title_lower = issue.title.lower()
|
||||
if any(p.lower() in title_lower for p in exclude_title_patterns):
|
||||
continue
|
||||
result.append(issue)
|
||||
return result
|
||||
|
||||
def find_agent_issues(self, repo: str, agent: str, limit: int = 50) -> list[Issue]:
|
||||
"""Find open issues assigned to a specific agent."""
|
||||
return self.list_issues(repo, state="open", assignee=agent, limit=limit)
|
||||
|
||||
def find_agent_pulls(self, repo: str, agent: str) -> list[PullRequest]:
|
||||
"""Find open PRs created by a specific agent."""
|
||||
prs = self.list_pulls(repo, state="open", limit=50)
|
||||
return [pr for pr in prs if pr.user.login == agent]
|
||||
@@ -1,8 +1,11 @@
|
||||
# Training
|
||||
|
||||
LoRA fine-tuning pipeline for Timmy's sovereign model. No custom harness — just config files for existing tools.
|
||||
Transitional training recipes for Timmy's sovereign model. These files are
|
||||
useful as reference configs and export helpers, but they are not the canonical
|
||||
home of Timmy's lived training data.
|
||||
|
||||
Replaces the `autolora` repo (1,500 lines of custom code → config + `make`).
|
||||
Canonical data should live in `timmy-home` under gameplay trajectories,
|
||||
research artifacts, and `training-data/` exports such as DPO pairs.
|
||||
|
||||
## Install
|
||||
|
||||
@@ -23,6 +26,16 @@ make convert # Convert merged data to MLX train/valid format
|
||||
make help # Show all targets
|
||||
```
|
||||
|
||||
## Status
|
||||
|
||||
This directory exists to avoid re-growing a bespoke training harness while the
|
||||
system boundary is being cleaned up.
|
||||
|
||||
- Keep thin recipes and export helpers here only when they directly support the
|
||||
Hermes sidecar.
|
||||
- Keep generated data, DPO pairs, and other lived artifacts in `timmy-home`.
|
||||
- Prefer deleting stale pipeline code over expanding it.
|
||||
|
||||
## Files
|
||||
|
||||
```
|
||||
|
||||