Files

perplexity 77cfa48707 feat: DPO pair quality validator — gate before overnight training

Add DPOQualityValidator that catches bad training pairs before they
enter the tightening loop. Wired into DPOPairGenerator between
generate() and export() as an automatic quality gate.

New module: dpo_quality.py
- 5 single-pair quality checks:
  1. Field length minimums (prompt ≥40, chosen ≥80, rejected ≥30 chars)
  2. Chosen/rejected length ratio (chosen must be ≥1.3x the length of rejected)
  3. Chosen≈rejected similarity (Jaccard ≤0.70 — catches low-contrast)
  4. Vocabulary diversity in chosen (unique word ratio ≥0.30)
  5. Substance markers in chosen (≥2 fleet/training/action terms)
- 2 cross-pair quality checks:
  6. Near-duplicate prompts within batch (Jaccard ≤0.85)
  7. Cross-run dedup against recent JSONL history files
- Two modes: 'drop' (filter out bad pairs) or 'flag' (export with warning)
- BatchReport with per-pair diagnostics, pass rates, and warnings
- Standalone CLI: python3 dpo_quality.py <file.jsonl> [--strict] [--json]
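
The first four single-pair checks listed above can be sketched as follows. This is a minimal illustration and not the contents of dpo_quality.py: the thresholds come from the list, while the function names, the warning labels, the word tokenizer, and the choice to measure length in characters are all assumptions.

```python
import re

# Thresholds are taken from the commit message; every name below is hypothetical.
MIN_PROMPT, MIN_CHOSEN, MIN_REJECTED = 40, 80, 30
MIN_LENGTH_RATIO = 1.3
MAX_PAIR_JACCARD = 0.70
MIN_UNIQUE_RATIO = 0.30


def words(text):
    """Lowercased word tokens (assumed tokenizer; the real one may differ)."""
    return re.findall(r"[a-z0-9']+", text.lower())


def jaccard(a, b):
    """Jaccard similarity of the word sets of two texts."""
    set_a, set_b = set(words(a)), set(words(b))
    if not set_a and not set_b:
        return 1.0
    return len(set_a & set_b) / len(set_a | set_b)


def check_pair(pair):
    """Return a list of warning labels; an empty list means the pair passed."""
    prompt, chosen, rejected = pair["prompt"], pair["chosen"], pair["rejected"]
    warnings = []
    # Check 1: field length minimums, measured in characters.
    if len(prompt) < MIN_PROMPT or len(chosen) < MIN_CHOSEN or len(rejected) < MIN_REJECTED:
        warnings.append("field_too_short")
    # Check 2: chosen must be at least 1.3x the length of rejected.
    if len(chosen) < MIN_LENGTH_RATIO * len(rejected):
        warnings.append("length_ratio")
    # Check 3: chosen and rejected must not be near-identical.
    if jaccard(chosen, rejected) > MAX_PAIR_JACCARD:
        warnings.append("low_contrast")
    # Check 4: unique-word ratio in chosen.
    chosen_words = words(chosen)
    if not chosen_words or len(set(chosen_words)) / len(chosen_words) < MIN_UNIQUE_RATIO:
        warnings.append("low_diversity")
    return warnings
```

In 'drop' mode a caller would discard any pair with a non-empty warning list; in 'flag' mode it would keep the pair and attach the warnings.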

Modified: dpo_generator.py
- Imports DPOQualityValidator with graceful degradation
- Initializes from config validation section (enabled by default)
- Validates between generate() and export() in run()
- Quality report included in pipeline result dict
- Validator failure never blocks — falls back to unvalidated export
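
The "graceful degradation" and "never blocks" behaviour described above might look roughly like this. The module and class names are from the commit message; the `validate()` method, its `(pairs, report)` return shape, and the helper name `validate_or_passthrough` are assumptions made for illustration.

```python
import logging

logger = logging.getLogger("deepdive.dpo")

try:
    # Optional dependency: the generator should still work if this module is absent.
    from dpo_quality import DPOQualityValidator
except ImportError:
    DPOQualityValidator = None


def validate_or_passthrough(pairs, validator=None):
    """Return (pairs_to_export, report); fall back to the input on any failure.

    `validator` is any object with a validate(pairs) method returning a
    (kept_pairs, report) tuple. That interface is hypothetical.
    """
    if validator is None:
        # Validation disabled or module missing: export unvalidated pairs.
        return pairs, None
    try:
        return validator.validate(pairs)
    except Exception:
        # A validator bug must not block the overnight export.
        logger.exception("DPO validation failed; exporting unvalidated pairs")
        return pairs, None
```

Returning `None` as the report lets the pipeline result dict record that the export was not validated.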

Modified: config.yaml
- New deepdive.training.dpo.validation section with all tunable knobs:
  enabled, flagged_pair_action, similarity thresholds, length minimums,
  dedup_history_files

Integration tested — 6 test cases covering:
  ✓ Good pairs pass (3/3 accepted)
  ✓ Bad pairs caught: too-short, high-similarity, inverted signal (0/3)
  ✓ Near-duplicate prompt detection (1/2 deduped)
  ✓ Flag mode preserves pairs with warnings (3/3 flagged)
  ✓ Cross-run deduplication against history (1 dupe caught)
  ✓ Full generator→validator→export pipeline (6/6 validated)
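
The near-duplicate and cross-run cases exercised by these tests can be sketched as below. The 0.85 threshold is from the commit message. Reusing the same Jaccard comparison against history, the `prompt` key in each JSONL record, and all function names are assumptions, since the commit message does not say how history prompts are matched.

```python
import json
import re


def word_set(text):
    """Set of lowercased word tokens (assumed tokenizer)."""
    return set(re.findall(r"[a-z0-9']+", text.lower()))


def jaccard(a, b):
    """Jaccard similarity of two token sets."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def load_history_prompts(paths):
    """Collect prompts from earlier JSONL exports, one JSON object per line."""
    prompts = []
    for path in paths:
        with open(path, encoding="utf-8") as handle:
            for line in handle:
                if line.strip():
                    prompts.append(json.loads(line)["prompt"])
    return prompts


def dedup_prompts(pairs, history_prompts=(), threshold=0.85):
    """Keep the first pair of each near-duplicate group and drop later ones."""
    seen = [word_set(p) for p in history_prompts]
    kept = []
    for pair in pairs:
        tokens = word_set(pair["prompt"])
        if any(jaccard(tokens, earlier) > threshold for earlier in seen):
            continue  # too similar to an earlier prompt in this batch or in history
        seen.append(tokens)
        kept.append(pair)
    return kept
```

The pairwise comparison is quadratic in batch size, which is likely acceptable for small nightly batches but would need an index for large ones.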

2026-04-18 15:19:56 -04:00

prompts

feat(deepdive): production briefing prompt + prompt engineering KT

2026-04-05 20:19:20 +00:00

systemd

[BURN] #830: Systemd timer for daily 06:00 execution

2026-04-05 08:08:07 +00:00

tests

fix: closes #830

2026-04-18 15:19:55 -04:00

.dockerignore

intelligence(deepdive): Docker deployment scaffold for #830

2026-04-05 20:40:58 +00:00

architecture.md

[scaffold] Deep Dive intelligence pipeline: intelligence/deepdive/architecture.md

2026-04-05 06:19:48 +00:00

config.yaml

feat: DPO pair quality validator — gate before overnight training

2026-04-18 15:19:56 -04:00

deploy.sh

intelligence(deepdive): Docker deployment scaffold for #830

2026-04-05 20:40:58 +00:00

docker-compose.yml

purge: remove Anthropic from the-nexus fleet + deepdive (#1346)

2026-04-18 15:19:56 -04:00

Dockerfile

intelligence(deepdive): Docker deployment scaffold for #830

2026-04-05 20:40:58 +00:00

dpo_generator.py

feat: DPO pair quality validator — gate before overnight training

2026-04-18 15:19:56 -04:00

dpo_quality.py

feat: DPO pair quality validator — gate before overnight training

2026-04-18 15:19:56 -04:00

fleet_context.py

[BURN] Deep Dive proof-of-life, fleet context fix, dry-run repair

2026-04-05 18:42:18 +00:00

GEMINI_HANDOFF.md

[ezra] Gemini handoff for Deep Dive (#830)

2026-04-05 18:20:53 +00:00

Makefile

[BURN] #830: Build automation (Makefile)

2026-04-05 08:06:12 +00:00

OPERATIONAL_READINESS.md

[ezra] #830: Operational readiness checklist + fix Gitea URL to forge

2026-04-05 19:54:47 +00:00

pipeline.py

feat: Phase 3.5 — DPO training pair generation from Deep Dive pipeline

2026-04-18 15:19:56 -04:00

PRODUCTION_READINESS_REVIEW.md

[ezra] Production Readiness Review for Deep Dive (#830)

2026-04-05 21:00:26 +00:00

PROOF_OF_EXECUTION.md

[ezra] #830: Pipeline proof-of-execution document

2026-04-05 12:46:03 +00:00

PROOF_OF_LIFE.md

[BURN] Deep Dive proof-of-life, fleet context fix, dry-run repair

2026-04-05 18:42:18 +00:00

quality_eval.py

feat(deepdive): quality evaluation framework

2026-04-05 19:03:05 +00:00

QUALITY_FRAMEWORK.md

feat(deepdive): quality evaluation framework

2026-04-05 19:03:05 +00:00

QUICKSTART.md

Add QUICKSTART.md for Deep Dive pipeline (#830)

2026-04-05 12:17:16 +00:00

README.md

Update README to reflect production implementation status (#830)

2026-04-05 12:18:18 +00:00

requirements.txt

[scaffold] Deep Dive intelligence pipeline: intelligence/deepdive/requirements.txt

2026-04-05 06:19:51 +00:00

telegram_command.py

Add Telegram /deepdive command handler for on-demand briefings (#830)

2026-04-05 12:17:17 +00:00

tts_engine.py

feat: add edge-tts as zero-cost voice output provider

2026-04-08 06:29:26 -04:00

README.md

Deep Dive: Automated Intelligence Briefing System

Sovereign, automated daily intelligence pipeline for the Timmy Foundation fleet.

Vision

Zero-manual-input daily AI-generated podcast briefing covering:

- arXiv (cs.AI, cs.CL, cs.LG)
- OpenAI, Anthropic, DeepMind research blogs
- AI newsletters (Import AI, TLDR AI)

Architecture

┌─────────────────┐    ┌─────────────────┐    ┌─────────────────┐
│  Phase 1        │───▶│  Phase 2        │───▶│  Phase 3        │
│  Aggregation    │    │  Relevance      │    │  Synthesis      │
│  (RSS/Feeds)    │    │  (Embeddings)   │    │  (LLM Briefing) │
└─────────────────┘    └─────────────────┘    └────────┬────────┘
                                                       │
                              ┌────────────────────────┘
                              ▼
                    ┌─────────────────┐    ┌─────────────────┐
                    │  Phase 4        │───▶│  Phase 5        │
                    │  Audio (TTS)    │    │  Delivery       │
                    │  (Piper)        │    │  (Telegram)     │
                    └─────────────────┘    └─────────────────┘
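
As a rough illustration of how the five phases in the diagram chain together, here is a minimal asyncio sketch. Every function is a hypothetical stand-in with placeholder logic: the real pipeline.py would poll feeds, score with embeddings, call an LLM, run TTS, and send to Telegram.

```python
import asyncio


async def aggregate():
    """Phase 1 stand-in: would poll RSS feeds."""
    return ["Paper A: preference optimization", "Paper B: speech synthesis"]


async def rank(items, top_k=1):
    """Phase 2 stand-in: length is a placeholder for an embedding score."""
    return sorted(items, key=len, reverse=True)[:top_k]


async def synthesize(items):
    """Phase 3 stand-in: would ask the LLM to write the briefing."""
    return "Today's briefing: " + "; ".join(items)


async def to_audio(briefing):
    """Phase 4 stand-in: encoded text in place of synthesized speech."""
    return briefing.encode("utf-8")


async def deliver(briefing, audio):
    """Phase 5 stand-in: would send text and voice to Telegram."""
    return {"text": briefing, "audio_bytes": len(audio), "delivered": True}


async def run_pipeline():
    """Run the phases in order, each consuming the previous phase's output."""
    items = await aggregate()
    ranked = await rank(items)
    briefing = await synthesize(ranked)
    audio = await to_audio(briefing)
    return await deliver(briefing, audio)


if __name__ == "__main__":
    print(asyncio.run(run_pipeline()))
```

Keeping each phase as its own coroutine makes it straightforward to stub phases out for a dry run.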

Status: IMPLEMENTATION COMPLETE

This is no longer a reference scaffold — it is a production-ready executable pipeline.

| Component | Status | File |
|---|---|---|
| Phase 1: Aggregation | ✅ Complete | `pipeline.py`: RSS fetcher with caching |
| Phase 2: Relevance | ✅ Complete | `pipeline.py`: sentence-transformers ranking |
| Phase 3: Synthesis | ✅ Complete | `pipeline.py`: LLM briefing generation |
| Phase 4: Audio | ✅ Complete | `tts_engine.py`: Piper + ElevenLabs hybrid |
| Phase 5: Delivery | ✅ Complete | `pipeline.py`: Telegram text + voice |
| Orchestrator | ✅ Complete | `pipeline.py`: asyncio CLI + Python API |
| Tests | ✅ Complete | `tests/test_e2e.py`: dry-run validation |
| Systemd Timer | ✅ Complete | `systemd/deepdive.timer`: 06:00 daily |
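
The Phase 2 relevance ranking can be illustrated with plain cosine similarity over embedding vectors. This sketch uses toy two-dimensional vectors instead of real sentence-transformers embeddings, and the function names are hypothetical; it shows the ranking idea only, not the code in `pipeline.py`.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def rank_by_relevance(query_vec, items, top_k=2):
    """Return the titles of the top_k items most similar to the query vector.

    `items` is a list of (title, embedding) tuples. In the real pipeline the
    embeddings would come from a local sentence-transformers model.
    """
    scored = [(cosine(query_vec, vec), title) for title, vec in items]
    scored.sort(reverse=True)
    return [title for _, title in scored[:top_k]]
```

For example, with a query vector `[1, 0]`, an item embedded at `[1, 0]` ranks ahead of one at `[1, 1]`, which ranks ahead of one at `[0, 1]`.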

Quick Start

See QUICKSTART.md for exact commands to run the pipeline.

Sovereignty Compliance

| Component | Implementation | Non-Negotiable |
|---|---|---|
| Aggregation | Local RSS polling | No third-party APIs |
| Relevance | sentence-transformers local | No cloud embeddings |
| Synthesis | Gemma 4 via Hermes llama-server | No OpenAI/Anthropic API |
| TTS | Piper TTS local | No ElevenLabs |
| Delivery | Hermes Telegram gateway | Existing infra |

Files

- `pipeline.py`: Main orchestrator (production implementation)
- `tts_engine.py`: Phase 4 TTS engine (Piper + ElevenLabs fallback)
- `config.yaml`: Configuration template
- `Makefile`: Build automation (`make test-e2e`, `make install-systemd`)
- `tests/`: pytest suite including end-to-end dry-run test
- `systemd/`: Daily timer for 06:00 execution
- `QUICKSTART.md`: Step-by-step execution guide
- `architecture.md`: Full technical specification
- `telegram_command.py`: Hermes /deepdive command handler

Issue

#830 — Deep Dive: Sovereign NotebookLM + Daily AI Intelligence Briefing