the-nexus

Author	SHA1	Message	Date
perplexity	bb4922adeb	feat: DPO pair quality validator — gate before overnight training Some checks failed CI / test (pull_request) Failing after 20s Details CI / validate (pull_request) Failing after 16s Details Review Approval Gate / verify-review (pull_request) Failing after 2s Details Add DPOQualityValidator that catches bad training pairs before they enter the tightening loop. Wired into DPOPairGenerator between generate() and export() as an automatic quality gate. New module: dpo_quality.py - 5 single-pair quality checks: 1. Field length minimums (prompt ≥40, chosen ≥80, rejected ≥30 chars) 2. Chosen/rejected length ratio (chosen must be ≥1.3x longer) 3. Chosen≈rejected similarity (Jaccard ≤0.70 — catches low-contrast) 4. Vocabulary diversity in chosen (unique word ratio ≥0.30) 5. Substance markers in chosen (≥2 fleet/training/action terms) - 2 cross-pair quality checks: 6. Near-duplicate prompts within batch (Jaccard ≤0.85) 7. Cross-run dedup against recent JSONL history files - Two modes: 'drop' (filter out bad pairs) or 'flag' (export with warning) - BatchReport with per-pair diagnostics, pass rates, and warnings - Standalone CLI: python3 dpo_quality.py <file.jsonl> [--strict] [--json] Modified: dpo_generator.py - Imports DPOQualityValidator with graceful degradation - Initializes from config validation section (enabled by default) - Validates between generate() and export() in run() - Quality report included in pipeline result dict - Validator failure never blocks — falls back to unvalidated export Modified: config.yaml - New deepdive.training.dpo.validation section with all tunable knobs: enabled, flagged_pair_action, similarity thresholds, length minimums, dedup_history_files Integration tested — 6 test cases covering: ✓ Good pairs pass (3/3 accepted) ✓ Bad pairs caught: too-short, high-similarity, inverted signal (0/3) ✓ Near-duplicate prompt detection (1/2 deduped) ✓ Flag mode preserves pairs with warnings (3/3 flagged) ✓ Cross-run deduplication against history (1 dupe caught) ✓ Full generator→validator→export pipeline (6/6 validated)	2026-04-13 02:46:50 +00:00
perplexity	55d53c513c	feat: Phase 3.5 — DPO training pair generation from Deep Dive pipeline Some checks failed CI / test (pull_request) Failing after 22s Details CI / validate (pull_request) Failing after 15s Details Review Approval Gate / verify-review (pull_request) Failing after 2s Details Wire arXiv relevance filter output directly into DPO pair generation, closing the loop between research synthesis and overnight training data. New module: dpo_generator.py - DPOPairGenerator class with 3 pair strategies: * summarize: paper → fleet-grounded analysis (chosen) vs generic (rejected) * relevance: 'what matters to Hermes?' → scored context vs vague * implication: 'what should we do?' → actionable insight vs platitude - Extracts synthesis excerpts matched to each ranked item - Outputs to ~/.timmy/training-data/dpo-pairs/deepdive_{timestamp}.jsonl - Format: {prompt, chosen, rejected, task_type, evidence_ids, source_session, safety_flags, metadata} Pipeline changes (pipeline.py): - Import DPOPairGenerator with graceful degradation - Initialize from config deepdive.training.dpo section - Execute as Phase 3.5 between synthesis and audio - DPO results included in pipeline return dict - Wrapped in try/except — DPO failure never blocks delivery Config changes (config.yaml): - New deepdive.training.dpo section with: enabled, output_dir, min_score, max_pairs_per_run, pair_types Integration tested: 2 mock items × 3 pair types = 6 valid JSONL pairs. Chosen responses consistently richer than rejected (assert-verified).	2026-04-13 02:24:04 +00:00
Bezalel	34862cf5e5	feat(fleet): promote Ollama to first-class provider, assign Gemma 4 across fleet Some checks failed Deploy Nexus / deploy (push) Failing after 3s Details Staging Verification Gate / verify-staging (push) Failing after 3s Details - lazarus-registry.yaml: replace big_brain/RunPod with local ollama/gemma4:12b - fleet-routing.json: assign ollama:gemma4:12b to carnice, bilbobagginshire, substratum - intelligence/deepdive/config.yaml: local model -> gemma4:12b	2026-04-07 15:55:52 +00:00
Ezra (Archivist)	9ad2132482	[ezra] #830 : Operational readiness checklist + fix Gitea URL to forge Some checks failed Deploy Nexus / deploy (push) Has been cancelled Details	2026-04-05 19:54:47 +00:00
Ezra (Archivist)	00600a7e67	[BURN] Deep Dive proof-of-life, fleet context fix, dry-run repair Some checks failed Deploy Nexus / deploy (push) Has been cancelled Details - Fix fleet_context.py env-var substitution for 0c16baadaebaaabc2c8390f35ef5e9aa2f4db671 - Remove non-existent wizard-checkpoints from config.yaml - Fix bin/deepdive_orchestrator.py dry-run mock items - Add PROOF_OF_LIFE.md with live execution output including fleet context Progresses #830	2026-04-05 18:42:18 +00:00
Ezra	5f4cc8cae2	config(deepdive): enable fleet context grounding (#830 ) Some checks failed Deploy Nexus / deploy (push) Has been cancelled Details	2026-04-05 17:32:24 +00:00
Ezra	cca5909cf9	[scaffold] Deep Dive intelligence pipeline: intelligence/deepdive/config.yaml Some checks failed Deploy Nexus / deploy (push) Has been cancelled Details	2026-04-05 06:19:50 +00:00

7 Commits