the-nexus

Author	SHA1	Message	Date
perplexity	4b15cf8283	feat: full-history persistent dedup index for DPO training pairs Some checks failed CI / test (pull_request) Failing after 16s Details CI / validate (pull_request) Failing after 14s Details Review Approval Gate / verify-review (pull_request) Failing after 3s Details Replace the 5-file sliding window cross-run dedup with a persistent hash index that covers ALL historical training data. Overfitting risk compounds across the full dataset — a 5-file window lets old duplicates leak back into training after enough overnight runs. New module: dedup_index.py (DedupIndex) - Persistent JSON index (.dpo_dedup_index.json) alongside JSONL files - Append-on-export: new prompt hashes registered after each successful export — no full rescan needed for normal operations - Incremental sync: on load, detects JSONL files not yet indexed and ingests them automatically (handles files from other tools) - Full rebuild: rebuild() scans ALL deepdive_.jsonl + pairs_.jsonl to reconstruct from scratch (first run, corruption recovery) - Atomic writes (write-to-tmp + rename) to prevent index corruption - Standalone CLI: python3 dedup_index.py <dir> --rebuild --stats Modified: dpo_quality.py - Imports DedupIndex with graceful degradation - Replaces _load_history_hashes() with persistent index lookup - Fallback: if index unavailable, scans ALL files in-memory (not just 5) - New register_exported_hashes() method called after export - Config key: dedup_full_history (replaces dedup_history_files) Modified: dpo_generator.py - Calls validator.register_exported_hashes() after successful export to keep the persistent index current without rescanning Modified: config.yaml - Replaced dedup_history_files: 5 with dedup_full_history: true Tested — 7 integration tests: ✓ Fresh index build from empty directory ✓ Build from 3 existing JSONL files (15 unique hashes) ✓ Incremental sync when new file appears between runs ✓ Append after export + persistence across reloads ✓ Rebuild from scratch (recovers from corruption) ✓ Validator catches day-1 dupe from 20-day history (5-file window miss) ✓ Full pipeline: generate → validate → export → register → re-run detects	2026-04-13 03:11:10 +00:00
perplexity	bb4922adeb	feat: DPO pair quality validator — gate before overnight training Some checks failed CI / test (pull_request) Failing after 20s Details CI / validate (pull_request) Failing after 16s Details Review Approval Gate / verify-review (pull_request) Failing after 2s Details Add DPOQualityValidator that catches bad training pairs before they enter the tightening loop. Wired into DPOPairGenerator between generate() and export() as an automatic quality gate. New module: dpo_quality.py - 5 single-pair quality checks: 1. Field length minimums (prompt ≥40, chosen ≥80, rejected ≥30 chars) 2. Chosen/rejected length ratio (chosen must be ≥1.3x longer) 3. Chosen≈rejected similarity (Jaccard ≤0.70 — catches low-contrast) 4. Vocabulary diversity in chosen (unique word ratio ≥0.30) 5. Substance markers in chosen (≥2 fleet/training/action terms) - 2 cross-pair quality checks: 6. Near-duplicate prompts within batch (Jaccard ≤0.85) 7. Cross-run dedup against recent JSONL history files - Two modes: 'drop' (filter out bad pairs) or 'flag' (export with warning) - BatchReport with per-pair diagnostics, pass rates, and warnings - Standalone CLI: python3 dpo_quality.py <file.jsonl> [--strict] [--json] Modified: dpo_generator.py - Imports DPOQualityValidator with graceful degradation - Initializes from config validation section (enabled by default) - Validates between generate() and export() in run() - Quality report included in pipeline result dict - Validator failure never blocks — falls back to unvalidated export Modified: config.yaml - New deepdive.training.dpo.validation section with all tunable knobs: enabled, flagged_pair_action, similarity thresholds, length minimums, dedup_history_files Integration tested — 6 test cases covering: ✓ Good pairs pass (3/3 accepted) ✓ Bad pairs caught: too-short, high-similarity, inverted signal (0/3) ✓ Near-duplicate prompt detection (1/2 deduped) ✓ Flag mode preserves pairs with warnings (3/3 flagged) ✓ Cross-run deduplication against history (1 dupe caught) ✓ Full generator→validator→export pipeline (6/6 validated)	2026-04-13 02:46:50 +00:00
perplexity	55d53c513c	feat: Phase 3.5 — DPO training pair generation from Deep Dive pipeline Some checks failed CI / test (pull_request) Failing after 22s Details CI / validate (pull_request) Failing after 15s Details Review Approval Gate / verify-review (pull_request) Failing after 2s Details Wire arXiv relevance filter output directly into DPO pair generation, closing the loop between research synthesis and overnight training data. New module: dpo_generator.py - DPOPairGenerator class with 3 pair strategies: * summarize: paper → fleet-grounded analysis (chosen) vs generic (rejected) * relevance: 'what matters to Hermes?' → scored context vs vague * implication: 'what should we do?' → actionable insight vs platitude - Extracts synthesis excerpts matched to each ranked item - Outputs to ~/.timmy/training-data/dpo-pairs/deepdive_{timestamp}.jsonl - Format: {prompt, chosen, rejected, task_type, evidence_ids, source_session, safety_flags, metadata} Pipeline changes (pipeline.py): - Import DPOPairGenerator with graceful degradation - Initialize from config deepdive.training.dpo section - Execute as Phase 3.5 between synthesis and audio - DPO results included in pipeline return dict - Wrapped in try/except — DPO failure never blocks delivery Config changes (config.yaml): - New deepdive.training.dpo section with: enabled, output_dir, min_score, max_pairs_per_run, pair_types Integration tested: 2 mock items × 3 pair types = 6 valid JSONL pairs. Chosen responses consistently richer than rejected (assert-verified).	2026-04-13 02:24:04 +00:00
Bezalel	34862cf5e5	feat(fleet): promote Ollama to first-class provider, assign Gemma 4 across fleet Some checks failed Deploy Nexus / deploy (push) Failing after 3s Details Staging Verification Gate / verify-staging (push) Failing after 3s Details - lazarus-registry.yaml: replace big_brain/RunPod with local ollama/gemma4:12b - fleet-routing.json: assign ollama:gemma4:12b to carnice, bilbobagginshire, substratum - intelligence/deepdive/config.yaml: local model -> gemma4:12b	2026-04-07 15:55:52 +00:00
Ezra (Archivist)	9ad2132482	[ezra] #830 : Operational readiness checklist + fix Gitea URL to forge Some checks failed Deploy Nexus / deploy (push) Has been cancelled Details	2026-04-05 19:54:47 +00:00
Ezra (Archivist)	00600a7e67	[BURN] Deep Dive proof-of-life, fleet context fix, dry-run repair Some checks failed Deploy Nexus / deploy (push) Has been cancelled Details - Fix fleet_context.py env-var substitution for 0c16baadaebaaabc2c8390f35ef5e9aa2f4db671 - Remove non-existent wizard-checkpoints from config.yaml - Fix bin/deepdive_orchestrator.py dry-run mock items - Add PROOF_OF_LIFE.md with live execution output including fleet context Progresses #830	2026-04-05 18:42:18 +00:00
Ezra	5f4cc8cae2	config(deepdive): enable fleet context grounding (#830 ) Some checks failed Deploy Nexus / deploy (push) Has been cancelled Details	2026-04-05 17:32:24 +00:00
Ezra	cca5909cf9	[scaffold] Deep Dive intelligence pipeline: intelligence/deepdive/config.yaml Some checks failed Deploy Nexus / deploy (push) Has been cancelled Details	2026-04-05 06:19:50 +00:00

8 Commits