timmy-config/training/data/split_manifest.json
perplexity 6507cffc15 feat: migrate autolora pipeline into training/
Per direction shift (the-nexus#542).

Replaces the autolora repo (1,500 lines of custom pipeline code)
with config files for existing tools:

- axolotl.yaml: replaces train_modal.py (239 lines)
- mlx-lora.yaml: replaces MLX training scripts
- eval-tasks.yaml: replaces run_eval.py (300 lines)
- Makefile: replaces run_vibes.py, compare.py, convert_to_mlx.py

Data migrated as-is:
- curated_dataset.jsonl (26 gold-standard conversations)
- preference_pairs.jsonl (DPO pairs)
- prompts_vibes.yaml, prompts_nexus_vibes.yaml
- v0-baseline eval results (historical record)

Thin glue kept:
- build_curated.py (data authoring, not infrastructure)
- ingest_trajectories.py (domain-specific quality filter)

Dependencies: pip install axolotl mlx-lm lm-eval
2026-03-25 23:05:50 +00:00

28 lines
691 B
JSON

{
  "total": 383,
  "train": 364,
  "test": 19,
  "test_ratio": 0.05,
  "split_by": "chronological",
  "test_ids": [
    "20260323_154745_7c9379",
    "20260323_155153_bbc661",
    "20260323_155458_a7563d",
    "20260323_160415_267a50",
    "20260323_160533_edc7cf",
    "20260323_160633_6fab92",
    "20260323_160736_754836",
    "20260323_160920_c90867",
    "20260323_161348_d553f0",
    "20260323_162017_83fe11",
    "20260323_162338_0267eb",
    "20260323_162704_86f56f",
    "20260323_163322_e45fa8",
    "20260323_163943_9ac5e0",
    "20260323_164605_a4aedf",
    "20260323_164606_c000d8",
    "20260323_165414_a92bf6",
    "20260323_165533_940c49",
    "20260323_170039_2a191f"
  ]
}
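
The manifest above records a chronological split with test_ratio 0.05, and its counts are consistent with that: 364 + 19 = 383, and 383 * 0.05 rounds down to 19. The script that produced it is not shown here, so the following is only a minimal sketch of how such a manifest could be built. The function name chronological_split is hypothetical, and two behaviours are assumptions rather than facts from the repo: the test count is rounded down, and the held-out set is the most recent slice of sessions. It relies on the IDs starting with a YYYYMMDD_HHMMSS timestamp, so plain string sorting orders them by time.

```python
import json
import math


def chronological_split(session_ids, test_ratio=0.05):
    """Build a split manifest that holds out the newest sessions as the test set.

    Hypothetical helper: floor rounding and "newest = test" are assumptions,
    not confirmed behaviour of the original pipeline.
    """
    # IDs look like "20260323_154745_7c9379", so lexical order is time order.
    ordered = sorted(session_ids)
    n_test = math.floor(len(ordered) * test_ratio)
    split_at = len(ordered) - n_test
    train_ids, test_ids = ordered[:split_at], ordered[split_at:]
    return {
        "total": len(ordered),
        "train": len(train_ids),
        "test": len(test_ids),
        "test_ratio": test_ratio,
        "split_by": "chronological",
        "test_ids": test_ids,
    }


if __name__ == "__main__":
    # Made-up IDs for illustration only; real IDs come from the trajectory data.
    demo_ids = [f"20260323_{i:06d}_abc123" for i in range(40)]
    print(json.dumps(chronological_split(demo_ids), indent=2))
```

Listing only test_ids keeps the manifest small: anything not named there is implicitly training data, so the train set can be recovered from the full list of session IDs.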