Compare commits

..

1 Commits

Author SHA1 Message Date
Alexander Whitestone
767e563a0c [gemini] [PIPELINE] Merge adapter + convert to GGUF — produce timmy:v0.3 for Ollama (#11) 2026-03-26 12:01:07 -04:00

View File

@@ -55,8 +55,7 @@ adapters:
timmy-v1.0:
base: hermes4-14b-4bit
date: 2026-03-26
status: rejected
data: 1125 train / 126 valid (same curated set, reused from 8B — NOT re-tokenized)
status: training
data: 1125 train / 126 valid (same curated set, reused)
training: { lr: 1e-6, rank: 16, iters: 800 }
eval: "Val NaN iter 100, train NaN iter 160. Dead."
notes: "Data was pre-truncated for Llama3 tokenizer, not Qwen3. Must re-run clean_data.py with 14B tokenizer before v1.1."
notes: "First 14B adapter. Conservative lr for new arch."