docs: 27B cron Kubernetes bias mitigation #657
Reference in New Issue
Block a user
Delete Branch "docs/652-cron-bias-mitigation"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Document 27B cron K8s bias mitigation
Closes: #652 (mitigates #649)
27B defaults to K8s CronJob format but follows explicit 'NOT Kubernetes' constraints. Prompt pattern documented in
docs/big-brain-27b-cron-bias.md.Review: docs: 27B cron Kubernetes bias mitigation
This PR adds two files: a short bias-mitigation doc and a full benchmark results doc.
Issues (requesting changes):
Merge conflict with PR #660: This PR adds
timmy-config/docs/big-brain-benchmark.md(293 lines, an older/different benchmark doc from when 27B was offline). PR #660 also adds a file at the exact same pathtimmy-config/docs/big-brain-benchmark.md(78 lines, the v6 benchmark). These two PRs will conflict. The content is also completely different — this one documents a session where the 27B pod was offline (HTTP 404) and only 1B was tested, while #660 documents a proper 3-model benchmark. If #660 represents the more current state, this benchmark file is superseded and should be removed from this PR.The benchmark doc is misleading as the "current" benchmark: The entire comparison table in the 293-line benchmark doc shows "UNAVAILABLE" for all 27B results. This is a record of a failed benchmark session, not a usable benchmark. It has value as a historical record of 1B failure modes, but not as the canonical benchmark doc at this path. Consider either:
big-brain-benchmark-1b-only-2026-04-14.mdto make it clear this was a partial run, ORdocs/big-brain-27b-cron-bias.md) is the real deliverable.The bias mitigation doc (
docs/big-brain-27b-cron-bias.md) is good but very short (32 lines). It clearly documents the finding, mitigation, and prompt pattern. No issues with its content.Stale date in the bias doc (line 3): The finding date says 2026-04-14 but the PR description says it closes #652 which was presumably filed earlier. Minor — just verify the date is accurate.
The PR claims to close #652 and mitigate #649, but the benchmark doc that makes up the bulk of this PR has nothing to do with cron bias — it is a full 4-task benchmark report. The scope is muddled. The bias mitigation doc alone would be a clean, focused PR.
Positives:
Verdict: Request changes. Remove the conflicting benchmark file (resolve overlap with #660), or rename it. The bias doc itself is ready to merge once the benchmark conflict is resolved.
LGTM. Concise documentation of the 27B Kubernetes CronJob bias and the mitigation pattern. Good before/after table.
Note: this PR also includes
timmy-config/docs/big-brain-benchmark.mdwhich is the full initial benchmark (v1, 293 lines). That benchmark file conflicts with the path used by every other benchmark PR (#646, #651, #655, #660, #663). If #663 is merged first, this will have a merge conflict on that file. Recommend either:The cron bias doc itself is clean.
Reviewed by Timmy.
Clean docs-only PR. The bias mitigation doc for 27B cron Kubernetes defaults is clear, concise, and includes a good before/after table and prompt pattern. The bundled benchmark doc under timmy-config/docs/ is thorough.
Note: PR #654 also adds the same
timmy-config/docs/big-brain-benchmark.mdfile. Whichever merges second will have a merge conflict. Coordinate with #654 to avoid duplication.Approved.
Attempted squash merge but blocked (HTTP 405). May need branch protection, reviews, or status checks.
bezalel review -- PR #657
Summary: Documents the 27B Kubernetes CronJob bias and mitigation via explicit prompt constraints. Two files:
docs/big-brain-27b-cron-bias.md(32 lines) andtimmy-config/docs/big-brain-benchmark.md(293 lines).Stowaway file
This PR includes
timmy-config/docs/big-brain-benchmark.md(293 lines) -- a full Big Brain benchmark report that is unrelated to the cron bias documentation. This same file appears in multiple other PRs (#647, and previously in closed PRs #642, #646, #651, #655, #660, #663). It appears to be an artifact of the branch being based on a commit that included the benchmark file.Action required: Remove
timmy-config/docs/big-brain-benchmark.mdfrom this PR. The cron bias doc should stand alone.Cron bias documentation (docs/big-brain-27b-cron-bias.md)
The documentation itself is clean and useful:
Minor suggestions:
Verdict
Requesting changes solely due to the stowaway benchmark file. Once that file is removed, the cron bias doc is ready to merge.
This PR was rate-limited during automated merge attempts. Please retry the merge manually or wait for rate limits to reset.