Commit Graph

1 Commits

Author SHA1 Message Date
Alexander Whitestone
e88b1b0d75 feat: training pair provenance tracking (#752)
Some checks failed
Architecture Lint / Linter Tests (pull_request) Has been cancelled
Architecture Lint / Lint Repository (pull_request) Has been cancelled
PR Checklist / pr-checklist (pull_request) Has been cancelled
Validate Config / Playbook Schema Validation (pull_request) Has been cancelled
Validate Training Data / validate (pull_request) Has been cancelled
Smoke Test / smoke (pull_request) Has been cancelled
Validate Config / YAML Lint (pull_request) Has been cancelled
Validate Config / JSON Validate (pull_request) Has been cancelled
Validate Config / Python Syntax & Import Check (pull_request) Has been cancelled
Validate Config / Python Test Suite (pull_request) Has been cancelled
Validate Config / Shell Script Lint (pull_request) Has been cancelled
Validate Config / Cron Syntax Check (pull_request) Has been cancelled
Validate Config / Deploy Script Dry Run (pull_request) Has been cancelled
Provenance module for tracking source of every training pair.

training/provenance.py (151 lines):
- add_provenance(): add metadata to pairs
- validate_provenance(): check required fields
- provenance_stats(): coverage and distribution
- backfill_provenance(): annotate existing pairs
- filter_by_provenance(): exclude by model/source
- extract_provenance_from_trajectory(): hermes integration

Required fields: source_session_id, model, timestamp

Closes #752
2026-04-16 01:23:17 -04:00