- schema.json: JSON Schema for scene description entries (song, artist,
beat, timestamp, duration_seconds, lyric_line, scene with
mood/colors/composition/camera/description)
- validate.py: CLI validator for JSONL files. Does not require the
jsonschema package; falls back to manual checks. Validates every field
for presence, type, range, and empty values.
- .gitea/workflows/validate-training-data.yml: CI check on PR
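
For reviewers, here is a minimal sketch of what manual (no-jsonschema) validation of one entry can look like. It is not the code in validate.py. The field types, the "positive duration" rule, and the names `check_fields`, `validate_entry`, and `validate_jsonl_lines` are all assumptions made for illustration; schema.json remains the source of truth.

```python
import json

# Assumed field types for illustration only; the real schema.json may differ.
TOP_LEVEL = {
    "song": str,
    "artist": str,
    "beat": int,
    "timestamp": str,
    "duration_seconds": (int, float),
    "lyric_line": str,
    "scene": dict,
}
SCENE = {
    "mood": str,
    "colors": list,
    "composition": str,
    "camera": str,
    "description": str,
}


def check_fields(obj, spec, prefix=""):
    """Return error strings for missing, mistyped, or empty fields."""
    errors = []
    for name, expected in spec.items():
        path = prefix + name
        if name not in obj:
            errors.append(f"{path}: missing")
            continue
        value = obj[name]
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, expected):
            errors.append(f"{path}: wrong type {type(value).__name__}")
        elif isinstance(value, (str, list, dict)) and len(value) == 0:
            errors.append(f"{path}: empty")
    return errors


def validate_entry(entry):
    """Validate one decoded JSONL entry; an empty list means it passed."""
    if not isinstance(entry, dict):
        return ["entry: not an object"]
    errors = check_fields(entry, TOP_LEVEL)
    scene = entry.get("scene")
    if isinstance(scene, dict) and scene:
        errors += check_fields(scene, SCENE, "scene.")
    duration = entry.get("duration_seconds")
    # Assumed range rule: a duration must be strictly positive.
    if (isinstance(duration, (int, float))
            and not isinstance(duration, bool)
            and duration <= 0):
        errors.append("duration_seconds: must be positive")
    return errors


def validate_jsonl_lines(lines):
    """Return (line_number, errors) pairs for every failing line."""
    failures = []
    for number, line in enumerate(lines, 1):
        if not line.strip():
            continue  # skip blank lines
        try:
            entry = json.loads(line)
        except json.JSONDecodeError as exc:
            failures.append((number, [f"invalid JSON: {exc.msg}"]))
            continue
        errors = validate_entry(entry)
        if errors:
            failures.append((number, errors))
    return failures
```

Passing an open file handle to `validate_jsonl_lines` checks a whole JSONL file, and an empty result means every entry passed. Collecting all errors per line, rather than stopping at the first, keeps CI output useful when one entry has several problems.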
Tested: 100/100 entries in scene-descriptions-r&b-soul.jsonl pass.
Closes #647