[MEDIA-INGEST] Build a local manifest for Twitter archive videos with tweet text and hashtags preserved end-to-end #41

Closed
opened 2026-03-28 19:39:01 +00:00 by Timmy · 1 comment
Owner

Goal:
Extend the existing Know Thy Father archive lane so every Twitter video is indexed locally with the tweet post and hashtags carried through the pipeline.

Acceptance:

  • generate a local video manifest under ~/.timmy/twitter-archive/media/
  • each entry preserves: tweet id, timestamp, full text, hashtags, media path, media type, and source file
  • no raw Twitter media is uploaded to cloud services for analysis
  • document the schema and prove it on a real sample set

Critical requirement:
posts and hashtags are first-class metadata, not disposable decoration.

Goal: Extend the existing Know Thy Father archive lane so every Twitter video is indexed locally with the tweet post and hashtags carried through the pipeline. Acceptance: - generate a local video manifest under `~/.timmy/twitter-archive/media/` - each entry preserves: tweet id, timestamp, full text, hashtags, media path, media type, and source file - no raw Twitter media is uploaded to cloud services for analysis - document the schema and prove it on a real sample set Critical requirement: posts and hashtags are first-class metadata, not disposable decoration.
Timmy added this to the Art appreciation & ingestion milestone 2026-03-28 19:39:01 +00:00
Timmy self-assigned this 2026-03-28 19:39:01 +00:00
Timmy closed this issue 2026-03-28 19:58:34 +00:00
Author
Owner

Delivered.

Local proof:

  • ~/.timmy/twitter-archive/media/manifest.jsonl
  • ~/.timmy/twitter-archive/media/manifest_summary.json
  • records: 818
  • #timmyTime: 64
  • #TimmyChain: 33
  • media types: animated_gif=366, photo=234, video=218

Tests:

  • python3 -m pytest tests/test_twitter_video_manifest.py -q
  • python3 -m pytest tests -q
Delivered. Local proof: - `~/.timmy/twitter-archive/media/manifest.jsonl` - `~/.timmy/twitter-archive/media/manifest_summary.json` - records: 818 - `#timmyTime`: 64 - `#TimmyChain`: 33 - media types: animated_gif=366, photo=234, video=218 Tests: - `python3 -m pytest tests/test_twitter_video_manifest.py -q` - `python3 -m pytest tests -q`
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: Timmy_Foundation/timmy-home#41