[EZRA BURN-MODE] Deep Dive scaffold directory guide
Some checks failed
Deploy Nexus / deploy (push) Has been cancelled
Some checks failed
Deploy Nexus / deploy (push) Has been cancelled
This commit is contained in:
61
scaffold/deepdive/README.md
Normal file
61
scaffold/deepdive/README.md
Normal file
@@ -0,0 +1,61 @@
|
|||||||
|
# Deep Dive Scaffold
|
||||||
|
|
||||||
|
> Parent: the-nexus#830
|
||||||
|
> Created: 2026-04-05
|
||||||
|
|
||||||
|
This directory contains phase-by-phase implementation skeletons for the Deep Dive automated intelligence briefing system.
|
||||||
|
|
||||||
|
## Directory Structure
|
||||||
|
|
||||||
|
```
|
||||||
|
scaffold/deepdive/
|
||||||
|
├── phase1/ # Source aggregation (ZERO blockers, can start now)
|
||||||
|
│ ├── arxiv_aggregator.py ← Run this today
|
||||||
|
│ ├── blog_scraper.py (stub)
|
||||||
|
│ └── config.yaml
|
||||||
|
├── phase2/ # Relevance engine (needs Phase 1)
|
||||||
|
│ ├── relevance_engine.py (stub)
|
||||||
|
│ └── embeddings.py (stub)
|
||||||
|
├── phase3/ # Synthesis (needs Phase 2)
|
||||||
|
│ ├── synthesis.py (stub)
|
||||||
|
│ └── briefing_template.md
|
||||||
|
├── phase4/ # TTS pipeline (needs Phase 3)
|
||||||
|
│ ├── tts_pipeline.py (stub)
|
||||||
|
│ └── piper_config.json
|
||||||
|
└── phase5/ # Delivery (needs Phase 4)
|
||||||
|
├── telegram_delivery.py (stub)
|
||||||
|
└── deepdive_command.py (stub)
|
||||||
|
```
|
||||||
|
|
||||||
|
## Quick Start
|
||||||
|
|
||||||
|
### Phase 1 (Today)
|
||||||
|
|
||||||
|
```bash
|
||||||
|
cd the-nexus/scaffold/deepdive/phase1
|
||||||
|
python3 arxiv_aggregator.py
|
||||||
|
```
|
||||||
|
|
||||||
|
**Requirements**: Python 3.8+, internet connection, no API keys.
|
||||||
|
|
||||||
|
**Output**: `data/deepdive/raw/arxiv-YYYY-MM-DD.jsonl`
|
||||||
|
|
||||||
|
## Sovereignty Preservation
|
||||||
|
|
||||||
|
| Component | Local Option | Cloud Fallback |
|
||||||
|
|-----------|-------------|----------------|
|
||||||
|
| Embeddings | nomic-embed-text via llama.cpp | OpenAI |
|
||||||
|
| LLM | Gemma 4 via Hermes | Kimi K2.5 |
|
||||||
|
| TTS | Piper | ElevenLabs |
|
||||||
|
|
||||||
|
**Rule**: Implement local first, add cloud fallback only if quality unacceptable.
|
||||||
|
|
||||||
|
## Next Steps
|
||||||
|
|
||||||
|
1. ✅ **Phase 1**: Run `arxiv_aggregator.py` to validate fetch pipeline
|
||||||
|
2. ⏳ **Phase 2**: Implement `relevance_engine.py` with embeddings
|
||||||
|
3. ⏳ **Phase 3**: Draft `synthesis.py` with prompt templates
|
||||||
|
4. ⏳ **Phase 4**: Test `tts_pipeline.py` with Piper
|
||||||
|
5. ⏳ **Phase 5**: Integrate `telegram_delivery.py` with Hermes gateway
|
||||||
|
|
||||||
|
See `docs/deep-dive-architecture.md` for full technical specification.
|
||||||
Reference in New Issue
Block a user