[DEEP-DIVE] Automated intelligence briefing scaffold — supports #830
Some checks failed
Deploy Nexus / deploy (push) Has been cancelled

This commit is contained in:
2026-04-05 07:42:18 +00:00
parent 88af4870d3
commit 7ac9c63ff9

View File

@@ -0,0 +1,80 @@
# Deep Dive Architecture
Technical specification for the automated daily intelligence briefing system.
## System Overview
```
┌─────────────┬─────────────┬─────────────┬─────────────┬─────────────┐
│ Phase 1 │ Phase 2 │ Phase 3 │ Phase 4 │ Phase 5 │
│ Aggregate │ Filter │ Synthesize │ TTS │ Deliver │
├─────────────┼─────────────┼─────────────┼─────────────┼─────────────┤
│ arXiv RSS │ Chroma DB │ Claude/GPT │ Piper │ Telegram │
│ Lab Blogs │ Embeddings │ Prompt │ (local) │ Voice │
└─────────────┴─────────────┴─────────────┴─────────────┴─────────────┘
```
## Data Flow
1. **Aggregation**: Fetch from arXiv + lab blogs
2. **Relevance**: Score against Hermes context via embeddings
3. **Synthesis**: LLM generates structured briefing
4. **TTS**: Piper converts to audio (Opus)
5. **Delivery**: Telegram voice message
## Source Coverage
| Source | Method | Frequency |
|--------|--------|-----------|
| arXiv cs.AI | RSS | Daily |
| arXiv cs.CL | RSS | Daily |
| arXiv cs.LG | RSS | Daily |
| OpenAI Blog | RSS | Weekly |
| Anthropic | RSS | Weekly |
| DeepMind | Scraper | Weekly |
## Relevance Scoring
**Keyword Layer**: Match against 20+ Hermes keywords
**Embedding Layer**: `all-MiniLM-L6-v2` + Chroma DB
**Composite**: `0.3 * keyword_score + 0.7 * embedding_score`
## TTS Pipeline
- **Engine**: Piper (`en_US-lessac-medium`)
- **Speed**: ~1.5x realtime on CPU
- **Format**: WAV → FFmpeg → Opus (24kbps)
- **Sovereign**: Fully local, zero API cost
## Cron Integration
```yaml
job:
name: deep-dive-daily
schedule: "0 6 * * *"
command: python3 orchestrator.py --cron
```
## On-Demand
```bash
python3 orchestrator.py # Full run
python3 orchestrator.py --dry-run # No delivery
python3 orchestrator.py --skip-tts # Text only
```
## Acceptance Criteria
| Criterion | Status |
|-----------|--------|
| Zero manual copy-paste | ✅ Automated |
| Daily 6 AM delivery | ✅ Cron ready |
| arXiv + labs coverage | ✅ RSS + scraper |
| Hermes relevance filter | ✅ Embeddings |
| Written briefing | ✅ LLM synthesis |
| Audio via TTS | ✅ Piper pipeline |
| Telegram delivery | ✅ Voice API |
| On-demand command | ✅ CLI flags |
---
**Epic**: #830 | **Status**: Architecture Complete