feat: Quality gate — score and filter knowledge entries #198

Closed
opened 2026-04-15 15:18:06 +00:00 by Rockachopa · 0 comments

Epic: #136 (Knowledge Pipeline v2)

Task

Score each knowledge entry on quality and filter low-value entries.

Quality Dimensions

  • Specificity: concrete examples vs vague generalities
  • Actionability: can this be used to do something?
  • Freshness: is this still accurate?
  • Source quality: was the model/provider reliable?
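The issue does not say how each dimension is computed. As one illustration only, freshness could be approximated from the entry's age with exponential decay. The half-life value, the `now` parameter, and the use of age as a proxy for "still accurate" are all assumptions made for this sketch, not part of the issue.

```python
from datetime import datetime, timezone
from typing import Optional

# Hypothetical half-life: freshness halves every 180 days (assumed, not from the issue).
HALF_LIFE_DAYS = 180.0


def score_freshness(timestamp: datetime, now: Optional[datetime] = None) -> float:
    """Map an entry's age to a score in (0, 1] using exponential decay.

    Both datetimes are expected to be timezone-aware.
    """
    if now is None:
        now = datetime.now(timezone.utc)
    # Timestamps in the future are treated as age zero.
    age_days = max((now - timestamp).total_seconds() / 86400.0, 0.0)
    return 0.5 ** (age_days / HALF_LIFE_DAYS)
```

Under this assumed scheme a brand-new entry scores 1.0 and an entry one half-life old scores 0.5. Age alone cannot tell whether content is still accurate, so a real scorer may need additional signals.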

Scoring

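def quality_score(entry: KnowledgeEntry) -> float:
    # Content-based dimensions, scored from the entry text.
    specificity = score_specificity(entry.content)
    actionability = score_actionability(entry.content)
    # Provenance-based dimensions: entry timestamp and the model that produced it.
    freshness = score_freshness(entry.provenance.timestamp)
    # Note: raises KeyError for a model missing from SOURCE_QUALITY_MAP.
    source_quality = SOURCE_QUALITY_MAP[entry.provenance.model]

    # Weights sum to 1.0, so the result stays in [0, 1] when every component does.
    return 0.3*specificity + 0.3*actionability + 0.2*freshness + 0.2*source_quality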
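To make the weighting concrete, here is a small worked example of the same weighted sum. It is a sketch only: the `WEIGHTS` table and the `combine` helper are hypothetical names, and the component scores are made-up inputs assumed to lie in [0, 1].

```python
from typing import Mapping

# Weights taken from the issue's formula; they sum to 1.0.
WEIGHTS = {
    "specificity": 0.3,
    "actionability": 0.3,
    "freshness": 0.2,
    "source_quality": 0.2,
}


def combine(scores: Mapping[str, float]) -> float:
    """Weighted sum of the four dimension scores (hypothetical helper)."""
    return sum(weight * scores[name] for name, weight in WEIGHTS.items())


# Made-up component scores, for illustration only.
example = {
    "specificity": 0.8,
    "actionability": 0.6,
    "freshness": 0.5,
    "source_quality": 0.9,
}

# 0.3*0.8 + 0.3*0.6 + 0.2*0.5 + 0.2*0.9 = 0.24 + 0.18 + 0.10 + 0.18 = 0.70
total = combine(example)
```

With these inputs the entry scores about 0.70, which is above the 0.5 threshold listed under Deliverables.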

Deliverables

  • Quality scorer: compounding-intelligence/quality_gate.py
  • Threshold: entries scoring < 0.5 are filtered out
  • Report: quality distribution of harvested knowledge
  • Integration with training data pipeline
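The threshold and report deliverables could be sketched as below. This is an assumption about shape, not the contents of `quality_gate.py`: `apply_gate`, `score_histogram`, the `(entry_id, score)` tuple format, and the five-bin layout are hypothetical choices for illustration.

```python
from collections import Counter
from typing import Iterable, List, Tuple

THRESHOLD = 0.5  # from the issue: entries scoring < 0.5 are filtered out


def apply_gate(scored: Iterable[Tuple[str, float]]) -> List[Tuple[str, float]]:
    """Keep the (entry_id, score) pairs that score at or above the threshold."""
    return [(entry_id, score) for entry_id, score in scored if score >= THRESHOLD]


def score_histogram(scores: Iterable[float], bins: int = 5) -> Counter:
    """Count scores per equal-width bin over [0, 1].

    Bin i covers [i/bins, (i+1)/bins); a score of exactly 1.0 goes in the last bin.
    """
    counts: Counter = Counter()
    for score in scores:
        index = min(int(score * bins), bins - 1)
        counts[index] += 1
    return counts


# Made-up scores, for illustration only.
scored = [("a", 0.7), ("b", 0.49), ("c", 0.5), ("d", 1.0)]
kept = apply_gate(scored)
histogram = score_histogram(score for _, score in scored)
```

An entry scoring exactly 0.5 is kept here, since the issue filters only scores strictly below 0.5. The histogram is computed over all scored entries, before filtering, so the report reflects the full harvested set.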

Labels: quality, filtering, priority:high

hermes was assigned by Rockachopa 2026-04-15 16:23:29 +00:00
hermes was unassigned by Rockachopa 2026-04-17 05:06:16 +00:00
Reference: Timmy_Foundation/compounding-intelligence#198