Indexing Annotation Hierarchy
Coined by Jason Barnard in 2025. The Indexing Annotation Hierarchy is the five-level framework for understanding how 24+ annotation dimensions are processed during indexing: Gatekeepers (binary elimination), Core Identity (universal meaning), Selection Filters (pool routing), Confidence Multipliers (ranking within pool), and Extraction Quality (deployment format). Each annotation carries its own confidence score. This hierarchy reveals why good content fails - it may pass some levels but fail others.
┌─────────────────────────────────────────────────────────────────────────────┐ │ INDEXING ANNOTATION HIERARCHY │ │ How Search Bots Process 24+ Annotation Dimensions During Indexing │ │ Coined by Jason Barnard, 2025 │ └─────────────────────────────────────────────────────────────────────────────┘ ┌─────────────────────────────────────────────────────────────────────────────┐ │ THE FIVE-LEVEL HIERARCHY │ ├─────────────────────────────────────────────────────────────────────────────┤ │ │ │ CHUNK ENTERS INDEXING │ │ │ │ │ ▼ │ │ ┌─────────────────────────────────────────────────────────────────────┐ │ │ │ LEVEL 1: GATEKEEPERS (Binary: IN or OUT) │ │ │ │ ═══════════════════════════════════════ │ │ │ │ • Temporal/Freshness • Geographic Relevance │ │ │ │ • Language Quality • Entity Disambiguation │ │ │ │ │ │ │ │ EFFECT: Eliminate chunk instantly if mismatch │ │ │ │ FAIL ANY GATE → CHUNK ELIMINATED FROM THAT QUERY POOL │ │ │ └─────────────────────────┬───────────────────────────────────────────┘ │ │ │ PASS │ │ ▼ │ │ ┌─────────────────────────────────────────────────────────────────────┐ │ │ │ LEVEL 2: CORE IDENTITY (Universal: Always Extracted) │ │ │ │ ════════════════════════════════════════════════ │ │ │ │ • Entities (focus + supporting) • Attributes (facts) │ │ │ │ • Relationships (connections) • Sentiment (tone) │ │ │ │ │ │ │ │ EFFECT: Define what the chunk IS about │ │ │ │ WITHOUT THESE → CHUNK HAS NO MEANING │ │ │ └─────────────────────────┬───────────────────────────────────────────┘ │ │ │ │ │ ▼ │ │ ┌─────────────────────────────────────────────────────────────────────┐ │ │ │ LEVEL 3: SELECTION FILTERS (Routing: Which Pool?) │ │ │ │ ═════════════════════════════════════════════════ │ │ │ │ • Intent Match • Expertise Level │ │ │ │ • Claim Type • Actionability │ │ │ │ │ │ │ │ EFFECT: Route chunk to correct competition queue │ │ │ │ WRONG POOL → COMPETES AGAINST WRONG CONTENT │ │ │ └─────────────────────────┬───────────────────────────────────────────┘ │ │ │ │ │ ▼ │ │ ┌─────────────────────────────────────────────────────────────────────┐ │ │ │ LEVEL 4: CONFIDENCE MULTIPLIERS (Ranking: Where in Pool?) │ │ │ │ ══════════════════════════════════════════════════════ │ │ │ │ • Verifiability • Provenance • Corroboration Count │ │ │ │ • Specificity • Evidence Type • Controversy Score │ │ │ │ • Outlier Flag │ │ │ │ │ │ │ │ EFFECT: Scale confidence up or down within the pool │ │ │ │ LOW MULTIPLIERS → RANKED BELOW COMPETITORS │ │ │ └─────────────────────────┬───────────────────────────────────────────┘ │ │ │ │ │ ▼ │ │ ┌─────────────────────────────────────────────────────────────────────┐ │ │ │ LEVEL 5: EXTRACTION QUALITY (Usage: How Deployed?) │ │ │ │ ══════════════════════════════════════════════════ │ │ │ │ • Sufficiency • Dependency • Standalone Score │ │ │ │ • Entity Salience • Entity Role │ │ │ │ │ │ │ │ EFFECT: Determine how chunk appears in AI output │ │ │ │ LOW QUALITY → SUMMARIZED NOT QUOTED, NEEDS CONTEXT │ │ │ └─────────────────────────────────────────────────────────────────────┘ │ │ │ │ ════════════════════════════════════════════════════════════════════════ │ │ │ │ ┌─────────────────────────────────────────────────────────────────────┐ │ │ │ META: CONFIDENCE SCORE │ │ │ │ ═══════════════════════ │ │ │ │ Applied to EVERY annotation at EVERY level │ │ │ │ = Model's certainty in the accuracy of that specific annotation │ │ │ │ │ │ │ │ 24 annotations × confidence scores = true annotation complexity │ │ │ └─────────────────────────────────────────────────────────────────────┘ │ │ │ └─────────────────────────────────────────────────────────────────────────────┘ ┌─────────────────────────────────────────────────────────────────────────────┐ │ WHY GOOD CONTENT FAILS │ ├─────────────────────────────────────────────────────────────────────────────┤ │ │ │ ┌─────────────────────────────────────────────────────────────────────┐ │ │ │ │ │ │ │ FAILURE AT LEVEL 1 (Gatekeepers) │ │ │ │ → Content eliminated instantly, never competes │ │ │ │ Example: Outdated date on time-sensitive query │ │ │ │ │ │ │ │ FAILURE AT LEVEL 2 (Core Identity) │ │ │ │ → Content has no clear meaning, ignored │ │ │ │ Example: Ambiguous entity references │ │ │ │ │ │ │ │ FAILURE AT LEVEL 3 (Selection Filters) │ │ │ │ → Content competes in wrong pool, loses to better matches │ │ │ │ Example: Expert content shown for beginner query │ │ │ │ │ │ │ │ FAILURE AT LEVEL 4 (Confidence Multipliers) │ │ │ │ → Content ranked below competitors in same pool │ │ │ │ Example: First-party claims without third-party validation │ │ │ │ │ │ │ │ FAILURE AT LEVEL 5 (Extraction Quality) │ │ │ │ → Content used poorly: summarized not quoted, needs context │ │ │ │ Example: Incomplete answer requiring surrounding chunks │ │ │ │ │ │ │ └─────────────────────────────────────────────────────────────────────┘ │ │ │ │ ┌─────────────────────────────────────────────────────────────────────┐ │ │ │ │ │ │ │ THE INSIGHT: Content can pass four levels and fail at one. │ │ │ │ Understanding the hierarchy reveals WHERE optimization is needed.│ │ │ │ │ │ │ └─────────────────────────────────────────────────────────────────────┘ │ │ │ └─────────────────────────────────────────────────────────────────────────────┘
No posts found for this tag.
No pages found for this tag.
