
Confidence Scoring — Routing Decisions Based on Model Certainty

💡 THE ANALOGY

Think of a radiologist's reporting system: 'Definitive finding: act immediately.' 'Probable finding: recommend follow-up.' 'Possible finding: note for awareness.' Each tier triggers a different workflow. Confidence scoring with Claude works the same way: each output is routed to the appropriate downstream process according to how certain the model reports being.

⚠️ EXAM TRAP — The Wrong Answer People Choose

The trap is treating confidence scoring as a binary pass/fail gate. The exam tests whether you know that confidence enables TIERED ROUTING: different confidence levels trigger different workflows, not just accept or reject.
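
To make the contrast with a pass/fail gate concrete, here is a minimal sketch of a three-way router. The threshold values (0.90 and 0.60) and the names `Tier` and `route` are illustrative assumptions, not values given in this lesson; in practice the cut-offs would be chosen from calibration data.

```python
from enum import Enum


class Tier(Enum):
    AUTO_PROCESS = "auto_process"   # high confidence
    SPOT_CHECK = "spot_check"       # medium confidence
    HUMAN_REVIEW = "human_review"   # low confidence


# Hypothetical cut-offs for illustration only.
HIGH_THRESHOLD = 0.90
MEDIUM_THRESHOLD = 0.60


def route(confidence: float) -> Tier:
    """Map a reported confidence in [0, 1] to one of three workflows."""
    if not 0.0 <= confidence <= 1.0:
        raise ValueError(f"confidence out of range: {confidence}")
    if confidence >= HIGH_THRESHOLD:
        return Tier.AUTO_PROCESS
    if confidence >= MEDIUM_THRESHOLD:
        return Tier.SPOT_CHECK
    return Tier.HUMAN_REVIEW
```

A binary gate would collapse the last two branches into a single "reject" outcome; the point of tiering is that a medium-confidence output gets a cheaper check than a low-confidence one.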

KEY POINTS
1 Three tiers: high confidence → auto-process, medium → spot-check a sample, low → human review.
2 Field-level confidence: individual fields in one extraction can have different confidence levels.
3 Claude can be prompted to self-report confidence as a field in its structured output; route each result based on that reported certainty.
4 Calibration: self-reported confidence must be measured against actual accuracy to be trusted.
5 Stratified sampling for calibration: sample from each tier and measure actual accuracy.
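
The key points above can be sketched in a few lines. This is an illustrative sketch only: the field layout (`{"value": ..., "confidence": ...}`), the thresholds, and the helper names `tier_of`, `route_extraction`, `stratified_sample`, and `accuracy_by_tier` are all assumptions made for the example, not a format or API defined by Claude or by this lesson.

```python
import random
from collections import defaultdict


def tier_of(confidence):
    """Assumed thresholds; tune them from calibration results."""
    if confidence >= 0.90:
        return "high"
    if confidence >= 0.60:
        return "medium"
    return "low"


def route_extraction(fields):
    """Field-level routing: each field gets its own tier."""
    return {name: tier_of(f["confidence"]) for name, f in fields.items()}


def stratified_sample(records, per_tier, seed=0):
    """Draw up to `per_tier` records from every tier, not just the low one."""
    rng = random.Random(seed)
    by_tier = defaultdict(list)
    for record in records:
        by_tier[tier_of(record["confidence"])].append(record)
    return {
        tier: rng.sample(items, min(per_tier, len(items)))
        for tier, items in by_tier.items()
    }


def accuracy_by_tier(sample, is_correct):
    """Measured accuracy per tier, to compare against reported confidence."""
    return {
        tier: sum(1 for r in items if is_correct(r)) / len(items)
        for tier, items in sample.items()
    }


# One extraction, three fields, three different tiers (hypothetical data).
extraction = {
    "invoice_number": {"value": "INV-001", "confidence": 0.97},
    "total": {"value": "1,250.00", "confidence": 0.72},
    "due_date": {"value": "2024-03-01", "confidence": 0.41},
}
print(route_extraction(extraction))
# → {'invoice_number': 'high', 'total': 'medium', 'due_date': 'low'}
```

In use, `is_correct` would be a human-verified label for each sampled record. If the measured accuracy of the "high" tier falls well below the confidence it reports, the auto-process threshold should not be trusted until it is adjusted.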