Conformal prediction provides rigorous distribution-free finite-sample guarantees for marginal coverage under the assumption of exchangeability, but may exhibit systematic undercoverage or overcoverage for specific subpopulations. Assessing conditional validity is challenging, as standard stratification methods suffer from the curse of dimensionality. We propose Conformal Prediction Assessment (CPA), a framework that reframes the evaluation of conditional coverage as a supervised learning task by training a reliability estimator that predicts instance-level coverage probabilities. Building on this estimator, we introduce the Conditional Validity Index (CVI), which decomposes reliability into safety (undercoverage risk) and efficiency (overcoverage cost). We establish convergence rates for the reliability estimator and prove the consistency of CVI-based model selection. Extensive experiments on synthetic and real-world datasets demonstrate that CPA effectively diagnoses local failure modes and that CC-Select, our CVI-based model selection algorithm, consistently identifies predictors with superior conditional coverage performance.
翻译:共形预测在可交换性假设下提供了严格的分布无关有限样本边际覆盖保证,但可能对特定子群体出现系统性欠覆盖或过覆盖。由于标准分层方法受维数灾难影响,条件有效性的评估具有挑战性。我们提出共形预测评估(CPA)框架,通过训练可靠性估计器预测实例级覆盖概率,将条件覆盖评估重新定义为监督学习任务。基于该估计器,我们引入条件有效性指数(CVI),将可靠性分解为安全性(欠覆盖风险)与效率(过覆盖成本)。我们建立了可靠性估计器的收敛速率,并证明了基于CVI模型选择的一致性。在合成与真实数据集上的大量实验表明,CPA能有效诊断局部失效模式,而基于CVI的模型选择算法CC-Select始终能识别具有更优条件覆盖性能的预测器。