Score-based generative modeling, informally referred to as diffusion models, continue to grow in popularity across several important domains and tasks. While they provide high-quality and diverse samples from empirical distributions, important questions remain on the reliability and trustworthiness of these sampling procedures for their responsible use in critical scenarios. Conformal prediction is a modern tool to construct finite-sample, distribution-free uncertainty guarantees for any black-box predictor. In this work, we focus on image-to-image regression tasks and we present a generalization of the Risk-Controlling Prediction Sets (RCPS) procedure, that we term $K$-RCPS, which allows to $(i)$ provide entrywise calibrated intervals for future samples of any diffusion model, and $(ii)$ control a certain notion of risk with respect to a ground truth image with minimal mean interval length. Differently from existing conformal risk control procedures, ours relies on a novel convex optimization approach that allows for multidimensional risk control while provably minimizing the mean interval length. We illustrate our approach on two real-world image denoising problems: on natural images of faces as well as on computed tomography (CT) scans of the abdomen, demonstrating state of the art performance.
翻译:基于分数的生成建模(俗称扩散模型)在多个重要领域和任务中持续增长其流行度。尽管这些模型能从经验分布中生成高质量且多样化的样本,但在关键场景中负责任地使用它们时,关于采样过程的可靠性和可信度仍存在重要问题。共形预测是一种现代工具,可为任何黑箱预测器提供有限样本、无分布假设的不确定性保证。本文聚焦于图像到图像的回归任务,提出了一种风险控制预测集(RCPS)流程的推广,称为$K$-RCPS。该方法能够:(i)为未来任意扩散模型的样本提供逐项校准的区间;(ii)在最小化平均区间长度的前提下,控制相对于真实图像的某种风险度量。与现有的共形风险控制流程不同,我们的方法基于一种新颖的凸优化方法,可在实现多维风险控制的同时,可证明地最小化平均区间长度。我们在两个真实图像去噪问题中展示了该方法:面部自然图像以及腹部计算机断层扫描(CT)影像,并实现了最优性能。