Score-based generative modeling, informally referred to as diffusion models, continue to grow in popularity across several important domains and tasks. While they provide high-quality and diverse samples from empirical distributions, important questions remain on the reliability and trustworthiness of these sampling procedures for their responsible use in critical scenarios. Conformal prediction is a modern tool to construct finite-sample, distribution-free uncertainty guarantees for any black-box predictor. In this work, we focus on image-to-image regression tasks and we present a generalization of the Risk-Controlling Prediction Sets (RCPS) procedure, that we term $K$-RCPS, which allows to $(i)$ provide entrywise calibrated intervals for future samples of any diffusion model, and $(ii)$ control a certain notion of risk with respect to a ground truth image with minimal mean interval length. Differently from existing conformal risk control procedures, ours relies on a novel convex optimization approach that allows for multidimensional risk control while provably minimizing the mean interval length. We illustrate our approach on two real-world image denoising problems: on natural images of faces as well as on computed tomography (CT) scans of the abdomen, demonstrating state of the art performance.
翻译:基于分数的生成建模(俗称扩散模型)在多个重要领域和任务中持续受到欢迎。尽管这些模型能从经验分布中生成高质量且多样化的样本,但在关键场景中负责任地使用这些采样过程时,其可靠性和可信度仍存在重大问题。共形预测是一种现代工具,能为任何黑箱预测器提供有限样本、无分布假设的不确定性保证。在本工作中,我们聚焦于图像到图像的回归任务,并提出了一种风险控制预测集(RCPS)过程的泛化方法,我们称之为$K$-RCPS,该方法能够$(i)$为任何扩散模型的未来样本提供逐元素校准的区间,且$(ii)$以最小平均区间长度控制相对于真实图像的风险概念。与现有的共形风险控制程序不同,我们的方法依赖于一种新颖的凸优化方法,该方法能在可证明地最小化平均区间长度的同时实现多维风险控制。我们通过两个真实世界的图像去噪问题(自然面部图像以及腹部计算机断层扫描(CT)图像)展示了该方法的性能达到了现有最优水平。