PseudoCal: A Source-Free Approach to Unsupervised Uncertainty Calibration in Domain Adaptation

Unsupervised domain adaptation (UDA) has witnessed remarkable advancements in improving the accuracy of models for unlabeled target domains. However, the calibration of predictive uncertainty in the target domain, a crucial aspect of the safe deployment of UDA models, has received limited attention. The conventional in-domain calibration method, \textit{temperature scaling} (TempScal), encounters challenges due to domain distribution shifts and the absence of labeled target domain data. Recent approaches have employed importance-weighting techniques to estimate the target-optimal temperature based on re-weighted labeled source data. Nonetheless, these methods require source data and suffer from unreliable density estimates under severe domain shifts, rendering them unsuitable for source-free UDA settings. To overcome these limitations, we propose PseudoCal, a source-free calibration method that exclusively relies on unlabeled target data. Unlike previous approaches that treat UDA calibration as a \textit{covariate shift} problem, we consider it as an unsupervised calibration problem specific to the target domain. Motivated by the factorization of the negative log-likelihood (NLL) objective in TempScal, we generate a labeled pseudo-target set that captures the structure of the real target. By doing so, we transform the unsupervised calibration problem into a supervised one, enabling us to effectively address it using widely-used in-domain methods like TempScal. Finally, we thoroughly evaluate the calibration performance of PseudoCal by conducting extensive experiments on 10 UDA methods, considering both traditional UDA settings and recent source-free UDA scenarios. The experimental results consistently demonstrate the superior performance of PseudoCal, exhibiting significantly reduced calibration error compared to existing calibration methods.

翻译：无监督领域自适应（UDA）在提升面向无标注目标域的模型准确性方面取得了显著进展。然而，作为确保UDA模型安全部署的关键因素，目标域预测不确定性的校准问题却鲜受关注。传统的域内校准方法——温度缩放（TempScal）——因领域分布偏移及缺乏带标注目标域数据而面临挑战。近期方法采用重要性加权技术，基于重新加权的带标签源数据估计目标域最优温度。但这类方法不仅需要源数据，且在严重领域偏移下易产生不可靠的密度估计，因此不适用于无源UDA场景。为克服这些限制，本文提出PseudoCal——一种仅依赖无标注目标数据的无源校准方法。与将UDA校准视为协变量偏移问题的传统思路不同，我们将其定位为面向目标域的特定无监督校准问题。受TempScal中负对数似然（NLL）目标函数分解的启发，我们生成能捕获真实目标域结构的带标签伪目标集，从而将无监督校准问题转化为有监督问题，进而可借助TempScal等广泛应用的域内方法有效求解。最后，通过在10种UDA方法（涵盖传统UDA设置与近期无源UDA场景）上开展大量实验，全面评估了PseudoCal的校准性能。实验结果表明，PseudoCal始终表现出卓越性能，其校准误差较现有校准方法显著降低。