Generative Model Based Noise Robust Training for Unsupervised Domain Adaptation

Target-domain pseudo-labelling has proven effective in unsupervised domain adaptation (UDA). However, pseudo-labels of unlabeled target-domain data are inevitably noisy because of the distribution shift between the source and target domains. This paper proposes a Generative model-based Noise-Robust Training method (GeNRT), which eliminates domain shift while mitigating label noise. GeNRT incorporates Distribution-based Class-wise Feature Augmentation (D-CFA) and Generative-Discriminative classifier Consistency (GDC), both based on class-wise target distributions modelled by generative models. D-CFA minimizes the domain gap by augmenting the source data with target features sampled from these distributions, and trains a noise-robust discriminative classifier using target-domain knowledge from the generative models. GDC treats the class-wise generative models collectively as a generative classifier and enforces consistency regularization between the generative and discriminative classifiers. It exploits an ensemble of target knowledge from all the generative models to train a noise-robust discriminative classifier, and it is theoretically linked to the Ben-David domain adaptation theorem for reducing the domain gap. Extensive experiments on Office-Home, PACS, and Digit-Five show that GeNRT achieves performance comparable to state-of-the-art methods under both single-source and multi-source UDA settings.
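The two components described above can be illustrated with a small NumPy sketch. It is not the authors' implementation: the abstract does not say which generative model is used, so a diagonal Gaussian per class stands in for it here, the class priors are assumed uniform, and every function name below is hypothetical. The sketch fits one distribution per class to pseudo-labelled target features, samples target-like features from it (the idea behind D-CFA), and measures the disagreement between the resulting generative classifier and a discriminative classifier with a KL term (the idea behind GDC).

```python
import numpy as np


def fit_class_gaussians(feats, pseudo_labels, num_classes, eps=1e-3):
    """Fit one diagonal Gaussian per class to pseudo-labelled target features.

    Assumption: a diagonal Gaussian stands in for the per-class generative model.
    """
    dim = feats.shape[1]
    means = np.zeros((num_classes, dim))
    variances = np.ones((num_classes, dim))
    for c in range(num_classes):
        members = feats[pseudo_labels == c]
        if len(members) > 0:  # classes without pseudo-labels keep the default
            means[c] = members.mean(axis=0)
            variances[c] = members.var(axis=0) + eps
    return means, variances


def sample_target_features(means, variances, labels, rng):
    """Draw one target-like feature for each requested class label."""
    noise = rng.standard_normal((len(labels), means.shape[1]))
    return means[labels] + np.sqrt(variances[labels]) * noise


def generative_posterior(feats, means, variances):
    """Class posterior from Bayes' rule, assuming uniform class priors."""
    diff = feats[:, None, :] - means[None, :, :]          # (N, C, D)
    log_lik = -0.5 * np.sum(
        diff ** 2 / variances + np.log(2.0 * np.pi * variances), axis=2
    )                                                     # (N, C)
    log_lik -= log_lik.max(axis=1, keepdims=True)         # numerical stability
    probs = np.exp(log_lik)
    return probs / probs.sum(axis=1, keepdims=True)


def consistency_loss(gen_probs, disc_probs, tiny=1e-12):
    """Mean KL(generative || discriminative) over a batch of predictions."""
    kl = gen_probs * (np.log(gen_probs + tiny) - np.log(disc_probs + tiny))
    return float(np.mean(np.sum(kl, axis=1)))
```

In a training loop under these assumptions, the sampled features would be mixed with source features before the classifier loss is computed, and the consistency term would be added to that loss with a weighting coefficient chosen by the practitioner.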

