Collaborating Foundation Models for Domain Generalized Semantic Segmentation

Domain Generalized Semantic Segmentation (DGSS) deals with training a model on a labeled source domain with the aim of generalizing to unseen domains during inference. Existing DGSS methods typically effectuate robust features by means of Domain Randomization (DR). Such an approach is often limited as it can only account for style diversification and not content. In this work, we take an orthogonal approach to DGSS and propose to use an assembly of CoLlaborative FOUndation models for Domain Generalized Semantic Segmentation (CLOUDS). In detail, CLOUDS is a framework that integrates FMs of various kinds: (i) CLIP backbone for its robust feature representation, (ii) generative models to diversify the content, thereby covering various modes of the possible target distribution, and (iii) Segment Anything Model (SAM) for iteratively refining the predictions of the segmentation model. Extensive experiments show that our CLOUDS excels in adapting from synthetic to real DGSS benchmarks and under varying weather conditions, notably outperforming prior methods by 5.6% and 6.7% on averaged miou, respectively. The code is available at : https://github.com/yasserben/CLOUDS

翻译：域泛化语义分割（DGSS）旨在利用带标签的源域训练模型，使其在推理阶段泛化至未见过的目标域。现有DGSS方法通常通过域随机化（DR）实现鲁棒特征学习，但此类方法存在局限性，仅能处理风格多样化而无法应对内容变化。本研究提出一种正交策略，通过构建协作基础模型集成框架（CLOUDS）实现域泛化语义分割。具体而言，CLOUDS整合了多种类型的基础模型：（i）利用CLIP骨干网络提取鲁棒特征表示；（ii）采用生成模型增强内容多样性，从而覆盖目标分布的多种模态；（iii）引入Segment Anything Model（SAM）迭代优化分割模型的预测结果。大量实验表明，CLOUDS在合成到真实场景DGSS基准测试及不同天气条件下的域适应任务中表现优异，平均交并比分别较此前方法提升5.6%和6.7%。代码已开源：https://github.com/yasserben/CLOUDS

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/