Unsupervised Domain Transfer for Science: Exploring Deep Learning Methods for Translation between LArTPC Detector Simulations with Differing Response Models

MoDELS · 无监督 · Learning · 可约的 · domain shift ·

2023 年 4 月 25 日

翻译：科学中的无监督领域迁移：探索深度学习方法在具有不同响应模型的LArTPC探测器模拟间翻译

Yi Huang,Dmitrii Torbunov,Brett Viren,Haiwang Yu,Jin Huang,Meifeng Lin,Yihui Ren

Deep learning (DL) techniques have broad applications in science, especially in seeking to streamline the pathway to potential solutions and discoveries. Frequently, however, DL models are trained on the results of simulation yet applied to real experimental data. As such, any systematic differences between the simulated and real data may degrade the model's performance -- an effect known as "domain shift." This work studies a toy model of the systematic differences between simulated and real data. It presents a fully unsupervised, task-agnostic method to reduce differences between two systematically different samples. The method is based on the recent advances in unpaired image-to-image translation techniques and is validated on two sets of samples of simulated Liquid Argon Time Projection Chamber (LArTPC) detector events, created to illustrate common systematic differences between the simulated and real data in a controlled way. LArTPC-based detectors represent the next-generation particle detectors, producing unique high-resolution particle track data. This work open-sources the generated LArTPC data set, called Simple Liquid-Argon Track Samples (or SLATS), allowing researchers from diverse domains to study the LArTPC-like data for the first time.

翻译：深度学习技术广泛应用于科学领域，尤其在简化潜在解决方案与发现的路径方面具有显著价值。然而，深度学习模型常基于模拟结果进行训练，却应用于真实实验数据。因此，模拟数据与真实数据之间的任何系统性差异都可能降低模型性能——这种现象被称为“领域偏移”。本研究构建了一个模拟数据与真实数据间系统性差异的简化模型，并提出了一种完全无监督、任务无关的方法来减小两组系统差异样本之间的差距。该方法基于近期无配对图像到图像翻译技术的进展，并在两组模拟液态氩时间投影室（LArTPC）探测器事件样本上进行了验证。这些样本通过可控方式展现了模拟数据与真实数据之间常见的系统性差异。基于LArTPC的探测器代表了下一代粒子探测器技术，可生成独特的高分辨率粒子径迹数据。本研究开源了所生成的LArTPC数据集——称为简单液态氩径迹样本（SLATS），使不同领域的研究人员能够首次研究类似LArTPC的数据。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

计算机科学课程与视频课件合集，Computer Science courses with video lectures

专知会员服务

37+阅读 · 2022年1月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日