cDVGAN: One Flexible Model for Multi-class Gravitational Wave Signal and Glitch Generation

Simulating realistic time-domain observations of gravitational waves (GWs) and GW detector glitches can help in advancing GW data analysis. Simulated data can be used in downstream tasks by augmenting datasets for signal searches, balancing data sets for machine learning, and validating detection schemes. In this work, we present Conditional Derivative GAN (cDVGAN), a novel conditional model in the Generative Adversarial Network framework for simulating multiple classes of time-domain observations that represent gravitational waves (GWs) and detector glitches. cDVGAN can also generate generalized hybrid samples that span the variation between classes through interpolation in the conditioned class vector. cDVGAN introduces an additional player into the typical 2-player adversarial game of GANs, where an auxiliary discriminator analyzes the first-order derivative time-series. Our results show that this provides synthetic data that better captures the features of the original data. cDVGAN conditions on three classes, two denoised from LIGO blip and tomte glitch events from its 3rd observing run (O3), and the third representing binary black hole (BBH) mergers. Our proposed cDVGAN outperforms 4 different baseline GAN models in replicating the features of the three classes. Specifically, our experiments show that training convolutional neural networks (CNNs) with our cDVGAN-generated data improves the detection of samples embedded in detector noise beyond the synthetic data from other state-of-the-art GAN models. Our best synthetic dataset yields as much as a 4.2% increase in area-under-the-curve (AUC) performance compared to synthetic datasets from baseline GANs. Moreover, training the CNN with hybrid samples from our cDVGAN outperforms CNNs trained only on the standard classes, when identifying real samples embedded in LIGO detector background (4% AUC improvement for cDVGAN).

翻译：模拟引力波（GW）及探测器毛刺的时域观测数据有助于推动引力波数据分析的发展。模拟数据可服务于下游任务，包括扩充信号搜索数据集、平衡机器学习数据集以及验证检测方案。本文提出条件导数生成对抗网络（cDVGAN），这是一种基于生成对抗网络框架的新型条件模型，用于模拟代表引力波和探测器毛刺的多类时域观测数据。cDVGAN还能通过对条件类别向量进行插值，生成跨越类别间变化的广义混合样本。cDVGAN在标准GAN的双玩家对抗博弈中引入额外角色，通过辅助鉴别器分析一阶导数时间序列。结果表明，该方法生成的合成数据能更好地捕捉原始数据特征。cDVGAN对三类数据施加条件：其中两类来自LIGO第三次观测运行（O3）中经降噪处理的blip和tomte毛刺事件，第三类代表双黑洞（BBH）并合。我们提出的cDVGAN在复制三类数据特征方面优于四种基准GAN模型。具体而言，实验表明，使用cDVGAN生成数据训练卷积神经网络（CNN），在检测嵌入探测器噪声的样本时，其性能优于其他先进GAN模型生成的合成数据。与基准GAN的合成数据集相比，最优合成数据集的AUC（曲线下面积）性能提升高达4.2%。此外，使用cDVGAN生成的混合样本训练CNN，在识别嵌入LIGO探测器背景的真实样本时，优于仅使用标准类别训练的CNN（cDVGAN的AUC提升4%）。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日