Slim multi-scale convolutional autoencoder-based reduced-order models for interpretable features of a complex dynamical system

In recent years, data-driven deep learning models have gained significant interest in the analysis of turbulent dynamical systems. Within the context of reduced-order models (ROMs), convolutional autoencoders (CAEs) pose a universally applicable alternative to conventional approaches. They can learn nonlinear transformations directly from data, without prior knowledge of the system. However, the features generated by such models lack interpretability. Thus, the resulting model is a black-box which effectively reduces the complexity of the system, but does not provide insights into the meaning of the latent features. To address this critical issue, we introduce a novel interpretable CAE approach for high-dimensional fluid flow data that maintains the reconstruction quality of conventional CAEs and allows for feature interpretation. Our method can be easily integrated into any existing CAE architecture with minor modifications of the training process. We compare our approach to Proper Orthogonal Decomposition (POD) and two existing methods for interpretable CAEs. We apply all methods to three different experimental turbulent Rayleigh-B\'enard convection datasets with varying complexity. Our results show that the proposed method is lightweight, easy to train, and achieves relative reconstruction performance improvements of up to 6.4% over POD for 64 modes. The relative improvement increases to up to 229.8% as the number of modes decreases. Additionally, our method delivers interpretable features similar to those of POD and is significantly less resource-intensive than existing CAE approaches, using less than 2% of the parameters. These approaches either trade interpretability for reconstruction performance or only provide interpretability to a limited extend.

翻译：近年来，数据驱动的深度学习模型在湍流动力系统分析中受到广泛关注。在降阶模型框架下，卷积自编码器为传统方法提供了一种普适性替代方案。它们能够直接从数据中学习非线性变换，无需系统先验知识。然而，此类模型生成的特征缺乏可解释性，导致最终模型成为黑箱系统：虽能有效降低系统复杂度，却无法揭示潜在特征的内在含义。为应对这一关键问题，我们提出了一种面向高维流体流动数据的可解释卷积自编码器新方法，该方法在保持传统卷积自编码器重建质量的同时实现了特征可解释化。本方法仅需对训练过程进行微调即可轻松集成到现有卷积自编码器架构中。我们将所提方法与本征正交分解及两种现有可解释卷积自编码器方法进行对比，并将所有方法应用于三个复杂度各异的实验性湍流Rayleigh-Bénard对流数据集。实验结果表明：所提方法具有轻量化、易训练的特性，在64模态下相比本征正交分解实现最高6.4%的相对重建性能提升，且随着模态数减少，相对提升率最高可达229.8%。此外，本方法生成的可解释特征与本征正交分解具有相似性，其资源消耗显著低于现有卷积自编码器方法（参数量使用率低于2%）。现有方法往往需要在可解释性与重建性能之间进行权衡，或仅能提供有限程度的可解释性。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

《生成式模型: 变分自编码器与扩散模型》，75页ppt，Google DeepMind科学家Ruiqi Gao

专知会员服务

66+阅读 · 2023年6月10日

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【ACL2020】多模态信息抽取，365页ppt

专知会员服务

151+阅读 · 2020年7月6日

随机特征核近似综述: 算法与理论，Random Features for Kernel Approximation: A Survey in Algorithms, Theory, and Beyond

专知会员服务

33+阅读 · 2020年4月26日