The Galactica simulation database is a platform designed to assist computational astrophysicists with their open science approach based on FAIR (Findable, Accessible, Interoperable, Reusable) principles. It offers the means to publish their numerical simulation projects, whatever their field of application or research theme and provides access to reduced datasets and object catalogs online. The application implements the Simulation Datamodel IVOA standard. To provide the scientific community indirect access to raw simulation data, Galactica can generate, on an "on-demand" basis, custom high-level data products to meet specific user requirements. These data products, accessible through online WebServices, are produced remotely from the raw simulation datasets. To that end, the Galactica central web application communicates with a high-scalability ecosystem of data-processing servers called Terminus by means of an industry-proven asynchronous task management system. Each Terminus node, hosted in a research institute, a regional or national supercomputing facility, contributes to the ecosystem by providing both the storage and the computational resources required to store the massive simulation datasets and post-process them to create the data products requested on Galactica, hence guaranteeing fine-grained sovereignty over data and resources. This distributed architecture is very versatile, it can be interfaced with any kind of data-processing software, written in any language, handling raw data produced by every type of simulation code used in the field of computational astrophysics. Its generality and versatility, together with its excellent scalability makes it a powerful tool for the scientific community to disseminate numerical models in astrophysics in the exascale era.
翻译:Galactica模拟数据库是一个旨在帮助计算天体物理学家基于FAIR(可发现、可访问、可互操作、可重用)原则实施开放科学方法的平台。该平台提供了发布数值模拟项目的能力,无论其应用领域或研究主题为何,并允许在线访问精简数据集和天体目录。该应用实现了IVOA(国际虚拟天文台联盟)的模拟数据模型标准。为向科学界提供对原始模拟数据的间接访问,Galactica可根据用户特定需求,以"按需"方式生成定制化高级数据产品。这些通过在线WebServices访问的数据产品,是在原始模拟数据集的基础上进行远程处理生成的。为此,Galactica中央网络应用程序通过一个经过工业验证的异步任务管理系统,与一个名为Terminus的高可扩展性数据处理服务器生态系统进行通信。每个部署在研究机构、区域或国家级超级计算设施的Terminus节点,通过提供存储海量模拟数据集及后续处理所需的计算资源来贡献于该生态系统,从而生成Galactica平台所请求的数据产品,这保障了对数据和资源的精细主权控制。这种分布式架构具有高度通用性,可与任何类型的数据处理软件对接——无论采用何种编程语言编写,都能处理计算天体物理学领域各类模拟代码产生的原始数据。其普适性、多功能性以及卓越的可扩展性,使其成为科学界在百亿亿次计算时代传播天体物理学数值模型的强大工具。