SRN-SZ: Deep Leaning-Based Scientific Error-bounded Lossy Compression with Super-resolution Neural Networks

The fast growth of computational power and scales of modern super-computing systems have raised great challenges for the management of exascale scientific data. To maintain the usability of scientific data, error-bound lossy compression is proposed and developed as an essential technique for the size reduction of scientific data with constrained data distortion. Among the diverse datasets generated by various scientific simulations, certain datasets cannot be effectively compressed by existing error-bounded lossy compressors with traditional techniques. The recent success of Artificial Intelligence has inspired several researchers to integrate neural networks into error-bounded lossy compressors. However, those works still suffer from limited compression ratios and/or extremely low efficiencies. To address those issues and improve the compression on the hard-to-compress datasets, in this paper, we propose SRN-SZ, which is a deep learning-based scientific error-bounded lossy compressor leveraging the hierarchical data grid expansion paradigm implemented by super-resolution neural networks. SRN-SZ applies the most advanced super-resolution network HAT for its compression, which is free of time-costing per-data training. In experiments compared with various state-of-the-art compressors, SRN-SZ achieves up to 75% compression ratio improvements under the same error bound and up to 80% compression ratio improvements under the same PSNR than the second-best compressor.

翻译：现代超级计算系统的计算能力和规模快速增长，给百亿亿级科学数据的管理带来了巨大挑战。为保持科学数据的可用性，误差有界有损压缩被提出并发展为一种在约束数据失真的前提下缩减科学数据规模的关键技术。在各类科学模拟生成的多维数据集中，部分数据集无法被现有采用传统技术的误差有界有损压缩器有效压缩。人工智能领域的近期成功促使研究者将神经网络集成到误差有界有损压缩器中。然而，这些工作仍存在压缩率有限或效率极低等问题。为解决上述难题并提升对难压缩数据集的压缩效果，本文提出SRN-SZ——一种基于深度学习的科学误差有界有损压缩器，其利用超分辨率神经网络实现分级数据网格扩展范式。SRN-SZ采用当前最先进的超分辨率网络HAT进行压缩，该方法无需耗时的逐数据训练。在与多种最先进压缩器的对比实验中，相同误差界条件下，SRN-SZ的压缩率提升最高达75%；在相同峰值信噪比条件下，相较于次优压缩器，其压缩率提升最高达80%。

相关内容

Networking

关注 23

Networking：IFIP International Conferences on Networking。 Explanation：国际网络会议。 Publisher：IFIP。 SIT： http://dblp.uni-trier.de/db/conf/networking/index.html

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日