Standardized Benchmark Dataset for Localized Exposure to a Realistic Source at 10$-$90 GHz

The lack of freely available standardized datasets represents an aggravating factor during the development and testing the performance of novel computational techniques in exposure assessment and dosimetry research. This hinders progress as researchers are required to generate numerical data (field, power and temperature distribution) anew using simulation software for each exposure scenario. Other than being time consuming, this approach is highly susceptible to errors that occur during the configuration of the electromagnetic model. To address this issue, in this paper, the limited available data on the incident power density and resultant maximum temperature rise on the skin surface considering various steady-state exposure scenarios at 10$-$90 GHz have been statistically modeled. The synthetic data have been sampled from the fitted statistical multivariate distribution with respect to predetermined dosimetric constraints. We thus present a comprehensive and open-source dataset compiled of the high-fidelity numerical data considering various exposures to a realistic source. Furthermore, different surrogate models for predicting maximum temperature rise on the skin surface were fitted based on the synthetic dataset. All surrogate models were tested on the originally available data where satisfactory predictive performance has been demonstrated. A simple technique of combining quadratic polynomial and tensor-product spline surrogates, each operating on its own cluster of data, has achieved the lowest mean absolute error of 0.058 {\deg}C. Therefore, overall experimental results indicate the validity of the proposed synthetic dataset.

翻译：缺乏免费可用的标准化数据集是暴露评估与剂量学研究领域开发和测试新型计算方法性能时的一个加重因素。这阻碍了进展，因为研究人员需要为每个暴露场景重新使用仿真软件生成数值数据（场分布、功率分布和温度分布）。除了耗时之外，该方法极易在电磁模型配置过程中发生错误。为解决此问题，本文对10–90 GHz稳态暴露场景下有限的入射功率密度及皮肤表面最大温升数据进行了统计建模。从拟合的统计多元分布中采样合成数据，并满足预定的剂量学约束。因此，我们提供了一个综合且开源的数据集，其中包含考虑真实源多种暴露情况的高保真数值数据。此外，基于合成数据集拟合了用于预测皮肤表面最大温升的不同替代模型。所有替代模型均在原始可用数据上进行了测试，并展现出令人满意的预测性能。结合二次多项式与张量积样条替代模型的简单技术（每种模型在其自身数据簇上运行）实现了最低平均绝对误差0.058°C。因此，整体实验结果表明所提出的合成数据集具有有效性。

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

60+阅读 · 2019年10月17日