Impact of using a privacy model on smart buildings data for CO2 prediction

There is a constant trade-off between the utility of the data collected and processed by the many systems forming the Internet of Things (IoT) revolution and the privacy concerns of the users living in the spaces hosting these sensors. Privacy models, such as the SITA (Spatial, Identity, Temporal, and Activity) model, can help address this trade-off. In this paper, we focus on the problem of $CO_2$ prediction, which is crucial for health monitoring but can also be used to monitor occupancy, which might reveal private information. We apply a number of transformations to a real dataset from a smart building to simulate different SITA configurations on the collected data. We then use the transformed data with multiple Machine Learning (ML) techniques to analyse how well the models predict $CO_2$ levels. Our results show that, compared to the baseline data, no SITA configuration makes one algorithm perform better or worse than the others; in our experiments, the temporal dimension was particularly sensitive, with scores decreasing by up to $18.9\%$ between the original and the transformed data. These results can help show the effect of different levels of data privacy on the data utility of IoT applications, and can help identify which parameters are more relevant for those systems, so that higher privacy settings can be adopted while data utility is still preserved.
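
The abstract does not say which transformations were applied, so the following is only a minimal sketch of what lowering the temporal resolution of sensor data could look like. It assumes that temporal coarsening means averaging readings into fixed time windows. The function names `coarsen_temporal` and `relative_decrease`, the window size, and the sample readings are all hypothetical and are not taken from the paper.

```python
from collections import defaultdict
from datetime import datetime, timedelta
from statistics import mean


def coarsen_temporal(readings, window_minutes):
    """Average (timestamp, co2_ppm) readings into fixed time windows.

    Hypothetical stand-in for a lower temporal-resolution setting:
    each timestamp is floored to the start of its window, counted
    from midnight of the same day.
    """
    buckets = defaultdict(list)
    for ts, co2 in readings:
        midnight = ts.replace(hour=0, minute=0, second=0, microsecond=0)
        minutes = int((ts - midnight).total_seconds() // 60)
        floored = midnight + timedelta(minutes=minutes - minutes % window_minutes)
        buckets[floored].append(co2)
    # Return one averaged reading per window, in chronological order.
    return [(ts, mean(values)) for ts, values in sorted(buckets.items())]


def relative_decrease(baseline_score, transformed_score):
    """Percentage drop of a score relative to its baseline value."""
    return 100.0 * (baseline_score - transformed_score) / baseline_score


# Synthetic readings invented for illustration (not the paper's dataset).
readings = [
    (datetime(2020, 1, 1, 9, 0), 420.0),
    (datetime(2020, 1, 1, 9, 10), 440.0),
    (datetime(2020, 1, 1, 9, 40), 500.0),
    (datetime(2020, 1, 1, 10, 5), 610.0),
]

# Collapse the readings into hourly averages.
hourly = coarsen_temporal(readings, 60)
```

Under these assumptions, a model would be trained once on `readings` and once on `hourly`, and `relative_decrease` would compare the two scores, which is the kind of comparison behind a figure such as the reported drop of up to $18.9\%$.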
