This paper focuses on the creation of a new, publicly available Wi-Fi probe request dataset. Probe requests belong to the family of management frames used by the 802.11 (Wi-Fi) protocol. As the situation changes year by year, and technology improves probe request studies are necessary to be done on up-to-date data. We provide a month-long probe request capture in an office environment, including work days, weekends, and holidays consisting of over 1 400 000 probe requests. We provide a description of all the important aspects of the dataset. Apart from the raw packet capture we also provide a Radio Map (RM) of the office to ensure the users of the dataset have all the possible information about the environment. To protect privacy, user information in the dataset is anonymized. This anonymization is done in a way that protects the privacy of users while preserving the ability to analyze the dataset to almost the same level as raw data. Furthermore, we showcase several possible use cases for the dataset, like presence detection, temporal Received Signal Strength Indicator (RSSI) stability, and privacy protection evaluation.
翻译:本文聚焦于创建一个新的、公开可用的 Wi-Fi 探测请求数据集。探测请求属于 802.11(Wi-Fi)协议使用的管理帧家族。随着情况逐年变化以及技术的进步,有必要基于最新数据进行探测请求研究。我们提供了在办公环境中为期一个月的探测请求捕获数据,包括工作日、周末和节假日,共计超过 1 400 000 条探测请求。我们描述了数据集的全部重要方面。除了原始数据包捕获外,我们还提供了办公室的无线电地图(RM),以确保数据集的用户能够获得有关环境的全部可能信息。为了保护隐私,数据集中的用户信息已被匿名化。这种匿名化方式在保护用户隐私的同时,保留了几乎与原始数据相同水平的分析能力。此外,我们展示了该数据集几种可能的用例,例如存在性检测、时间接收信号强度指示符(RSSI)稳定性以及隐私保护评估。