In the last years the pervasive use of sensors, as they exist in smart devices, e.g., phones, watches, medical devices, has increased dramatically the availability of personal data. However, existing research on data collection primarily focuses on the objective view of reality, as provided, for instance, by sensors, often neglecting the integration of subjective human input, as provided, for instance, by user answers to questionnaires. This limits substantially the exploitability of the collected data. In this paper we present a methodology and a platform specifically designed for the collection of a combination of large-scale sensor data and qualitative human feedback. The methodology has been designed to be deployed on top, and enriches the functionalities of, an existing data collection APP, called iLog, which has been used in large scale, worldwide data collection experiments. The main goal is to put the key actors involved in an experiment, i.e., the researcher in charge, the participant, and iLog in better control of the experiment itself, thus enabling a much improved quality and richness of the data collected. The novel functionalities of the resulting platform are: (i) a time-wise representation of the situational context within which the data collection is performed, (ii) an explicit representation of the temporal context within which the data collection is performed, (iii) a calendar-based dashboard for the real-time monitoring of the data collection context(s), and, finally, (iv) a mechanism for the run-time revision of the data collection plan. The practicality and utility of the proposed functionalities are demonstrated by showing how they apply to a case study involving 350 University students.
翻译:近年来,随着传感器在智能设备(如手机、手表、医疗设备)中的普及应用,个人数据的可获得性急剧增加。然而,现有的数据收集研究主要关注传感器所提供的客观现实视角,往往忽视了整合人类主观输入(例如用户对问卷的回答)。这极大地限制了所收集数据的可利用性。本文提出了一种专门用于收集大规模传感器数据与定性人类反馈相结合的方法论及平台。该方法论设计用于部署并增强现有数据收集应用iLog的功能,该应用已在全球范围内的大规模数据收集实验中得到使用。其主要目标是使实验中的关键参与者——即负责的研究人员、参与者以及iLog——能更好地控制实验过程,从而显著提升所收集数据的质量与丰富性。该平台的新功能包括:(i)数据收集所处情境上下文的时间维度表征,(ii)数据收集所处时间上下文的显式表征,(iii)基于日历的实时数据收集上下文监控仪表板,以及(iv)数据收集计划的运行时动态调整机制。通过将其应用于一项涉及350名大学生的案例研究,展示了所提功能模块的实用性与有效性。