While the study of language as typed on smartphones offers valuable insights, existing data collection methods often fall short in providing contextual information and ensuring user privacy. We present a privacy-respectful approach - context-enriched keyboard logging - that allows for the extraction of contextual information on the user's input motive, which is meaningful for linguistics, psychology, and behavioral sciences. In particular, with our approach, we enable distinguishing language contents by their channel (i.e., comments, messaging, search inputs). Filtering by channel allows for better pre-selection of data, which is in the interest of researchers and improves users' privacy. We demonstrate our approach on a large-scale six-month user study (N=624) of language use in smartphone interactions in the wild. Finally, we highlight the implications for research on language use in human-computer interaction and interdisciplinary contexts.
翻译:虽然通过智能手机打字研究语言行为能提供有价值的洞见,但现有数据收集方法往往难以兼顾情境信息获取与用户隐私保护。我们提出一种尊重隐私的方法——情境增强型键盘记录技术——该技术能够提取用户输入动机的情境信息,这对语言学、心理学和行为科学研究具有重要意义。具体而言,该方法使我们能够通过输入渠道(如评论、即时通讯、搜索输入)区分语言内容。按渠道过滤可实现更优的数据预筛选,既符合研究者的需求,又强化了用户隐私保护。我们通过一项为期六个月的大规模野外用户研究(N=624)验证了该方法在智能手机交互中语言使用的有效性。最后,我们阐述了该方法对人机交互及跨学科语境下语言使用研究的重要意义。