Over the years, various approaches have been employed to enhance the productivity of knowledge workers, from addressing psychological well-being to the development of personal knowledge assistants. A significant challenge in this research area has been the absence of a comprehensive, publicly accessible dataset that mirrors real-world knowledge work. Although a handful of datasets exist, many are restricted in access or lack vital information dimensions, complicating meaningful comparison and benchmarking in the domain. This paper presents RLKWiC, a novel dataset of Real-Life Knowledge Work in Context, derived from monitoring the computer interactions of eight participants over a span of two months. As the first publicly available dataset offering a wealth of essential information dimensions (such as explicated contexts, textual contents, and semantics), RLKWiC seeks to address the research gap in the personal information management domain, providing valuable insights for modeling user behavior.
翻译:多年来,研究者采用多种方法提升知识工作者的生产力,从关注心理健康到开发个人知识助手。该领域面临的一个重大挑战是缺乏全面、公开可获取且能反映真实知识工作的数据集。尽管存在少量数据集,但许多数据集存在访问限制或缺失关键信息维度,难以在该领域进行有意义的比较与基准测试。本文提出RLKWiC——一个基于对八名参与者长达两个月计算机交互行为监测构建的新型真实世界情境知识工作数据集。作为首个公开提供丰富核心信息维度(如显式情境、文本内容及语义)的数据集,RLKWiC旨在填补个人信息管理领域的研究空白,为建模用户行为提供重要参考。