We present a reusable dataset and accompanying infrastructure for studying human search behavior in Interactive Information Retrieval (IIR). The dataset combines detailed interaction logs from 61 participants (122 sessions) with user characteristics, including perceptual speed, topic-specific interest, search expertise, and demographic information. To facilitate reproducibility and reuse, we provide a fully documented study setup, a web-based perceptual speed test, and a framework for conducting similar user studies. Our work allows researchers to investigate individual and contextual factors affecting search behavior, and to develop or validate user simulators that account for such variability. We illustrate the datasets potential through an illustrative analysis and release all resources as open-access, supporting reproducible research and resource sharing in the IIR community.
翻译:我们提出了一个可复用的数据集及配套基础设施,用于研究交互式信息检索中的人类搜索行为。该数据集整合了来自61名参与者(122次会话)的详细交互日志,并包含用户特征信息,如感知速度、主题特定兴趣、搜索专业能力及人口统计学数据。为促进研究的可复现性与重复利用,我们提供了完整记录的研究设置、基于网络的感知速度测试框架以及开展类似用户研究的系统架构。本项工作使研究人员能够探究影响搜索行为的个体与情境因素,并支持开发或验证能够解释此类变异性的用户模拟器。我们通过示例分析展示了该数据集的潜在价值,并将所有资源作为开放获取材料发布,以支持IIR领域的可复现性研究与资源共享。