The dataset focuses on Wikipedia users and contains information about demographic and socioeconomic characteristics of the respondents and their activity on Wikipedia. The data was collected using a questionnaire available online between June and July 2023. The link to the questionnaire was distributed via a banner published in 8 languages on the Wikipedia page. Filling out the questionnaire was voluntary and not incentivised in any way. The survey includes 200 questions about: what people were doing on Wikipedia before clicking the link to the questionnaire; how they use Wikipedia as readers (``professional'' and ``personal'' uses); their opinion on the quality, the thematic coverage, the importance of the encyclopaedia; the making of Wikipedia (how they think it is made, if they have ever contributed and how); their social, sport, artistic and cultural activities, both online and offline; their socio-economic characteristics including political beliefs, and trust propensities. More than 200 000 people opened the questionnaire, 100 332 started to answer, and constitute our dataset, and 10 576 finished it. Among other themes identified by future researchers, the dataset can be useful for advancing the research regarding the features of readers vs contributors of online commons, the relationship between trust, information, sources, and the use made of this information.
翻译:本数据集聚焦于维基百科用户,包含受访者的人口统计学特征、社会经济属性及其在维基百科上的活动信息。数据通过2023年6月至7月期间在线可用的问卷收集,问卷链接以8种语言在维基百科页面横幅中发布,填写问卷为完全自愿且无任何激励措施。问卷包含200道题目,涉及以下内容:用户点击问卷链接前在维基百科上的操作;用户作为读者如何利用维基百科(包括“专业用途”与“个人用途”);对百科全书质量、主题覆盖范围及重要性的评价;对维基百科创作机制的理解(包括用户对编辑流程的认知、是否曾参与贡献及参与方式);用户的线上与线下社交、体育、艺术及文化活动;社会经济特征(包括政治信仰与信任倾向)。超过20万人打开问卷链接,100,332人开始作答(构成数据集主体),其中10,576人完成全部问卷。研究者可基于该数据集推进在线公共资源读者与贡献者特征对比研究,以及信任、信息源、信息使用方式之间互动关系的探索。