The dataset focuses on Wikipedia users and contains information about demographic and socioeconomic characteristics of the respondents and their activity on Wikipedia. The data was collected using a questionnaire available online between June and July 2023. The link to the questionnaire was distributed via a banner published in 8 languages on the Wikipedia page. Filling out the questionnaire was voluntary and not incentivised in any way. The survey includes 200 questions about: what people were doing on Wikipedia before clicking the link to the questionnaire; how they use Wikipedia as readers (``professional'' and ``personal'' uses); their opinion on the quality, the thematic coverage, the importance of the encyclopaedia; the making of Wikipedia (how they think it is made, if they have ever contributed and how); their social, sport, artistic and cultural activities, both online and offline; their socio-economic characteristics including political beliefs, and trust propensities. More than 200 000 people opened the questionnaire, 100 332 started to answer, and constitute our dataset, and 10 576 finished it. Among other themes identified by future researchers, the dataset can be useful for advancing the research regarding the features of readers vs contributors of online commons, the relationship between trust, information, sources, and the use made of this information.
翻译:该数据集聚焦维基百科用户,包含受访者人口统计、社会经济特征及其在维基百科上的活动信息。数据通过2023年6月至7月间在线问卷收集,问卷链接以8种语言在维基百科页面横幅发布。问卷填写为自愿参与,未提供任何激励。调查包含200个问题,涵盖:用户点击问卷链接前在维基百科上的行为;作为读者的使用方式(包括"专业"与"个人"用途);对百科全书质量、主题覆盖范围及重要性的评价;维基百科的构建认知(包括用户对其运作机制的理解、是否曾参与贡献及其方式);线上线下的社会、体育、艺术与文化参与;以及包含政治倾向与信任倾向在内的社会经济特征。超过20万人打开问卷,10.0332万人开始作答构成数据集主体,其中1.0576万人完成全部问卷。除未来研究者可发掘的其他主题外,该数据集可有效推进关于在线公共资源读者与贡献者特征、信任与信息源关系、以及信息利用方式等方面的研究。