Globally, there is an increased need for guidelines to produce high-quality data outputs for analysis. No framework currently exists that provides guidelines for a comprehensive approach to producing analysis ready data (ARD). Through critically reviewing and summarising current literature, this paper proposes such guidelines for the creation of ARD. The guidelines proposed in this paper inform ten steps in the generation of ARD: ethics, project documentation, data governance, data management, data storage, data discovery and collection, data cleaning, quality assurance, metadata, and data dictionary. These steps are illustrated through a substantive case study that aimed to create ARD for a digital spatial platform: the Australian Child and Youth Wellbeing Atlas (ACYWA).
翻译:全球范围内,对制定高质量数据分析输出准则的需求日益增长。然而,目前尚不存在为全面生成分析就绪数据(ARD)提供指导的框架。本文通过批判性回顾与总结现有文献,提出了一套针对ARD创建的系统性指南。该指南涵盖了生成ARD的十个关键环节:伦理规范、项目文档、数据治理、数据管理、数据存储、数据发现与采集、数据清洗、质量保障、元数据及数据字典。这些步骤通过一个实质性案例研究得到阐释——即为数字空间平台“澳大利亚儿童与青少年健康福祉图谱”(ACYWA)构建ARD的全过程。