Metadata management for distributed data sources is a long-standing and ever-growing problem. To address this challenge in a research-data and library-oriented setting, this work proposes a data architecture derived from the data-lake: the metadata-lake. A proof-of-concept implementation of the proposed metadata aggregator is presented and briefly evaluated.