We introduce MatSynth, a dataset of 4,000+ CC0 ultra-high-resolution PBR materials. Materials are crucial components of virtual relightable assets, defining how light interacts with the surface of geometries. Given their importance, significant research effort has been dedicated to their representation, creation, and acquisition. However, over the past 6 years, most research in material acquisition or generation has relied either on a single dataset or on a large, company-owned library of procedural materials. With this dataset, we propose a significantly larger, more diverse, and higher-resolution set of materials than was previously publicly available. We discuss the data collection process in detail and demonstrate the benefits of this dataset for material acquisition and generation applications. The complete data also contains metadata with each material's origin, license, category, tags, creation method and, when available, description and physical size, as well as 3M+ renderings of the augmented materials, at 1K resolution, under various environment lightings. The MatSynth dataset is released through the project page at: https://www.gvecchio.com/matsynth.