We introduce MatSynth, a dataset of $4,000+$ CC0 ultra-high resolution PBR materials. Materials are crucial components of virtual relightable assets, defining the interaction of light at the surface of geometries. Given their importance, significant research effort was dedicated to their representation, creation and acquisition. However, in the past 6 years, most research in material acquisiton or generation relied either on the same unique dataset, or on company-owned huge library of procedural materials. With this dataset we propose a significantly larger, more diverse, and higher resolution set of materials than previously publicly available. We carefully discuss the data collection process and demonstrate the benefits of this dataset on material acquisition and generation applications. The complete data further contains metadata with each material's origin, license, category, tags, creation method and, when available, descriptions and physical size, as well as 3M+ renderings of the augmented materials, in 1K, under various environment lightings. The MatSynth dataset is released through the project page at: https://www.gvecchio.com/matsynth.
翻译:我们提出MatSynth数据集,包含4000多个CC0协议的超高分辨率PBR材质。材质是虚拟可重光照资产的关键组成部分,定义了光线在几何体表面的交互方式。鉴于其重要性,大量研究工作致力于材质的表示、生成与采集。然而,过去6年间,大多数材质采集或生成研究要么依赖于同一独特数据集,要么依赖于公司拥有的大型程序化材质库。通过本数据集,我们提供了比以往公开数据集规模更大、种类更丰富、分辨率更高的材质集合。我们详细讨论了数据收集过程,并展示了该数据集在材质采集与生成应用中的优势。完整数据还包含每个材质的元数据(来源、许可协议、类别、标签、生成方法),以及描述信息、物理尺寸(如有),并在多种环境光照下以1K分辨率提供了300多万张增强材质的渲染图。MatSynth数据集通过项目页面发布,网址为:https://www.gvecchio.com/matsynth。