We introduce MatSynth, a dataset of 4,000+ CC0 ultra-high resolution PBR materials. Materials are crucial components of virtual relightable assets, defining the interaction of light at the surface of geometries. Given their importance, significant research effort has been dedicated to their representation, creation and acquisition. However, in the past 6 years, most research in material acquisition or generation has relied either on the same single dataset or on a large, company-owned library of procedural materials. With this dataset, we propose a significantly larger, more diverse, and higher resolution set of materials than was previously publicly available. We carefully discuss the data collection process and demonstrate the benefits of this dataset on material acquisition and generation applications. The complete data further contains metadata with each material's origin, license, category, tags, creation method and, when available, descriptions and physical size, as well as 3M+ renderings of the augmented materials, at 1K resolution, under various environment lighting conditions. The MatSynth dataset is released through the project page at: https://www.gvecchio.com/matsynth.