WinSyn: A High Resolution Testbed for Synthetic Data

We present WinSyn, a unique dataset and testbed for creating high-quality synthetic data with procedural modeling techniques. The dataset contains high-resolution photographs of windows, selected from locations around the world, with 89,318 individual window crops showcasing diverse geometric and material characteristics. We evaluate a procedural model by training semantic segmentation networks on both synthetic and real images and then comparing their performances on a shared test set of real images. Specifically, we measure the difference in mean Intersection over Union (mIoU) and determine the effective number of real images to match synthetic data's training performance. We design a baseline procedural model as a benchmark and provide 21,290 synthetically generated images. By tuning the procedural model, key factors are identified which significantly influence the model's fidelity in replicating real-world scenarios. Importantly, we highlight the challenge of procedural modeling using current techniques, especially in their ability to replicate the spatial semantics of real-world scenarios. This insight is critical because of the potential of procedural models to bridge to hidden scene aspects such as depth, reflectivity, material properties, and lighting conditions.

翻译：我们提出WinSyn，一个独特的用于通过程序化建模技术创建高质量合成数据的测试平台与数据集。该数据集包含从全球各地采集的高分辨率窗户照片，共计89,318个独立的窗户图像块，展示了多样化的几何与材质特征。我们通过分别在合成图像与真实图像上训练语义分割网络，并在共享的真实图像测试集上对比性能，对程序化模型进行评估。具体而言，我们测量平均交并比（mIoU）的差异，并确定匹配合成数据训练性能所需的有效真实图像数量。我们设计了一个基线程序化模型作为基准，并生成了21,290张合成图像。通过调整该程序化模型，我们识别出显著影响模型复现真实场景保真度的关键因素。重要的是，我们揭示了当前技术下程序化建模面临的挑战，尤其是其在复现真实场景空间语义方面的局限性。这一发现具有关键意义，因为程序化模型具备将深度、反射率、材质属性及光照条件等隐藏场景特征关联起来的潜力。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

《生成式模型: 变分自编码器与扩散模型》，75页ppt，Google DeepMind科学家Ruiqi Gao

专知会员服务

66+阅读 · 2023年6月10日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日