Methods that use neural networks for synthesizing 3D shapes in the form of a part-based representation have been introduced over the last few years. These methods represent shapes as a graph or hierarchy of parts and enable a variety of applications such as shape sampling and reconstruction. However, current methods do not allow easily regenerating individual shape parts according to user preferences. In this paper, we investigate techniques that allow the user to generate multiple, diverse suggestions for individual parts. Specifically, we experiment with multimodal deep generative models that allow sampling diverse suggestions for shape parts and focus on models which have not been considered in previous work on shape synthesis. To provide a comparative study of these techniques, we introduce a method for synthesizing 3D shapes in a part-based representation and evaluate all the part suggestion techniques within this synthesis method. In our method, which is inspired by previous work, shapes are represented as a set of parts in the form of implicit functions which are then positioned in space to form the final shape. Synthesis in this representation is enabled by a neural network architecture based on an implicit decoder and a spatial transformer. We compare the various multimodal generative models by evaluating their performance in generating part suggestions. Our contribution is to show with qualitative and quantitative evaluations which of the new techniques for multimodal part generation perform the best and that a synthesis method based on the top-performing techniques allows the user to more finely control the parts that are generated in the 3D shapes while maintaining high shape fidelity when reconstructing shapes.
翻译:近年来,基于部件表示的神经网络方法已被引入用于合成三维形状。这些方法将形状表示为部件图或部件层级结构,能够支持形状采样、重建等多种应用。然而,现有方法无法根据用户偏好轻松重新生成单个形状部件。本文研究能够根据用户需求生成多种多样化部件建议的技术。具体而言,我们尝试采用多模态深度生成模型对形状部件进行多样化采样,重点关注此前三维形状合成工作中尚未被研究的模型。为对这些技术进行对比研究,我们提出了一种基于部件表示的三维形状合成方法,并在此合成框架内评估所有部件建议技术。受前人工作启发,该方法将形状表示为隐式函数形式的部件集合,随后通过空间定位形成最终形状。该表示下的合成能力由基于隐式解码器和空间变换器的神经网络架构实现。我们通过评估各种多模态生成模型在部件建议生成任务中的表现进行对比。本研究的贡献在于:通过定性与定量评估,揭示哪些新型多模态部件生成技术性能最优,并证明基于最优技术的合成方法能够在保持高形状保真度的同时,使用户更精细地控制三维形状中生成的部件。