Scene-Conditional 3D Object Stylization and Composition

Recently, 3D generative models have made impressive progress, enabling the generation of almost arbitrary 3D assets from text or image inputs. However, these approaches generate objects in isolation without any consideration for the scene where they will eventually be placed. In this paper, we propose a framework that allows for the stylization of an existing 3D asset to fit into a given 2D scene, and additionally produce a photorealistic composition as if the asset was placed within the environment. This not only opens up a new level of control for object stylization, for example, the same assets can be stylized to reflect changes in the environment, such as summer to winter or fantasy versus futuristic settings-but also makes the object-scene composition more controllable. We achieve this by combining modeling and optimizing the object's texture and environmental lighting through differentiable ray tracing with image priors from pre-trained text-to-image diffusion models. We demonstrate that our method is applicable to a wide variety of indoor and outdoor scenes and arbitrary objects.

翻译：近来，3D生成模型取得了显著进展，能够从文本或图像输入中生成几乎任意形状的3D资产。然而，这些方法在生成物体时孤立地进行，并未考虑这些物体最终将被放置的场景。本文提出了一种框架，允许对现有3D资产进行风格化处理，使其融入给定的2D场景，并生成照片级逼真的合成效果，仿佛该物体被放置于该环境中。这不仅为物体风格化提供了新的控制维度——例如，同一资产可根据环境变化（如夏季到冬季、奇幻与科幻设定）进行风格化调整——还使物体-场景合成更加可控。我们通过结合可微光线追踪建模与优化物体纹理及环境光照，并利用预训练的文本到图像扩散模型的图像先验来实现这一目标。实验表明，我们的方法适用于广泛的室内外场景及任意物体。

相关内容

ASSETS

关注 0

ACM SIGACCESS Conference on Computers and Accessibility是为残疾人和老年人提供与计算机相关的设计、评估、使用和教育研究的首要论坛。我们欢迎提交原始的高质量的有关计算和可访问性的主题。今年，ASSETS首次将其范围扩大到包括关于计算机无障碍教育相关主题的原创高质量研究。官网链接：http://assets19.sigaccess.org/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日