Synthesizing Artistic Cinemagraphs from Text

We introduce Artistic Cinemagraph, a fully automated method for creating cinemagraphs from text descriptions - an especially challenging task when prompts feature imaginary elements and artistic styles, given the complexity of interpreting the semantics and motions of these images. Existing single-image animation methods fall short on artistic inputs, and recent text-based video methods frequently introduce temporal inconsistencies, struggling to keep certain regions static. To address these challenges, we propose an idea of synthesizing image twins from a single text prompt - a pair of an artistic image and its pixel-aligned corresponding natural-looking twin. While the artistic image depicts the style and appearance detailed in our text prompt, the realistic counterpart greatly simplifies layout and motion analysis. Leveraging existing natural image and video datasets, we can accurately segment the realistic image and predict plausible motion given the semantic information. The predicted motion can then be transferred to the artistic image to create the final cinemagraph. Our method outperforms existing approaches in creating cinemagraphs for natural landscapes as well as artistic and other-worldly scenes, as validated by automated metrics and user studies. Finally, we demonstrate two extensions: animating existing paintings and controlling motion directions using text.

翻译：我们提出“艺术动态照片”（Artistic Cinemagraph），一种完全自动化的方法，可根据文本描述生成动态照片——这是一项极具挑战性的任务，尤其当提示包含虚构元素和艺术风格时，因为需要解读这些图像的语义与运动。现有单图像动画方法在艺术输入上表现不佳，而近期基于文本的视频方法常引入时间不一致性，难以维持特定区域的静止。为应对这些挑战，我们提出从单一文本提示合成“图像孪生体”的思路：即生成一对艺术图像与其像素对齐的自然外观孪生体。艺术图像展现文本提示中详细描述的风格与外观，而真实感对应物则极大简化了布局与运动分析。利用现有自然图像与视频数据集，我们可精确分割真实感图像，并根据语义信息预测合理运动。预测的运动随后可迁移至艺术图像，以生成最终动态照片。通过自动评估指标与用户研究验证，我们的方法在自然景观以及艺术及超现实场景的动态照片生成中均优于现有方法。最后，我们展示两项扩展：对现有画作进行动画化，以及利用文本控制运动方向。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

37+阅读 · 2019年10月17日