Synthesizing Artistic Cinemagraphs from Text

We introduce Text2Cinemagraph, a fully automated method for creating cinemagraphs from text descriptions. The task is especially challenging when prompts feature imaginary elements and artistic styles, because the semantics and motion of such images are hard to interpret. Existing single-image animation methods fall short on artistic inputs, and recent text-based video methods frequently introduce temporal inconsistencies and struggle to keep selected regions static. To address these challenges, we propose synthesizing image twins from a single text prompt: an artistic image paired with its pixel-aligned, natural-looking twin. The artistic image depicts the style and appearance described in the text prompt, while the realistic counterpart greatly simplifies layout and motion analysis. Leveraging existing natural image and video datasets, we can accurately segment the realistic image and predict plausible motion from the semantic information. The predicted motion is then transferred to the artistic image to create the final cinemagraph. Our method outperforms existing approaches in creating cinemagraphs for natural landscapes as well as artistic and other-worldly scenes, as validated by automated metrics and user studies. Finally, we demonstrate two extensions: animating existing paintings and controlling motion direction using text.
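
The motion-transfer step described above can be illustrated with a toy sketch. Because the two twins are pixel-aligned, a mask and flow field estimated on the realistic image can be applied unchanged to the artistic one. The code below is not the paper's implementation: it swaps the learned segmentation and motion prediction for a hand-written mask and a constant integer flow, and the names `warp_frame` and `make_cinemagraph` are hypothetical. Wrap-around sampling is an assumption made here only to get a seamless loop on a tiny grid.

```python
from typing import List, Tuple

Image = List[List[int]]              # grayscale image as rows of pixel values
Mask = List[List[bool]]              # True where the region should move
Flow = List[List[Tuple[int, int]]]   # per-pixel (dx, dy) displacement per frame


def warp_frame(image: Image, mask: Mask, flow: Flow, t: int) -> Image:
    """Backward-warp `image` by t steps of a static flow, inside `mask` only."""
    h, w = len(image), len(image[0])
    out = [row[:] for row in image]  # static regions are copied unchanged
    for y in range(h):
        for x in range(w):
            if not mask[y][x]:
                continue
            dx, dy = flow[y][x]
            # Look up the source pixel by stepping backwards along the flow.
            # Wrapping at the borders is a simplification for looping.
            sx = (x - t * dx) % w
            sy = (y - t * dy) % h
            out[y][x] = image[sy][sx]
    return out


def make_cinemagraph(image: Image, mask: Mask, flow: Flow, n_frames: int) -> List[Image]:
    """Apply the same mask and flow at successive time steps."""
    return [warp_frame(image, mask, flow, t) for t in range(n_frames)]


# Stand-in for the artistic twin: the top row is static "sky", the bottom row is "water".
artistic = [[1, 2, 3],
            [4, 5, 6]]
# Mask and flow as if they had been estimated on the realistic twin.
mask = [[False, False, False],
        [True, True, True]]
flow = [[(1, 0)] * 3 for _ in range(2)]  # move one pixel to the right per frame

frames = make_cinemagraph(artistic, mask, flow, n_frames=4)
```

In this toy setup the top row is identical in every frame while the bottom row shifts cyclically and returns to its starting state after three steps, which is the defining property of a cinemagraph: a looping motion confined to a selected region.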

