Point Prompting: Counterfactual Tracking with Video Diffusion Models

Trackers and video generators solve closely related problems: the former analyze motion, while the latter synthesize it. We show that this connection enables pretrained video diffusion models to perform zero-shot point tracking by simply prompting them to visually mark points as they move over time. We place a distinctively colored marker at the query point, then regenerate the rest of the video from an intermediate noise level. This propagates the marker across frames, tracing the point's trajectory. To ensure that the marker remains visible in this counterfactual generation, despite such markers being unlikely in natural videos, we use the unedited initial frame as a negative prompt. Through experiments with multiple image-conditioned video diffusion models, we find that these "emergent" tracks outperform those of prior zero-shot methods and persist through occlusions, often obtaining performance that is competitive with specialized self-supervised models.
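The steps described above can be illustrated with a small NumPy sketch. It is not the authors' implementation: no diffusion model is called, and the marker color, radius, color tolerance, the variance-preserving noising form, and every function name below are assumptions made for illustration. It shows how a marker might be painted at the query point, how a clean video could be noised to an intermediate level before regeneration, how a prediction conditioned on the edited frame might be pushed away from one conditioned on the unedited frame (in the style of classifier-free guidance), and how the marker could be read back out of a generated frame.

```python
import numpy as np

# Assumed marker color (pure red); the source only says "distinctively colored".
MARKER_COLOR = np.array([255.0, 0.0, 0.0])


def draw_marker(frame, xy, radius=2, color=MARKER_COLOR):
    """Return a float copy of an (H, W, 3) frame with a filled disk at xy = (x, y)."""
    h, w, _ = frame.shape
    ys, xs = np.mgrid[0:h, 0:w]
    mask = (xs - xy[0]) ** 2 + (ys - xy[1]) ** 2 <= radius ** 2
    out = frame.astype(np.float64)  # astype returns a new array
    out[mask] = color
    return out


def noise_to_level(x0, alpha_bar, rng):
    """Noise clean data x0 to an intermediate level.

    Assumes a variance-preserving form, x_t = sqrt(a) * x0 + sqrt(1 - a) * eps;
    a given video model may use a different parameterization.
    """
    eps = rng.standard_normal(x0.shape)
    return np.sqrt(alpha_bar) * x0 + np.sqrt(1.0 - alpha_bar) * eps


def guided_prediction(pred_edited, pred_unedited, scale):
    """Extrapolate away from the negative branch (the unedited first frame).

    With scale == 1 this returns pred_edited; larger scales push the result
    further from what the unedited conditioning would produce.
    """
    return pred_unedited + scale * (pred_edited - pred_unedited)


def locate_marker(frame, color=MARKER_COLOR, tol=60.0):
    """Return the (x, y) centroid of pixels near the marker color, or None."""
    dist = np.linalg.norm(frame.astype(np.float64) - color, axis=-1)
    ys, xs = np.nonzero(dist < tol)
    if xs.size == 0:
        return None  # no marker-colored pixels, e.g. the point is occluded
    return float(xs.mean()), float(ys.mean())
```

In a full pipeline, `locate_marker` would be applied to every regenerated frame to read off the trajectory, with `None` results treated as candidate occlusions; how the source handles those cases is not stated in the abstract.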

