Context-Aware Iteration Policy Network for Efficient Optical Flow Estimation

Existing recurrent optical flow estimation networks are computationally expensive since they use a fixed large number of iterations to update the flow field for each sample. An efficient network should skip iterations when the flow improvement is limited. In this paper, we develop a Context-Aware Iteration Policy Network for efficient optical flow estimation, which determines the optimal number of iterations per sample. The policy network achieves this by learning contextual information to realize whether flow improvement is bottlenecked or minimal. On the one hand, we use iteration embedding and historical hidden cell, which include previous iterations information, to convey how flow has changed from previous iterations. On the other hand, we use the incremental loss to make the policy network implicitly perceive the magnitude of optical flow improvement in the subsequent iteration. Furthermore, the computational complexity in our dynamic network is controllable, allowing us to satisfy various resource preferences with a single trained model. Our policy network can be easily integrated into state-of-the-art optical flow networks. Extensive experiments show that our method maintains performance while reducing FLOPs by about 40%/20% for the Sintel/KITTI datasets.

翻译：现有递归光流估计网络计算开销较大，原因在于其对每个样本采用固定的大迭代次数来更新光流场。高效网络应在光流改进有限时跳过迭代步骤。本文提出一种面向高效光流估计的上下文感知迭代策略网络，可为每个样本确定最优迭代次数。该策略网络通过学习上下文信息来判断光流改进是否达到瓶颈或趋于极小。一方面，我们利用包含先前迭代信息的迭代嵌入与历史隐藏状态单元，来反映光流相对于先前迭代的变化趋势；另一方面，通过引入增量损失函数，使策略网络隐式感知后续迭代中光流改进的幅度。此外，我们设计的动态网络计算复杂度可控，单一训练模型即可满足不同资源偏好。该策略网络可轻松集成至当前最优光流网络中。大量实验表明，我们的方法在保持性能的同时，在Sintel/KITTI数据集上分别降低约40%/20%的FLOPs。

相关内容

Networking

关注 23

Networking：IFIP International Conferences on Networking。 Explanation：国际网络会议。 Publisher：IFIP。 SIT： http://dblp.uni-trier.de/db/conf/networking/index.html

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日