360VFI: A Dataset and Benchmark for Omnidirectional Video Frame Interpolation

With the development of VR-related techniques, viewers can enjoy a realistic and immersive experience through a head-mounted display, while omnidirectional video with a low frame rate can lead to user dizziness. However, the prevailing plane frame interpolation methodologies are unsuitable for Omnidirectional Video Interpolation, chiefly due to the lack of models tailored to such videos with strong distortion, compounded by the scarcity of valuable datasets for Omnidirectional Video Frame Interpolation. In this paper, we introduce the benchmark dataset, 360VFI, for Omnidirectional Video Frame Interpolation. We present a practical implementation that introduces a distortion prior from omnidirectional video into the network to modulate distortions. We especially propose a pyramid distortion-sensitive feature extractor that uses the unique characteristics of equirectangular projection (ERP) format as prior information. Moreover, we devise a decoder that uses an affine transformation to facilitate the synthesis of intermediate frames further. 360VFI is the first dataset and benchmark that explores the challenge of Omnidirectional Video Frame Interpolation. Through our benchmark analysis, we presented four different distortion conditions scenes in the proposed 360VFI dataset to evaluate the challenge triggered by distortion during interpolation. Besides, experimental results demonstrate that Omnidirectional Video Interpolation can be effectively improved by modeling for omnidirectional distortion.

翻译：随着VR相关技术的发展，观众可通过头戴式显示器获得逼真且沉浸式的体验，然而低帧率的全景视频可能导致用户眩晕。然而，当前主流的平面帧插值方法并不适用于全景视频插值，主要原因在于缺乏针对此类强失真视频的定制化模型，加之可用于全景视频帧插值的优质数据集稀缺。本文提出了用于全景视频帧插值的基准数据集360VFI。我们提出了一种实用实现方案，将全景视频的失真先验引入网络以调制失真。特别地，我们设计了一种金字塔式失真敏感特征提取器，该提取器利用等距柱状投影（ERP）格式的独特特性作为先验信息。此外，我们构建了一种采用仿射变换的解码器，以进一步促进中间帧的合成。360VFI是首个探索全景视频帧插值挑战的数据集与基准。通过基准分析，我们在所提出的360VFI数据集中构建了四种不同失真条件的场景，以评估插值过程中由失真引发的挑战。此外，实验结果表明，通过对全景失真进行建模，可有效提升全景视频插值性能。

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日