The 2023 Video Similarity Dataset and Challenge

Ed Pizzi,Giorgos Kordopatis-Zilos,Hiral Patel,Gheorghe Postelnicu,Sugosh Nagavara Ravindra,Akshay Gupta,Symeon Papadopoulos,Giorgos Tolias,Matthijs Douze

This work introduces a dataset, benchmark, and challenge for the problem of video copy detection and localization. The problem comprises two distinct but related tasks: determining whether a query video shares content with a reference video ("detection"), and additionally temporally localizing the shared content within each video ("localization"). The benchmark is designed to evaluate methods on these two tasks, and simulates a realistic needle-in-haystack setting, where the majority of both query and reference videos are "distractors" containing no copied content. We propose a metric that reflects both detection and localization accuracy. The associated challenge consists of two corresponding tracks, each with restrictions that reflect real-world settings. We provide implementation code for evaluation and baselines. We also analyze the results and methods of the top submissions to the challenge. The dataset, baseline methods and evaluation code is publicly available and will be discussed at a dedicated CVPR'23 workshop.

翻译：本工作提出了一个针对视频复制检测与定位问题的数据集、基准测试及挑战赛。该问题包含两个相关但不同的任务：判断查询视频是否与参考视频存在内容复用（“检测”），以及进一步在每段视频中定位共享内容的起止时间（“定位”）。基准测试旨在评估方法在这两个任务上的表现，并模拟了现实的“大海捞针”场景——大多数查询视频和参考视频均为不含复制内容的“干扰项”。我们提出了一种能够同时反映检测与定位精度的评估指标。相关挑战赛分为两个赛道，每个赛道均设有反映现实场景的限制条件。我们提供评估与基线方法的实现代码，并对顶尖参赛方案的结果与创新方法进行了分析。该数据集、基线方法及评估代码已公开，并将在CVPR'23专题研讨会上进行讨论。

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日