CN-Celeb-AV: A Multi-Genre Audio-Visual Dataset for Person Recognition

Audio-visual person recognition (AVPR) has received extensive attention. However, most datasets used for AVPR research so far are collected in constrained environments, and thus cannot reflect the true performance of AVPR systems in real-world scenarios. To meet the request for research on AVPR in unconstrained conditions, this paper presents a multi-genre AVPR dataset collected `in the wild', named CN-Celeb-AV. This dataset contains more than 419k video segments from 1,136 persons from public media. In particular, we put more emphasis on two real-world complexities: (1) data in multiple genres; (2) segments with partial information. A comprehensive study was conducted to compare CN-Celeb-AV with two popular public AVPR benchmark datasets, and the results demonstrated that CN-Celeb-AV is more in line with real-world scenarios and can be regarded as a new benchmark dataset for AVPR research. The dataset also involves a development set that can be used to boost the performance of AVPR systems in real-life situations. The dataset is free for researchers and can be downloaded from http://cnceleb.org/.

翻译：音视频人物识别（AVPR）近年来受到广泛关注。然而，目前用于AVPR研究的大多数数据集均在受控环境中采集，无法真实反映AVPR系统在现实场景中的性能。为满足非约束条件下AVPR研究的迫切需求，本文提出一个在真实场景中采集的多类型AVPR数据集——CN-Celeb-AV。该数据集包含来自1,136名公众人物的超过41.9万个视频片段。我们重点突出两种现实复杂性：（1）多类型数据；（2）包含部分信息的片段。通过与两个主流公开AVPR基准数据集进行综合对比研究，结果表明CN-Celeb-AV更贴近真实场景，可作为AVPR研究的新基准数据集。该数据集还包含一个可用于提升AVPR系统实际性能的开发集。数据集免费向研究人员开放，可从http://cnceleb.org/下载。

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日