Commonly used datasets for evaluating video codecs are all very high quality and not representative of video typically used in video conferencing scenarios. We present the Video Conferencing Dataset (VCD) for evaluating video codecs for real-time communication, the first such dataset focused on video conferencing. VCD includes a wide variety of camera qualities and spatial and temporal information. It includes both desktop and mobile scenarios and two types of video background processing. We report the compression efficiency of H.264, H.265, H.266, and AV1 in low-delay settings on VCD and compare it with the non-video conferencing datasets UVC, MLC-JVC, and HEVC. The results show the source quality and the scenarios have a significant effect on the compression efficiency of all the codecs. VCD enables the evaluation and tuning of codecs for this important scenario. The VCD is publicly available as an open-source dataset at https://github.com/microsoft/VCD.
翻译:常用评估视频编解码器的数据集均为高质量视频,无法代表视频会议场景中的典型视频内容。本文提出视频会议数据集(VCD)用于评估实时通信场景下的视频编解码器,这是首个聚焦视频会议的数据集。VCD包含多种摄像机质量、空间及时间信息,涵盖桌面端与移动端场景,以及两种视频背景处理方式。我们报告了H.264、H.265、H.266及AV1在低延迟设置下对VCD的压缩效率,并将其与非视频会议数据集UVC、MLC-JVC和HEVC进行对比。结果表明,所有编解码器的压缩效率均受源质量和场景显著影响。VCD为这一重要场景下的编解码器评估与调优提供了支持。该数据集已作为开源资源发布于https://github.com/microsoft/VCD。