Commonly used datasets for evaluating video codecs consist entirely of very high-quality video and are not representative of the video typically seen in video conferencing scenarios. We present the Video Conferencing Dataset (VCD) for evaluating video codecs for real-time communication, the first such dataset focused on video conferencing. VCD covers a wide range of camera qualities and of spatial and temporal information. It includes both desktop and mobile scenarios and two types of video background processing. We report the compression efficiency of H.264, H.265, H.266, and AV1 in low-delay settings on VCD and compare it with results on the non-video-conferencing datasets UVG, MCL-JCV, and HEVC. The results show that source quality and scenario have a significant effect on the compression efficiency of all the codecs. VCD enables the evaluation and tuning of codecs for this important scenario. VCD is publicly available as an open-source dataset at https://github.com/microsoft/VCD.