USTC-TD: A Test Dataset and Benchmark for Image and Video Coding in 2020s

Image/video coding has been a remarkable research area for both academia and industry for many years. Testing datasets, especially high-quality image/video datasets are desirable for the justified evaluation of coding-related research, practical applications, and standardization activities. We put forward a test dataset namely USTC-TD, which has been successfully adopted in the practical end-to-end image/video coding challenge of the IEEE International Conference on Visual Communications and lmage Processing (VCIP) in 2022 and 2023. USTC-TD contains 40 images at 4K spatial resolution and 10 video sequences at 1080p spatial resolution, featuring various content due to the diverse environmental factors (e.g. scene type, texture, motion, view) and the designed imaging factors (e.g. illumination, lens, shadow). We quantitatively evaluate USTC-TD on different image/video features (spatial, temporal, color, lightness), and compare it with the previous image/video test datasets, which verifies the wider coverage and more diversity of the proposed dataset. We also evaluate both classic standardized and recent learned image/video coding schemes on USTC-TD with PSNR and MS-SSIM, and provide an extensive benchmark for the evaluated schemes. Based on the characteristics and specific design of the proposed test dataset, we analyze the benchmark performance and shed light on the future research and development of image/video coding. All the data are released online: https://esakak.github.io/USTC-TD .

翻译：图像/视频编码多年来一直是学术界与工业界的重要研究领域。测试数据集，尤其是高质量的图像/视频数据集，对于编码相关研究、实际应用及标准化活动的合理评估至关重要。本文提出了一个名为USTC-TD的测试数据集，该数据集已成功应用于2022年与2023年IEEE视觉通信与图像处理国际会议（VCIP）的实际端到端图像/视频编码挑战赛中。USTC-TD包含40幅4K空间分辨率的图像与10段1080p空间分辨率的视频序列，其内容因多样的环境因素（如场景类型、纹理、运动、视角）与设计的成像因素（如光照、镜头、阴影）而具有丰富变化。我们从不同图像/视频特征（空间、时间、色彩、亮度）对USTC-TD进行了定量评估，并与先前的图像/视频测试数据集进行了比较，验证了本数据集更广的覆盖范围与更高的多样性。我们还基于PSNR与MS-SSIM指标，在USTC-TD上评估了经典标准化编码方案与近年基于学习的图像/视频编码方案，并为所评估方案提供了全面的基准测试结果。基于所提出测试数据集的特点与具体设计，我们分析了基准性能，并对图像/视频编码的未来研究与发展方向进行了展望。所有数据均已在线发布：https://esakak.github.io/USTC-TD。

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日