Learning-based image compression methods have improved in recent years and have begun to outperform traditional codecs. However, neural-network approaches can unexpectedly introduce visual artifacts in some images. We therefore propose methods to separately detect three types of artifacts (texture and boundary degradation, color change, and text corruption), to localize the affected regions, and to quantify the artifact strength. We consider only regions that are distorted solely by the neural compression but that a traditional codec recovers successfully at a comparable bitrate. We employed our methods to collect artifacts for the JPEG AI verification model with respect to HM-18.0, the H.265 reference software. We processed about 350,000 unique images from the Open Images dataset using different compression-quality parameters; the result is a dataset of 46,440 artifacts validated through crowd-sourced subjective assessment. Our proposed dataset and methods are valuable for testing neural-network-based image codecs, identifying bugs in these codecs, and enhancing their performance. We make the source code of our methods and the dataset publicly available.