Adversarial attacks and defenses have gained increasing interest on computer vision systems in recent years, but as of today, most investigations are limited to images. However, many artificial intelligence models actually handle documentary data, which is very different from real world images. Hence, in this work, we try to apply the adversarial attack philosophy on documentary and natural data and to protect models against such attacks. We focus our work on untargeted gradient-based, transfer-based and score-based attacks and evaluate the impact of adversarial training, JPEG input compression and grey-scale input transformation on the robustness of ResNet50 and EfficientNetB0 model architectures. To the best of our knowledge, no such work has been conducted by the community in order to study the impact of these attacks on the document image classification task.
翻译:近年来,对抗攻击与防御在计算机视觉系统中日益受到关注,但迄今为止,大多数研究仅限于自然图像。然而,许多人工智能模型实际处理的是文档数据,这与真实世界图像存在显著差异。因此,本工作尝试将对抗攻击方法应用于文档与自然数据,并保护模型免受此类攻击。我们聚焦于无目标梯度攻击、迁移攻击和分数攻击,并评估对抗训练、JPEG输入压缩及灰度输入变换对ResNet50与EfficientNetB0模型架构鲁棒性的影响。据我们所知,目前尚未有研究系统探讨这些攻击对文档图像分类任务的影响。