Problem-dependent attention and effort in neural networks with applications to image resolution and model selection

This paper introduces two new ensemble-based methods to reduce the data and computation costs of image classification. They can be used with any set of classifiers and do not require additional training. In the first approach, data usage is reduced by only analyzing a full-sized image if the model has low confidence in classifying a low-resolution pixelated version. When applied on the best performing classifiers considered here, data usage is reduced by 61.2% on MNIST, 69.6% on KMNIST, 56.3% on FashionMNIST, 84.6% on SVHN, 40.6% on ImageNet, and 27.6% on ImageNet-V2, all with a less than 5% reduction in accuracy. However, for CIFAR-10, the pixelated data are not particularly informative, and the ensemble approach increases data usage while reducing accuracy. In the second approach, compute costs are reduced by only using a complex model if a simpler model has low confidence in its classification. Computation cost is reduced by 82.1% on MNIST, 47.6% on KMNIST, 72.3% on FashionMNIST, 86.9% on SVHN, 89.2% on ImageNet, and 81.5% on ImageNet-V2, all with a less than 5% reduction in accuracy; for CIFAR-10 the corresponding improvements are smaller at 13.5%. When cost is not an object, choosing the projection from the most confident model for each observation increases validation accuracy to 81.0% from 79.3% for ImageNet and to 69.4% from 67.5% for ImageNet-V2.

翻译：本文提出两种新型集成方法，用于降低图像分类的数据与计算成本。这些方法可应用于任意分类器组合，且无需额外训练。第一种方法通过仅在模型对低分辨率像素化版本分类置信度较低时才分析全尺寸图像，从而降低数据使用量。将其应用于本研究中最优分类器时，在MNIST上数据使用量减少61.2%，KMNIST减少69.6%，FashionMNIST减少56.3%，SVHN减少84.6%，ImageNet减少40.6%，ImageNet-V2减少27.6%，且准确率降幅均低于5%。然而，对于CIFAR-10数据集，像素化数据信息量不足，集成方法反而增加数据使用量并降低准确率。第二种方法通过仅在简单模型分类置信度较低时使用复杂模型，从而降低计算成本。计算成本在MNIST上减少82.1%，KMNIST减少47.6%，FashionMNIST减少72.3%，SVHN减少86.9%，ImageNet减少89.2%，ImageNet-V2减少81.5%，准确率降幅均低于5%；而在CIFAR-10上对应改进幅度较小，仅为13.5%。当计算成本无约束时，针对每个观测选择置信度最高模型的投影结果，可将ImageNet验证准确率从79.3%提升至81.0%，ImageNet-V2从67.5%提升至69.4%。

相关内容

ImageNet (数据集)

关注 22

ImageNet项目是一个用于视觉对象识别软件研究的大型可视化数据库。超过1400万的图像URL被ImageNet手动注释，以指示图片中的对象;在至少一百万个图像中，还提供了边界框。ImageNet包含2万多个类别; [2]一个典型的类别，如“气球”或“草莓”，包含数百个图像。第三方图像URL的注释数据库可以直接从ImageNet免费获得;但是，实际的图像不属于ImageNet。自2010年以来，ImageNet项目每年举办一次软件比赛，即ImageNet大规模视觉识别挑战赛（ILSVRC），软件程序竞相正确分类检测物体和场景。 ImageNet挑战使用了一个“修剪”的1000个非重叠类的列表。2012年在解决ImageNet挑战方面取得了巨大的突破，被广泛认为是2010年的深度学习革命的开始。

机器学习损失函数概述，Loss Functions in Machine Learning

专知会员服务

84+阅读 · 2022年3月19日

近期必读的五篇ICLR 2021【图神经网络（GNN）】相关论文和代码

专知会员服务

69+阅读 · 2021年2月25日

近期必读的七篇 NeurIPS 2020【视觉目标检测】相关论文和代码

专知会员服务

31+阅读 · 2020年12月22日

神经常微分方程教程，50页ppt，A brief tutorial on Neural ODEs

专知会员服务

74+阅读 · 2020年8月2日