Equity through Access: A Case for Small-scale Deep Learning

The recent advances in deep learning (DL) have been accelerated by access to large-scale data and compute. These large-scale resources have been used to train progressively larger models which are resource intensive in terms of compute, data, energy, and carbon emissions. These costs are becoming a new type of entry barrier to researchers and practitioners with limited access to resources at such scale, particularly in the Global South. In this work, we take a comprehensive look at the landscape of existing DL models for vision tasks and demonstrate their usefulness in settings where resources are limited. To account for the resource consumption of DL models, we introduce a novel measure to estimate the performance per resource unit, which we call the PePR score. Using a diverse family of 131 unique DL architectures (spanning 1M to 130M trainable parameters) and three medical image datasets, we capture trends about the performance-resource trade-offs. In applications like medical image analysis, we argue that small-scale, specialized models are better than striving for large-scale models. Furthermore, we show that using pretrained models can significantly reduce the computational resources and data required. We hope this work will encourage the community to focus on improving AI equity by developing methods and models with smaller resource footprints.

翻译：深度学习（DL）的最新进展得益于大规模数据和计算资源的获取。这些大规模资源被用于训练日益庞大的模型，这类模型在计算、数据、能源和碳排放方面消耗巨大。这些成本正成为获取有限规模资源的研究者和从业者（尤其是在全球南方地区）面临的新型准入门槛。本文系统审视了现有面向视觉任务的深度学习模型格局，并论证了它们在资源受限环境中的实用价值。为衡量深度学习模型的资源消耗，我们提出了一种新型度量指标——每资源单元性能评分（PePR分数），用以估算单位资源对应的性能产出。通过131种独特深度学习架构（参数量跨度从1M到130M）与三个医学影像数据集的实验，我们捕捉到性能与资源之间的权衡趋势。在医学图像分析等应用中，我们认为小规模专业化模型优于追求大规模模型。此外，研究表明使用预训练模型可显著降低所需计算资源与数据量。希望这项工作能激励学界通过开发具有更小资源足迹的方法与模型，聚焦提升人工智能公平性。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/