GreenMachine: Automatic Design of Zero-Cost Proxies for Energy-Efficient NAS

Artificial Intelligence (AI) has driven innovations and created new opportunities across various sectors. However, leveraging domain-specific knowledge often requires automated tools to design and configure models effectively. In the case of Deep Neural Networks (DNNs), researchers and practitioners usually resort to Neural Architecture Search (NAS) approaches, which are resource- and time-intensive, requiring the training and evaluation of numerous candidate architectures. This raises sustainability concerns, particularly due to the high energy demands involved, creating a paradox: the pursuit of the most effective model can undermine sustainability goals. To mitigate this issue, zero-cost proxies have emerged as a promising alternative. These proxies estimate a model's performance without the need for full training, offering a more efficient approach. This paper addresses the challenges of model evaluation by automatically designing zero-cost proxies to assess DNNs efficiently. Our method begins with a randomly generated set of zero-cost proxies, which are evolved and tested using the NATS-Bench benchmark. We assess the proxies' effectiveness using both randomly sampled and stratified subsets of the search space, ensuring they can differentiate between low- and high-performing networks and enhance generalizability. Results show our method outperforms existing approaches on the stratified sampling strategy, achieving strong correlations with ground truth performance, including a Kendall correlation of 0.89 on CIFAR-10 and 0.77 on CIFAR-100 with NATS-Bench-SSS and a Kendall correlation of 0.78 on CIFAR-10 and 0.71 on CIFAR-100 with NATS-Bench-TSS.

翻译：人工智能（AI）推动了各领域的创新并创造了新的机遇。然而，有效利用领域特定知识通常需要自动化工具来设计和配置模型。对于深度神经网络（DNN），研究者和实践者通常采用神经架构搜索（NAS）方法，但这些方法资源消耗大、耗时长，需要训练和评估大量候选架构。这引发了可持续性担忧，特别是所涉及的高能耗需求造成了一个悖论：追求最高效的模型可能破坏可持续性目标。为缓解此问题，零成本代理已成为一种有前景的替代方案。这些代理无需完整训练即可评估模型性能，提供了一种更高效的途径。本文通过自动设计零成本代理来高效评估DNN，以应对模型评估的挑战。我们的方法从随机生成的一组零成本代理开始，利用NATS-Bench基准进行演化与测试。我们通过随机采样和搜索空间分层抽样两种方式评估代理的有效性，确保其能区分低性能与高性能网络并提升泛化能力。实验结果表明，在分层抽样策略上，我们的方法优于现有方案，与真实性能指标呈现强相关性：在NATS-Bench-SSS上获得CIFAR-10的肯德尔相关系数0.89和CIFAR-100的0.77；在NATS-Bench-TSS上获得CIFAR-10的0.78和CIFAR-100的0.71。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日