OpenCarbonEval: A Unified Carbon Emission Estimation Framework in Large-Scale AI Models

In recent years, large-scale auto-regressive models have made significant progress in various tasks, such as text or video generation. However, the environmental impact of these models has been largely overlooked, with a lack of assessment and analysis of their carbon footprint. To address this gap, we introduce OpenCarbonEval, a unified framework for integrating large-scale models across diverse modalities to predict carbon emissions, which could provide AI service providers and users with a means to estimate emissions beforehand and help mitigate the environmental pressure associated with these models. In OpenCarbonEval, we propose a dynamic throughput modeling approach that could capture workload and hardware fluctuations in the training process for more precise emissions estimates. Our evaluation results demonstrate that OpenCarbonEval can more accurately predict training emissions than previous methods, and can be seamlessly applied to different modal tasks. Specifically, we show that OpenCarbonEval achieves superior performance in predicting carbon emissions for both visual models and language models. By promoting sustainable AI development and deployment, OpenCarbonEval can help reduce the environmental impact of large-scale models and contribute to a more environmentally responsible future for the AI community.

翻译：近年来，大规模自回归模型在文本或视频生成等多种任务中取得了显著进展。然而，这些模型的环境影响在很大程度上被忽视，缺乏对其碳足迹的评估与分析。为填补这一空白，我们提出了OpenCarbonEval，一个用于整合跨不同模态的大规模模型以预测碳排放的统一框架，可为AI服务提供商和用户提供事先估算排放量的手段，帮助缓解这些模型带来的环境压力。在OpenCarbonEval中，我们提出了一种动态吞吐量建模方法，能够捕捉训练过程中的工作负载和硬件波动，从而实现更精确的排放估算。我们的评估结果表明，OpenCarbonEval相比以往方法能更准确地预测训练排放，并可无缝应用于不同模态的任务。具体而言，我们证明OpenCarbonEval在预测视觉模型和语言模型的碳排放方面均表现出优越性能。通过促进可持续的AI开发与部署，OpenCarbonEval有助于减少大规模模型的环境影响，为AI社区迈向更环保的未来作出贡献。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日