ChipNeMo aims to explore the applications of large language models (LLMs) for industrial chip design. Instead of directly deploying off-the-shelf commercial or open-source LLMs, we adopt the following domain adaptation techniques: domain-adaptive tokenization, domain-adaptive continued pretraining, model alignment with domain-specific instructions, and domain-adapted retrieval models. We evaluate these methods on three selected LLM applications for chip design: an engineering assistant chatbot, EDA script generation, and bug summarization and analysis. Our evaluations demonstrate that domain-adaptive pretraining of language models can lead to superior performance on domain-related downstream tasks compared to their base LLaMA2 counterparts, without degradation in generic capabilities. In particular, our largest model, ChipNeMo-70B, outperforms the highly capable GPT-4 on two of our use cases, namely the engineering assistant chatbot and EDA script generation, while exhibiting competitive performance on bug summarization and analysis. These results underscore the potential of domain-specific customization for enhancing the effectiveness of large language models in specialized applications.