Can GPT-4 Perform Neural Architecture Search?

We investigate the potential of GPT-4~\cite{gpt4} to perform Neural Architecture Search (NAS) -- the task of designing effective neural architectures. Our proposed approach, \textbf{G}PT-4 \textbf{I}nformed \textbf{N}eural \textbf{A}rchitecture \textbf{S}earch (GINAS),leverages the generative capabilities of GPT-4 as a black-box optimiser to quickly navigate the architecture search space, pinpoint promising candidates, and iteratively refine these candidates to improve performance.We assess GINAS across several benchmarks, comparing it with existing state-of-the-art NAS techniques to illustrate its effectiveness. Rather than targeting state-of-the-art performance, our objective is to highlight GPT-4's potential to assist research on a challenging technical problem through a simple prompting scheme that requires relatively limited domain expertise. More broadly, we believe our preliminary results point to future research that harnesses general purpose language models for diverse optimisation tasks. We also highlight important limitations to our study, and note implications for AI safety.

翻译：我们研究了GPT-4~\cite{gpt4}执行神经架构搜索（NAS）——即设计有效神经架构任务的潜力。我们提出的方法——**G**PT-4 **I**nformed **N**eural **A**rchitecture **S**earch（GINAS）——利用GPT-4的生成能力作为黑箱优化器，快速导航架构搜索空间，定位有潜力的候选架构，并通过迭代优化这些候选架构以提升性能。我们在多个基准测试上评估了GINAS，并与现有最先进的NAS技术进行了对比，以展示其有效性。我们的目标并非追求最先进的性能，而是通过一种仅需相对有限领域知识的简单提示方案，凸显GPT-4在辅助解决技术难题方面的潜力。更广泛地，我们认为初步结果表明，未来研究可借助通用语言模型处理多样化优化任务。同时，我们指出了本研究的重要局限性，并探讨了对人工智能安全的影响。

相关内容

GPT-4

关注 29

北京时间2023年3月15日凌晨，ChatGPT开发商OpenAI 发布了发布了全新的多模态预训练大模型 GPT-4，可以更可靠、更具创造力、能处理更细节的指令，根据图片和文字提示都能生成相应内容。具体来说来说，GPT-4 相比上一代的模型，实现了飞跃式提升：支持图像和文本输入，拥有强大的识图能力；大幅提升了文字输入限制，在ChatGPT模式下，GPT-4可以处理超过2.5万字的文本，可以处理一些更加细节的指令；回答准确性也得到了显著提高。

评估ChatGPT的信息提取能力:对性能、可解释性、校准和忠实度的评估

专知会员服务

77+阅读 · 2023年4月26日

生成式推荐: 迈向下一代推荐系统新范式

专知会员服务

49+阅读 · 2023年4月15日

百篇论文纵览大型语言模型最新研究进展

专知会员服务

70+阅读 · 2023年3月31日

《校准自主性中的信任》2022最新16页slides

专知会员服务

21+阅读 · 2022年12月7日