Evolutionary Computation in the Era of Large Language Model: Survey and Roadmap

from arxiv, evolutionary algorithm (EA), large language model (LLM), optimization problem, prompt engineering, algorithm generation, neural architecture search

Large language models (LLMs) have not only revolutionized natural language processing but also extended their prowess to various domains, marking a significant stride towards artificial general intelligence. The interplay between LLMs and evolutionary algorithms (EAs), despite differing in objectives and methodologies, share a common pursuit of applicability in complex problems. Meanwhile, EA can provide an optimization framework for LLM's further enhancement under black-box settings, empowering LLM with flexible global search capacities. On the other hand, the abundant domain knowledge inherent in LLMs could enable EA to conduct more intelligent searches. Furthermore, the text processing and generative capabilities of LLMs would aid in deploying EAs across a wide range of tasks. Based on these complementary advantages, this paper provides a thorough review and a forward-looking roadmap, categorizing the reciprocal inspiration into two main avenues: LLM-enhanced EA and EA-enhanced LLM. Some integrated synergy methods are further introduced to exemplify the complementarity between LLMs and EAs in diverse scenarios, including code generation, software engineering, neural architecture search, and various generation tasks. As the first comprehensive review focused on the EA research in the era of LLMs, this paper provides a foundational stepping stone for understanding the collaborative potential of LLMs and EAs. The identified challenges and future directions offer guidance for researchers and practitioners to unlock the full potential of this innovative collaboration in propelling advancements in optimization and artificial intelligence. We have created a GitHub repository to index the relevant papers: https://github.com/wuxingyu-ai/LLM4EC.

翻译：大语言模型（LLMs）不仅彻底改变了自然语言处理领域，更将其强大能力扩展至众多其他领域，标志着向通用人工智能迈出了重要一步。尽管大语言模型与进化算法（EAs）在目标和方法上存在差异，但二者在复杂问题适用性方面有着共同的追求。一方面，进化算法能在黑盒设置下为大语言模型的进一步优化提供框架，赋予大语言模型灵活的全局搜索能力。另一方面，大语言模型内蕴的丰富领域知识可使进化算法进行更智能的搜索。此外，大语言模型的文本处理与生成能力将有助于进化算法在广泛任务中的部署。基于这些互补优势，本文进行了全面综述并提出了前瞻性路线图，将二者的相互启发归纳为两大主线：LLM增强的EA与EA增强的LLM。文中进一步介绍了一些综合协同方法，以例证大语言模型与进化算法在代码生成、软件工程、神经架构搜索及各类生成任务等多样化场景中的互补性。作为首篇聚焦大语言模型时代进化计算研究的全面综述，本文为理解大语言模型与进化算法的协作潜力提供了基础性阶梯。文中指出的挑战与未来方向为研究者和实践者提供了指引，以充分释放这一创新协作在推动优化与人工智能进步方面的潜力。我们已创建GitHub仓库以索引相关论文：https://github.com/wuxingyu-ai/LLM4EC。

相关内容

大语言模型

关注 66

大语言模型是基于海量文本数据训练的深度学习模型。它不仅能够生成自然语言文本，还能够深入理解文本含义，处理各种自然语言任务，如文本摘要、问答、翻译等。2023年，大语言模型及其在人工智能领域的应用已成为全球科技研究的热点，其在规模上的增长尤为引人注目，参数量已从最初的十几亿跃升到如今的一万亿。参数量的提升使得模型能够更加精细地捕捉人类语言微妙之处，更加深入地理解人类语言的复杂性。在过去的一年里，大语言模型在吸纳新知识、分解复杂任务以及图文对齐等多方面都有显著提升。随着技术的不断成熟，它将不断拓展其应用范围，为人类提供更加智能化和个性化的服务，进一步改善人们的生活和生产方式。

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日