Large Language Models (LLMs) have emerged as powerful tools capable of accomplishing a broad spectrum of tasks, and code generation is one area where they have made a significant impact. In this context, we treat LLMs as mutation and crossover operators. Meanwhile, Quality-Diversity (QD) algorithms are known to discover diverse and robust solutions. By merging the code-generating abilities of LLMs with the diversity and robustness of QD solutions, we introduce LLMatic, a Neural Architecture Search (NAS) algorithm. While LLMs struggle to conduct NAS directly through prompts, LLMatic uses a procedural approach, leveraging QD for both prompts and network architectures to create diverse and high-performing networks. We test LLMatic on the CIFAR-10 image classification benchmark, demonstrating that it can produce competitive networks with just $2,000$ searches, even without prior knowledge of the benchmark domain or exposure to any previous top-performing models for the benchmark.
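The idea of pairing a mutation operator with a QD archive can be illustrated with a minimal sketch. It assumes a MAP-Elites-style archive, which is one common QD algorithm; the abstract does not state which QD algorithm LLMatic uses. Everything in the block is hypothetical: `mutate` is a random stand-in for the LLM-driven code mutation, `fitness` is a toy score rather than a trained network's accuracy, and `descriptor` is an arbitrary bucketing of a (width, depth) pair. It is not the authors' implementation.

```python
import random


def mutate(candidate, rng):
    """Hypothetical stand-in for an LLM mutation: nudge width and depth at random."""
    width, depth = candidate
    width = max(1, width + rng.choice([-8, 8]))
    depth = max(1, depth + rng.choice([-1, 1]))
    return (width, depth)


def fitness(candidate):
    """Toy score (assumption): peaks at width 64, depth 6. Not a real accuracy."""
    width, depth = candidate
    return -abs(width - 64) - 4 * abs(depth - 6)


def descriptor(candidate):
    """Hypothetical behaviour descriptor: map a candidate to an archive cell."""
    width, depth = candidate
    return (min(width // 16, 7), min(depth // 2, 7))


def map_elites(iterations=2000, seed=0):
    """Run a MAP-Elites-style loop and return {cell: (score, candidate)}."""
    rng = random.Random(seed)
    start = (16, 2)
    archive = {descriptor(start): (fitness(start), start)}
    for _ in range(iterations):
        # Pick a random elite as the parent, then mutate it.
        _, parent = rng.choice(list(archive.values()))
        child = mutate(parent, rng)
        cell, score = descriptor(child), fitness(child)
        # Keep the child only if its cell is empty or it beats the current elite.
        if cell not in archive or score > archive[cell][0]:
            archive[cell] = (score, child)
    return archive


if __name__ == "__main__":
    result = map_elites()
    best_score, best_candidate = max(result.values())
    print(len(result), "cells filled; best candidate:", best_candidate, best_score)
```

The archive keeps one elite per cell, so the loop returns a set of candidates that differ in their descriptors instead of a single best one. In a real setting the mutation step would call an LLM to edit network code, and the fitness would come from training and evaluating the resulting network.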