Inductive-bias Learning: Generating Code Models with Large Language Model

Large Language Models(LLMs) have been attracting attention due to a ability called in-context learning(ICL). ICL, without updating the parameters of a LLM, it is possible to achieve highly accurate inference based on rules ``in the context'' by merely inputting a training data into the prompt. Although ICL is a developing field with many unanswered questions, LLMs themselves serves as a inference model, seemingly realizing inference without explicitly indicate ``inductive bias''. On the other hand, a code generation is also a highlighted application of LLMs. The accuracy of code generation has dramatically improved, enabling even non-engineers to generate code to perform the desired tasks by crafting appropriate prompts. In this paper, we propose a novel ``learning'' method called an ``Inductive-Bias Learning (IBL)'', which combines the techniques of ICL and code generation. An idea of IBL is straightforward. Like ICL, IBL inputs a training data into the prompt and outputs a code with a necessary structure for inference (we referred to as ``Code Model'') from a ``contextual understanding''. Despite being a seemingly simple approach, IBL encompasses both a ``property of inference without explicit inductive bias'' inherent in ICL and a ``readability and explainability'' of the code generation. Surprisingly, generated Code Models have been found to achieve predictive accuracy comparable to, and in some cases surpassing, ICL and representative machine learning models. Our IBL code is open source: https://github.com/fuyu-quant/IBLM

翻译：大型语言模型（LLMs）因其被称为上下文学习（ICL）的能力而备受关注。通过ICL，无需更新LLM的参数，仅将训练数据输入提示中，即可基于“上下文”中的规则实现高精度推理。尽管ICL是一个充满未解之谜的新兴领域，但LLM本身充当推理模型，似乎在无需明确指定“归纳偏置”的情况下实现了推理。另一方面，代码生成也是LLM的一个突出应用。代码生成的准确性已显著提升，即使是非工程师也能通过精心设计提示，生成执行所需任务的代码。本文提出了一种新颖的“学习”方法，称为“归纳偏置学习（IBL）”，它融合了ICL和代码生成的技术。IBL的思想直截了当：如同ICL一样，IBL将训练数据输入提示中，并通过“上下文理解”输出具有推理所需结构的代码（我们称之为“代码模型”）。尽管这种方法看似简单，但IBL同时包含了ICL固有的“无需显式归纳偏置的推理特性”和代码生成的“可读性与可解释性”。令人惊讶的是，生成的代码模型在预测准确性上已达到与ICL及代表性机器学习模型相当甚至更高的水平。我们的IBL代码已开源：https://github.com/fuyu-quant/IBLM

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日