In-Context Operator Learning with Data Prompts for Differential Equation Problems

from arxiv, The second and third authors contributed equally. This is an outdated preprint. Please refer to the updated version published in PNAS: www.pnas.org/doi/10.1073/pnas.2310142120 See code in https://github.com/LiuYangMage/in-context-operator-networks

This paper introduces a new neural-network-based approach, namely In-Context Operator Networks (ICON), to simultaneously learn operators from the prompted data and apply it to new questions during the inference stage, without any weight update. Existing methods are limited to using a neural network to approximate a specific equation solution or a specific operator, requiring retraining when switching to a new problem with different equations. By training a single neural network as an operator learner, we can not only get rid of retraining (even fine-tuning) the neural network for new problems, but also leverage the commonalities shared across operators so that only a few demos in the prompt are needed when learning a new operator. Our numerical results show the neural network's capability as a few-shot operator learner for a diversified type of differential equation problems, including forward and inverse problems of ordinary differential equations (ODEs), partial differential equations (PDEs), and mean-field control (MFC) problems, and also show that it can generalize its learning capability to operators beyond the training distribution.

翻译：本文提出了一种新的基于神经网络的方法，即上下文算子网络（ICON），该方法能够同时从提示数据中学习算子，并在推理阶段将其应用于新问题，而无需任何权重更新。现有方法仅限于使用神经网络逼近特定方程的解或特定算子，当切换到不同方程的新问题时需要重新训练。通过训练单个神经网络作为算子学习器，我们不仅能够避免针对新问题重新训练（甚至微调）神经网络，还能利用算子间的共性，从而在学习新算子时只需少量提示示例。我们的数值结果表明，该神经网络作为小样本算子学习器，能够处理多种类型的微分方程问题，包括常微分方程（ODEs）、偏微分方程（PDEs）的正向与逆向问题以及平均场控制（MFC）问题，并且能够将其学习能力泛化到训练分布之外的算子。

相关内容

Networking

关注 23

Networking：IFIP International Conferences on Networking。 Explanation：国际网络会议。 Publisher：IFIP。 SIT： http://dblp.uni-trier.de/db/conf/networking/index.html

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日