In-context learning for model-free system identification

In traditional system identification, we estimate a model of an unknown dynamical system based on given input/output sequences and available physical knowledge. Yet, is it also possible to understand the intricacies of dynamical systems not solely from their input/output patterns, but by observing the behavior of other systems within the same class? This central question drives the study presented in this paper. In response to this query, we introduce a novel paradigm for system identification, addressing two primary tasks: one-step-ahead prediction and multi-step simulation. Unlike conventional methods, we do not directly estimate a model for the specific system. Instead, we pretrain a meta model that represents a class of dynamical systems. This meta model is trained from a potentially infinite stream of synthetic data, generated by systems randomly extracted from a certain distribution. At its core, the meta model serves as an implicit representation of the main characteristics of a class of dynamical systems. When provided with a brief context from a new system - specifically, a short input/output sequence - the meta model implicitly discerns its dynamics, enabling predictions of its behavior. The proposed approach harnesses the power of Transformer architectures, renowned for their in-context learning capabilities in Natural Language Processing tasks. For one-step prediction, a GPT-like decoder-only architecture is utilized, whereas the simulation problem employs an encoder-decoder structure. Initial experimental results affirmatively answer our foundational question, opening doors to fresh research avenues in system identification.

翻译：在传统系统辨识中，我们基于给定的输入/输出序列及已知物理知识来估计未知动力学系统的模型。然而，是否可能不仅通过输入/输出模式，还能通过观察同一类中其他系统的行为来理解复杂动力学系统？这一核心问题驱动了本文的研究。针对此问题，我们提出了一种新的系统辨识范式，主要解决两个任务：单步预测和多步仿真。与传统方法不同，我们并不直接估计特定系统的模型，而是预训练一个能代表某类动力学系统的元模型。该元模型通过可能无限长的合成数据流进行训练，这些数据由从特定分布中随机抽取的系统生成。本质上，元模型隐式表征了一类动力学系统的主要特征。当提供来自新系统的简短上下文（即短输入/输出序列）时，元模型能隐式辨识其动力学特性，从而对其行为进行预测。所提方法利用了Transformer架构的优势——该架构以其在自然语言处理任务中的上下文学习能力而著称。对于单步预测，我们采用类似GPT的解码器专用架构；而仿真问题则使用编码-解码结构。初步实验结果肯定地回答了我们的基本问题，为系统辨识领域开辟了新的研究方向。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【ACL2020】多模态信息抽取，365页ppt

专知会员服务

151+阅读 · 2020年7月6日