Self-Organized State-Space Models with Artificial Dynamics

We consider a state-space model (SSM) parametrized by a parameter $\theta$, and our aim is to perform joint parameter and state inference. A simple idea for this task, which dates back almost to the origin of the Kalman filter, is to replace the static parameter $\theta$ by a Markov chain $(\theta_t)_{t\geq 0}$ and then to apply a filtering algorithm to the extended, or self-organized, SSM (SO-SSM). However, implementing this idea in a theoretically justified way has remained an open problem. In this paper we fill this gap by introducing several constructions of $(\theta_t)_{t\geq 0}$ that ensure the validity of the SO-SSM for joint parameter and state inference. Notably, we show that such SO-SSMs can be defined even if $\|\mathrm{Var}(\theta_{t}|\theta_{t-1})\|\rightarrow 0$ slowly as $t\rightarrow\infty$. This result is important because, as illustrated in our numerical experiments, these models can be efficiently approximated using particle filter algorithms. While SO-SSMs were introduced for online inference, the development of iterated filtering (IF) algorithms has shown that they can also be used to compute the maximum likelihood estimator of a given SSM. In this work, we also derive constructions of $(\theta_t)_{t\geq 0}$ and theoretical guarantees tailored to these specific applications of SO-SSMs and, as a result, introduce new IF algorithms. From a practical point of view, the algorithms we develop are simple to implement and require only minimal tuning to perform well.
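To make the idea of artificial dynamics concrete, the following is a minimal sketch, not the construction studied in the paper. It assumes a toy model $x_t = \theta x_{t-1} + \varepsilon_t$, $y_t = x_t + \eta_t$ with standard Gaussian noise, and replaces $\theta$ by a Gaussian random walk $\theta_t = \theta_{t-1} + h_t \xi_t$ whose step size $h_t = c\,t^{-\alpha}$ shrinks slowly to zero. A bootstrap particle filter is then run on the extended state $(\theta_t, x_t)$. The toy model, the schedule $h_t$, the uniform prior on $\theta_0$, and the names `simulate`, `so_ssm_filter`, `c`, and `alpha` are all illustrative assumptions.

```python
import numpy as np

def simulate(theta, T, rng):
    """Simulate T observations from the assumed toy model
    x_t = theta * x_{t-1} + N(0, 1),  y_t = x_t + N(0, 1)."""
    x = 0.0
    ys = np.empty(T)
    for t in range(T):
        x = theta * x + rng.normal()
        ys[t] = x + rng.normal()
    return ys

def so_ssm_filter(ys, n_particles=2000, c=0.1, alpha=0.5, seed=0):
    """Bootstrap particle filter on the extended state (theta_t, x_t).

    theta_t follows an artificial random walk with step size
    h_t = c * t**(-alpha), an illustrative (assumed) schedule under
    which Var(theta_t | theta_{t-1}) = h_t**2 tends to 0 as t grows.
    Returns the filtering mean of theta_t at each time step.
    """
    rng = np.random.default_rng(seed)
    thetas = rng.uniform(-1.0, 1.0, n_particles)  # assumed prior on theta_0
    xs = rng.normal(size=n_particles)             # assumed prior on x_0
    means = np.empty(len(ys))
    for t, y in enumerate(ys, start=1):
        # Artificial dynamics: perturb the parameter particles.
        h = c * t ** (-alpha)
        thetas = thetas + h * rng.normal(size=n_particles)
        # Propagate the latent state using each particle's own theta_t.
        xs = thetas * xs + rng.normal(size=n_particles)
        # Weight by the Gaussian observation density (up to a constant).
        logw = -0.5 * (y - xs) ** 2
        w = np.exp(logw - logw.max())
        w /= w.sum()
        means[t - 1] = np.sum(w * thetas)
        # Multinomial resampling of the extended particles.
        idx = rng.choice(n_particles, size=n_particles, p=w)
        thetas, xs = thetas[idx], xs[idx]
    return means

ys = simulate(theta=0.7, T=200, rng=np.random.default_rng(1))
est = so_ssm_filter(ys)
```

Here `est[t]` is the particle approximation of the filtering mean of $\theta_t$ given the first $t+1$ observations. A larger `alpha` makes the artificial noise vanish faster, which reduces the bias introduced by the artificial dynamics but leaves the filter less able to correct a poor early estimate.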

