Input Convex LSTM: A Convex Approach for Fast Lyapunov-Based Model Predictive Control

Leveraging Input Convex Neural Networks (ICNNs), ICNN-based Model Predictive Control (MPC) successfully attains globally optimal solutions by upholding convexity within the MPC framework. However, current ICNN architectures encounter the issue of vanishing/exploding gradients, which limits their ability to serve as deep neural networks for complex tasks. Additionally, the current neural network-based MPC, including conventional neural network-based MPC and ICNN-based MPC, faces slower convergence speed when compared to MPC based on first-principles models. In this study, we leverage the principles of ICNNs to propose a novel Input Convex LSTM for Lyapunov-based MPC, with the specific goal of reducing convergence time and mitigating the vanishing/exploding gradient problem while ensuring closed-loop stability. From a simulation study of a nonlinear chemical reactor, we observed a mitigation of vanishing/exploding gradient problem and a reduction in convergence time, with a percentage decrease of 46.7%, 31.3%, and 20.2% compared to baseline plain RNN, plain LSTM, and Input Convex Recurrent Neural Network, respectively.

翻译：利用输入凸神经网络（ICNN）的ICNN-based模型预测控制（MPC）通过保持MPC框架内的凸性，成功获得了全局最优解。然而，现有ICNN架构存在梯度消失/爆炸问题，限制了其作为深度神经网络处理复杂任务的能力。此外，当前基于神经网络的MPC（包括传统神经网络MPC与ICNN-based MPC）相比基于第一性原理模型的MPC，收敛速度较慢。本研究借鉴ICNN原理提出了一种面向李雅普诺夫MPC的新型输入凸LSTM，旨在减少收敛时间、缓解梯度消失/爆炸问题，同时确保闭环稳定性。通过对非线性化学反应器的仿真研究，我们发现梯度消失/爆炸问题得到缓解，收敛时间相比基线普通RNN、普通LSTM和输入凸递归神经网络分别减少了46.7%、31.3%和20.2%。

相关内容

长短期记忆网络

关注 120

长短期记忆网络(LSTM)是一种用于深度学习领域的人工回归神经网络(RNN)结构。与标准的前馈神经网络不同，LSTM具有反馈连接。它不仅可以处理单个数据点(如图像)，还可以处理整个数据序列(如语音或视频)。例如，LSTM适用于未分段、连接的手写识别、语音识别、网络流量或IDSs(入侵检测系统)中的异常检测等任务。

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

专知会员服务

13+阅读 · 2022年3月12日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日