Input Convex LSTM: A Convex Approach for Fast Lyapunov-Based Model Predictive Control

Leveraging Input Convex Neural Networks (ICNNs), ICNN-based Model Predictive Control (MPC) successfully attains globally optimal solutions by upholding convexity within the MPC framework. However, current ICNN architectures encounter the issue of vanishing gradients, which limits their ability to serve as deep neural networks for complex tasks. Additionally, the current neural network-based MPC, including conventional neural network-based MPC and ICNN-based MPC, faces slower convergence speed when compared to MPC based on first-principles models. In this study, we leverage the principles of ICNNs to propose a novel Input Convex LSTM for Lyapunov-based MPC, with the specific goal of reducing convergence time and mitigating the vanishing gradient problem while ensuring closed-loop stability. From a simulation study of a nonlinear chemical reactor, we observed a mitigation of vanishing gradient problem and a reduction in convergence time, with a percentage decrease of 46.7%, 31.3%, and 20.2% compared to baseline plain RNN, plain LSTM, and Input Convex Recurrent Neural Network, respectively.

翻译：利用输入凸神经网络（ICNNs），基于ICNN的模型预测控制（MPC）通过在MPC框架内保持凸性，成功实现了全局最优解。然而，当前ICNN架构面临梯度消失问题，这限制了其作为深度神经网络处理复杂任务的能力。此外，与基于第一性原理模型的MPC相比，当前的神经网络MPC（包括传统神经网络MPC和ICNN-based MPC）收敛速度较慢。本研究利用ICNN原理，提出了一种新型输入凸LSTM用于李雅普诺夫MPC，旨在减少收敛时间并缓解梯度消失问题，同时确保闭环稳定性。通过对一个非线性化学反应器的仿真研究，我们观察到梯度消失问题得到缓解，收敛时间减少，与基线普通RNN、普通LSTM和输入凸循环神经网络相比，分别减少了46.7%、31.3%和20.2%。

相关内容

长短期记忆网络

关注 120

长短期记忆网络(LSTM)是一种用于深度学习领域的人工回归神经网络(RNN)结构。与标准的前馈神经网络不同，LSTM具有反馈连接。它不仅可以处理单个数据点(如图像)，还可以处理整个数据序列(如语音或视频)。例如，LSTM适用于未分段、连接的手写识别、语音识别、网络流量或IDSs(入侵检测系统)中的异常检测等任务。

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

专知会员服务

14+阅读 · 2022年3月12日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日