QKAN-LSTM: Quantum-inspired Kolmogorov-Arnold Long Short-term Memory

Long short-term memory (LSTM) models are a particular type of recurrent neural networks (RNNs) that are central to sequential modeling tasks in domains such as urban telecommunication forecasting, where temporal correlations and nonlinear dependencies dominate. However, conventional LSTMs suffer from high parameter redundancy and limited nonlinear expressivity. In this work, we propose the Quantum-inspired Kolmogorov-Arnold Long Short-Term Memory (QKAN-LSTM), which integrates Data Re-Uploading Activation (DARUAN) modules into the gating structure of LSTMs. Each DARUAN acts as a quantum variational activation function (QVAF), enhancing frequency adaptability and enabling an exponentially enriched spectral representation without multi-qubit entanglement. The resulting architecture preserves quantum-level expressivity while remaining fully executable on classical hardware. Empirical evaluations on three datasets, Damped Simple Harmonic Motion, Bessel Function, and Urban Telecommunication, demonstrate that QKAN-LSTM achieves superior predictive accuracy and generalization with a 79% reduction in trainable parameters compared to classical LSTMs. We extend the framework to the Jiang-Huang-Chen-Goan Network (JHCG Net), which generalizes KAN to encoder-decoder structures, and then further use QKAN to realize the latent KAN, thereby creating a Hybrid QKAN (HQKAN) for hierarchical representation learning. The proposed HQKAN-LSTM thus provides a scalable and interpretable pathway toward quantum-inspired sequential modeling in real-world data environments.

翻译：长短期记忆（LSTM）模型是一种特殊的循环神经网络（RNN），在城市电信预测等时序建模任务中至关重要，其中时间相关性和非线性依赖关系占主导地位。然而，传统LSTM存在参数冗余度高和非线性表达能力有限的问题。本文提出量子启发的Kolmogorov-Arnold长短期记忆网络（QKAN-LSTM），该模型将数据重上传激活（DARUAN）模块集成到LSTM的门控结构中。每个DARUAN模块作为量子变分激活函数（QVAF），增强了频率适应性，并实现了指数级丰富的频谱表示，而无需多量子比特纠缠。所得架构保持了量子级别的表达能力，同时完全可在经典硬件上执行。在阻尼简谐运动、贝塞尔函数和城市电信三个数据集上的实证评估表明，相较于经典LSTM，QKAN-LSTM以可训练参数减少79%的代价，实现了更优的预测精度和泛化能力。我们将该框架扩展至Jiang-Huang-Chen-Goan网络（JHCG Net），该网络将KAN推广至编码器-解码器结构，并进一步利用QKAN实现潜在KAN，从而构建用于分层表示学习的混合QKAN（HQKAN）。所提出的HQKAN-LSTM因此为现实数据环境中的量子启发时序建模提供了一条可扩展且可解释的路径。

相关内容

长短期记忆网络

关注 120

长短期记忆网络(LSTM)是一种用于深度学习领域的人工回归神经网络(RNN)结构。与标准的前馈神经网络不同，LSTM具有反馈连接。它不仅可以处理单个数据点(如图像)，还可以处理整个数据序列(如语音或视频)。例如，LSTM适用于未分段、连接的手写识别、语音识别、网络流量或IDSs(入侵检测系统)中的异常检测等任务。

管理 LLM 智能体中的演进式记忆：风险、机理及稳定性与安全性受控记忆（SSGM）框架

专知会员服务

16+阅读 · 3月14日

【AAAI2026】NeSTR：一种用于大型语言模型的神经-符号可溯因框架，用于时间推理

专知会员服务

17+阅读 · 2025年12月10日

【NeurIPS2025】VideoLucy：用于长视频理解的深度记忆回溯机制

专知会员服务

9+阅读 · 2025年10月15日