In this paper, we propose the Continuous Time Fractional Topic Model (cFTM), a new method for dynamic topic modeling. This approach incorporates fractional Brownian motion~(fBm) to effectively identify positive or negative correlations in topic and word distribution over time, revealing long-term dependency or roughness. Our theoretical analysis shows that the cFTM can capture these long-term dependency or roughness in both topic and word distributions, mirroring the main characteristics of fBm. Moreover, we prove that the parameter estimation process for the cFTM is on par with that of LDA, traditional topic models. To demonstrate the cFTM's property, we conduct empirical study using economic news articles. The results from these tests support the model's ability to identify and track long-term dependency or roughness in topics over time.
翻译:本文提出连续时间分数主题模型(cFTM),一种用于动态主题建模的新方法。该方法引入分数布朗运动(fBm),有效识别主题和词分布随时间呈现的正或负相关性,揭示长期依赖性或粗糙性。理论分析表明,cFTM能够捕捉主题和词分布中的长期依赖性或粗糙性,反映fBm的核心特征。此外,我们证明cFTM的参数估计过程与LDA等传统主题模型相当。为验证cFTM的特性,我们利用经济新闻文章开展实证研究。实验结果支持该模型识别和追踪主题随时间产生的长期依赖性或粗糙性的能力。