Advancing continual lifelong learning in neural information retrieval: definition, dataset, framework, and empirical evaluation

Continual learning refers to the capability of a machine learning model to learn and adapt to new information, without compromising its performance on previously learned tasks. Although several studies have investigated continual learning methods for information retrieval tasks, a well-defined task formulation is still lacking, and it is unclear how typical learning strategies perform in this context. To address this challenge, a systematic task formulation of continual neural information retrieval is presented, along with a multiple-topic dataset that simulates continuous information retrieval. A comprehensive continual neural information retrieval framework consisting of typical retrieval models and continual learning strategies is then proposed. Empirical evaluations illustrate that the proposed framework can successfully prevent catastrophic forgetting in neural information retrieval and enhance performance on previously learned tasks. The results indicate that embedding-based retrieval models experience a decline in their continual learning performance as the topic shift distance and dataset volume of new tasks increase. In contrast, pretraining-based models do not show any such correlation. Adopting suitable learning strategies can mitigate the effects of topic shift and data augmentation.

翻译：持续学习指的是机器学习模型学习和适应新信息的能力，同时不损害其在先前学习任务上的性能。尽管已有若干研究探讨了信息检索任务的持续学习方法，但一个明确定义的任务表述仍然缺乏，并且典型的学习策略在此背景下的表现尚不明确。为应对这一挑战，本文提出了持续神经信息检索的系统化任务表述，以及一个模拟连续信息检索的多主题数据集。随后，提出了一个由典型检索模型和持续学习策略组成的综合性持续神经信息检索框架。实证评估表明，所提框架能够成功防止神经信息检索中的灾难性遗忘，并提升在先前学习任务上的性能。结果表明，基于嵌入的检索模型的持续学习性能会随着新任务的主题偏移距离和数据集规模的增加而下降。相比之下，基于预训练的模型未显示出任何此类相关性。采用合适的学习策略可以减轻主题偏移和数据增强的影响。

相关内容

Continuity

关注 4

让 iOS 8 和 OS X Yosemite 无缝切换的一个新特性。 > Apple products have always been designed to work together beautifully. But now they may really surprise you. With iOS 8 and OS X Yosemite, you’ll be able to do more wonderful things than ever before.

Source: Apple - iOS 8

《生成式模型: 变分自编码器与扩散模型》，75页ppt，Google DeepMind科学家Ruiqi Gao

专知会员服务

66+阅读 · 2023年6月10日

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【ACL2020】多模态信息抽取，365页ppt

专知会员服务

151+阅读 · 2020年7月6日

【AI应用】Facebook-利用神经网络求解高等数学方程, Using neural networks to solve advanced mathematics equations

专知会员服务

34+阅读 · 2020年1月15日