Measuring the stability and plasticity of recommender systems - 专知论文

会员服务 ·

0

算法 · 可塑性 · 系统 · 数据集 · 交互 ·

Measuring the stability and plasticity of recommender systems

翻译：测量推荐系统的稳定性与可塑性

Maria João Lavoura,Robert Jungnickel,João Vinagre

from arxiv, Final version published in the proceedings of ACM UMAP 2026: https://doi.org/10.1145/3774935.3812707

The typical offline protocol to evaluate recommendation algorithms is to collect a dataset of user-item interactions and then use a part of this dataset to train a model, and the remaining data to measure how closely the model recommendations match the observed user interactions. This protocol is straightforward, useful and practical, but it only provides snapshot performance. We know, however, that online systems evolve over time. In general, it is a good idea that models are frequently retrained with recent data. But if this is the case, to what extent can we trust previous evaluations? How will a model perform when a different pattern (re)emerges? In this paper we propose a methodology to study how recommendation models behave when they are retrained. The idea is to profile algorithms according to their ability to, on the one hand, retain past patterns - stability - and, on the other hand, (quickly) adapt to changes - plasticity. We devise an offline evaluation protocol that provides detail on the long-term behavior of models, and that is agnostic to datasets, algorithms and metrics. To illustrate the potential of this framework, we present preliminary results of three different types of algorithms on the GoodReads dataset that suggest different stability and plasticity profiles depending on the algorithmic technique, and a possible trade-off between stability and plasticity. We further discuss the potential and limitations of the proposal and advance some possible improvements.

翻译：典型的推荐算法离线评估协议是：收集用户-物品交互数据集，利用其中一部分训练模型，再用剩余数据衡量模型推荐结果与观测到的用户交互的吻合程度。该协议简洁实用且具操作性，但仅能提供静态快照式的性能评估。然而我们知道，在线系统会随时间动态演化。一般而言，定期用最新数据重新训练模型是合理做法。但若如此，我们能在多大程度上信任先前的评估结果？当不同模式（重新）出现时，模型将如何表现？本文提出一种方法论，用于研究推荐模型在重新训练时的行为特性。其核心思路是通过算法两方面的能力进行特征刻画：一是保留历史模式的能力（稳定性），二是快速适应变化的能力（可塑性）。我们设计了一套离线评估协议，能够详细揭示模型的长期行为特征，且该协议与数据集、算法和评估指标无关。为展示该框架的潜力，我们基于GoodReads数据集对三类不同算法进行了初步实验，结果表明不同算法技术会呈现差异化的稳定-可塑性特征，且两者间可能存在权衡关系。本文还进一步讨论了该方法的潜力与局限性，并提出若干可能的改进方向。

0

相关内容

在数学和计算机科学之中，算法（Algorithm）为一个计算的具体步骤，常用于计算、数据处理和自动推理。精确而言，算法是一个表示为有限长列表的有效方法。算法应包含清晰定义的指令用于计算函数。来自维基百科：算法

基于因果推断的推荐系统去偏研究

基于因果推断的推荐系统去偏研究

专知会员服务

21+阅读 · 2024年11月10日

推荐系统技术综述

推荐系统技术综述

专知会员服务

55+阅读 · 2023年5月13日

推荐如何用多模态信息？南洋理工最新《多模态推荐系统》综述，33页pdf阐述多模态推荐系统的分类、评价和未来方向

推荐如何用多模态信息？南洋理工最新《多模态推荐系统》综述，33页pdf阐述多模态推荐系统的分类、评价和未来方向

专知会员服务

49+阅读 · 2023年2月13日

推荐系统如何可信？罗格斯大学最新《可信推荐系统》综述，43页pdf阐述可信RS组成与技术

推荐系统如何可信？罗格斯大学最新《可信推荐系统》综述，43页pdf阐述可信RS组成与技术

专知会员服务

33+阅读 · 2022年8月8日

【深度推荐系统：基础与进展】密歇根州立大学、香港理工大学、百度专家联合推出教程，Deep Recommender System: Fundamentals and Advances

【深度推荐系统：基础与进展】密歇根州立大学、香港理工大学、百度专家联合推出教程，Deep Recommender System: Fundamentals and Advances

专知会员服务

20+阅读 · 2022年2月25日

【干货书】实战推荐系统，Practical Recommender Systems，432页pdf

【干货书】实战推荐系统，Practical Recommender Systems，432页pdf

专知会员服务

181+阅读 · 2020年4月17日

对话推荐系统综述论文，35页pdf，A Survey on Conversational Recommender Systems

对话推荐系统综述论文，35页pdf，A Survey on Conversational Recommender Systems

专知会员服务

117+阅读 · 2020年4月3日

【WSDM2020 Tutorial】图学习与推理的推荐系统，130页ppt，Learning and Reasoning on Graph for Recommendation，新加坡国立大学

【WSDM2020 Tutorial】图学习与推理的推荐系统，130页ppt，Learning and Reasoning on Graph for Recommendation，新加坡国立大学

专知会员服务

98+阅读 · 2020年2月7日

【书籍推荐】Practical Recommender Systems一书涵盖推荐系统原理与实战技巧

【书籍推荐】Practical Recommender Systems一书涵盖推荐系统原理与实战技巧

专知会员服务

66+阅读 · 2019年10月25日

【RecSys 2019报告】推荐系统的意图，算法以及指标（Recommending for Impact:Intentions, Algorithms, and Metrics）

【RecSys 2019报告】推荐系统的意图，算法以及指标（Recommending for Impact:Intentions, Algorithms, and Metrics）

专知会员服务

37+阅读 · 2019年10月9日

推荐系统（一）：推荐系统基础

推荐系统（一）：推荐系统基础

菜鸟的机器学习

25+阅读 · 2019年9月2日

推荐系统产品与算法概述 | 深度

推荐系统产品与算法概述 | 深度

AI100

11+阅读 · 2019年6月13日

深度 | 推荐系统评估

深度 | 推荐系统评估

AI100

24+阅读 · 2019年3月16日

详解 | 推荐系统的工程实现

详解 | 推荐系统的工程实现

AI100

42+阅读 · 2019年3月15日

推荐系统

炼数成金订阅号

28+阅读 · 2019年1月17日

推荐系统概述

推荐系统概述

Linux爱好者

20+阅读 · 2018年9月6日

36页最新《深度学习在推荐系统上的应用》综述论文，209篇参考论文

36页最新《深度学习在推荐系统上的应用》综述论文，209篇参考论文

专知

24+阅读 · 2018年9月6日

深度学习在推荐系统中的应用综述（最全）

深度学习在推荐系统中的应用综述（最全）

七月在线实验室

17+阅读 · 2018年5月5日

一文读懂推荐系统知识体系-下（评估、实战、学习资料）

一文读懂推荐系统知识体系-下（评估、实战、学习资料）

AI100

34+阅读 · 2017年11月7日

推荐系统杂谈

推荐系统杂谈

架构文摘

28+阅读 · 2017年9月15日

推荐系统的信息核挖掘及其应用研究

国家自然科学基金

8+阅读 · 2015年12月31日

面向推荐系统中异构隐式反馈建模的迁移学习技术研究

国家自然科学基金

5+阅读 · 2015年12月31日

基于在线消费者购买意向挖掘的个性化推荐研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于异构信息网络的分类算法推荐方法研究

国家自然科学基金

7+阅读 · 2015年12月31日

基于领域知识和链路预测的个性化推荐研究

国家自然科学基金

4+阅读 · 2014年12月31日

在线服务信誉可比较性及其保障机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于自适应模型检测的安全协议自动建模与设计研究

国家自然科学基金

1+阅读 · 2014年12月31日

基于人眼关注度与情感分析的电子商务智能推荐计算

国家自然科学基金

0+阅读 · 2014年12月31日

基于第三方的APP软件质量度量和评估方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

具有可靠性增长的系统可靠性试验鉴定方法研究

国家自然科学基金

10+阅读 · 2013年12月31日

From Top-1 to Top-K: A Reproducibility Study and Benchmarking of Counterfactual Explanations for Recommender Systems

Arxiv

0+阅读 · 4月21日

On the Accuracy Limits of Sequential Recommender Systems: An Entropy-Based Approach

Arxiv

0+阅读 · 4月14日

The Unreasonable Effectiveness of Data for Recommender Systems

Arxiv

0+阅读 · 4月9日

The Unreasonable Effectiveness of Data for Recommender Systems

Arxiv

0+阅读 · 4月7日

Measuring the Predictability of Recommender Systems using Structural Complexity Metrics

Arxiv

0+阅读 · 3月31日

On the Accuracy Limits of Sequential Recommender Systems: An Entropy-Based Approach

Arxiv

0+阅读 · 3月30日

Rethinking Recommendation Paradigms: From Pipelines to Agentic Recommender Systems

Arxiv

0+阅读 · 3月27日

Deep Research for Recommender Systems

Arxiv

0+阅读 · 3月8日

A Survey on Bundle Recommendation: Methods, Applications, and Challenges

Arxiv

0+阅读 · 2月26日

A Comprehensive Survey on Multimodal Recommender Systems: Taxonomy, Evaluation, and Future Directions

Arxiv

16+阅读 · 2023年2月9日

VIP会员

文章信息

相关主题

最新内容

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

专知会员服务

1+阅读 · 今天15:02

综述 | 3D场景图：开放挑战与未来方向

综述 | 3D场景图：开放挑战与未来方向

专知会员服务

1+阅读 · 今天15:00

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

专知会员服务

2+阅读 · 今天14:30

21世纪的无人机战争

21世纪的无人机战争

专知会员服务

2+阅读 · 今天14:05

《伊朗与以色列-美国热战及其对数字技术的影响》

《伊朗与以色列-美国热战及其对数字技术的影响》

专知会员服务

2+阅读 · 今天13:55

《量子技术的军事任务技术适配与利用》

《量子技术的军事任务技术适配与利用》

专知会员服务

2+阅读 · 今天13:51

《美国陆军军官学校（西点军校）本科生科研中生成式人工智能的使用》

《美国陆军军官学校（西点军校）本科生科研中生成式人工智能的使用》

专知会员服务

2+阅读 · 今天13:48

美国从乌克兰无人机战争中学习经验

美国从乌克兰无人机战争中学习经验

专知会员服务

7+阅读 · 6月21日

ICML 2026 | 面向视觉语言模型的语义鲁棒性认证

ICML 2026 | 面向视觉语言模型的语义鲁棒性认证

专知会员服务

5+阅读 · 6月21日

综述 | 智能体电子设计自动化：从“交接有效性”重新理解Agentic EDA

综述 | 智能体电子设计自动化：从“交接有效性”重新理解Agentic EDA

专知会员服务

7+阅读 · 6月21日

深入解读 Palantir AIP：全球最具争议的人工智能平台究竟如何运作

深入解读 Palantir AIP：全球最具争议的人工智能平台究竟如何运作

专知会员服务

20+阅读 · 6月20日

ICML 2026 | 多任务贝叶斯上下文学习：让 Transformer 在测试时显式适应新先验

ICML 2026 | 多任务贝叶斯上下文学习：让 Transformer 在测试时显式适应新先验

专知会员服务

5+阅读 · 6月19日

ACL 2026综述 | 大规模手语数据集：资源、基准与标注标准

ACL 2026综述 | 大规模手语数据集：资源、基准与标注标准

专知会员服务

8+阅读 · 6月19日

ICML 2026 Spotlight | SmoothSMoE：解析稀疏 MoE 路由不连续

ICML 2026 Spotlight | SmoothSMoE：解析稀疏 MoE 路由不连续

专知会员服务

7+阅读 · 6月18日

综述 | 周期表视角下的大模型推理：范式、方法与失败模式

综述 | 周期表视角下的大模型推理：范式、方法与失败模式

专知会员服务

9+阅读 · 6月18日

相关VIP内容

基于因果推断的推荐系统去偏研究

基于因果推断的推荐系统去偏研究

专知会员服务

21+阅读 · 2024年11月10日

推荐系统技术综述

推荐系统技术综述

专知会员服务

55+阅读 · 2023年5月13日

推荐如何用多模态信息？南洋理工最新《多模态推荐系统》综述，33页pdf阐述多模态推荐系统的分类、评价和未来方向

推荐如何用多模态信息？南洋理工最新《多模态推荐系统》综述，33页pdf阐述多模态推荐系统的分类、评价和未来方向

专知会员服务

49+阅读 · 2023年2月13日

推荐系统如何可信？罗格斯大学最新《可信推荐系统》综述，43页pdf阐述可信RS组成与技术

推荐系统如何可信？罗格斯大学最新《可信推荐系统》综述，43页pdf阐述可信RS组成与技术

专知会员服务

33+阅读 · 2022年8月8日

【深度推荐系统：基础与进展】密歇根州立大学、香港理工大学、百度专家联合推出教程，Deep Recommender System: Fundamentals and Advances

【深度推荐系统：基础与进展】密歇根州立大学、香港理工大学、百度专家联合推出教程，Deep Recommender System: Fundamentals and Advances

专知会员服务

20+阅读 · 2022年2月25日

【干货书】实战推荐系统，Practical Recommender Systems，432页pdf

【干货书】实战推荐系统，Practical Recommender Systems，432页pdf

专知会员服务

181+阅读 · 2020年4月17日

对话推荐系统综述论文，35页pdf，A Survey on Conversational Recommender Systems

对话推荐系统综述论文，35页pdf，A Survey on Conversational Recommender Systems

专知会员服务

117+阅读 · 2020年4月3日

【WSDM2020 Tutorial】图学习与推理的推荐系统，130页ppt，Learning and Reasoning on Graph for Recommendation，新加坡国立大学

【WSDM2020 Tutorial】图学习与推理的推荐系统，130页ppt，Learning and Reasoning on Graph for Recommendation，新加坡国立大学

专知会员服务

98+阅读 · 2020年2月7日

【书籍推荐】Practical Recommender Systems一书涵盖推荐系统原理与实战技巧

【书籍推荐】Practical Recommender Systems一书涵盖推荐系统原理与实战技巧

专知会员服务

66+阅读 · 2019年10月25日

【RecSys 2019报告】推荐系统的意图，算法以及指标（Recommending for Impact:Intentions, Algorithms, and Metrics）

【RecSys 2019报告】推荐系统的意图，算法以及指标（Recommending for Impact:Intentions, Algorithms, and Metrics）

专知会员服务

37+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

综述 | 3D场景图：开放挑战与未来方向

21世纪的无人机战争

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

相关资讯

推荐系统（一）：推荐系统基础

推荐系统（一）：推荐系统基础

菜鸟的机器学习

25+阅读 · 2019年9月2日

推荐系统产品与算法概述 | 深度

推荐系统产品与算法概述 | 深度

AI100

11+阅读 · 2019年6月13日

深度 | 推荐系统评估

深度 | 推荐系统评估

AI100

24+阅读 · 2019年3月16日

详解 | 推荐系统的工程实现

详解 | 推荐系统的工程实现

AI100

42+阅读 · 2019年3月15日

推荐系统

炼数成金订阅号

28+阅读 · 2019年1月17日

推荐系统概述

推荐系统概述

Linux爱好者

20+阅读 · 2018年9月6日

36页最新《深度学习在推荐系统上的应用》综述论文，209篇参考论文

36页最新《深度学习在推荐系统上的应用》综述论文，209篇参考论文

专知

24+阅读 · 2018年9月6日

深度学习在推荐系统中的应用综述（最全）

深度学习在推荐系统中的应用综述（最全）

七月在线实验室

17+阅读 · 2018年5月5日

一文读懂推荐系统知识体系-下（评估、实战、学习资料）

一文读懂推荐系统知识体系-下（评估、实战、学习资料）

AI100

34+阅读 · 2017年11月7日

推荐系统杂谈

推荐系统杂谈

架构文摘

28+阅读 · 2017年9月15日

相关论文

From Top-1 to Top-K: A Reproducibility Study and Benchmarking of Counterfactual Explanations for Recommender Systems

Arxiv

0+阅读 · 4月21日

On the Accuracy Limits of Sequential Recommender Systems: An Entropy-Based Approach

Arxiv

0+阅读 · 4月14日

The Unreasonable Effectiveness of Data for Recommender Systems

Arxiv

0+阅读 · 4月9日

The Unreasonable Effectiveness of Data for Recommender Systems

Arxiv

0+阅读 · 4月7日

Measuring the Predictability of Recommender Systems using Structural Complexity Metrics

Arxiv

0+阅读 · 3月31日

On the Accuracy Limits of Sequential Recommender Systems: An Entropy-Based Approach

Arxiv

0+阅读 · 3月30日

Rethinking Recommendation Paradigms: From Pipelines to Agentic Recommender Systems

Arxiv

0+阅读 · 3月27日

Deep Research for Recommender Systems

Arxiv

0+阅读 · 3月8日

A Survey on Bundle Recommendation: Methods, Applications, and Challenges

Arxiv

0+阅读 · 2月26日

A Comprehensive Survey on Multimodal Recommender Systems: Taxonomy, Evaluation, and Future Directions

Arxiv

16+阅读 · 2023年2月9日

相关基金

推荐系统的信息核挖掘及其应用研究

国家自然科学基金

8+阅读 · 2015年12月31日

面向推荐系统中异构隐式反馈建模的迁移学习技术研究

国家自然科学基金

5+阅读 · 2015年12月31日

基于在线消费者购买意向挖掘的个性化推荐研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于异构信息网络的分类算法推荐方法研究

国家自然科学基金

7+阅读 · 2015年12月31日

基于领域知识和链路预测的个性化推荐研究

国家自然科学基金

4+阅读 · 2014年12月31日

在线服务信誉可比较性及其保障机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于自适应模型检测的安全协议自动建模与设计研究

国家自然科学基金

1+阅读 · 2014年12月31日

基于人眼关注度与情感分析的电子商务智能推荐计算

国家自然科学基金

0+阅读 · 2014年12月31日

基于第三方的APP软件质量度量和评估方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

具有可靠性增长的系统可靠性试验鉴定方法研究

国家自然科学基金

10+阅读 · 2013年12月31日

微信扫码咨询专知VIP会员