New Perspectives on the Evaluation of Link Prediction Algorithms for Dynamic Graphs

There is a fast-growing body of research on predicting future links in dynamic networks, with many new algorithms. Some benchmark data exists, and performance evaluations commonly rely on comparing the scores of observed network events (positives) with those of randomly generated ones (negatives). These evaluation measures depend on both the predictive ability of the model and, crucially, the type of negative samples used. Besides, as is generally the case with temporal data, prediction quality may vary over time. This creates a complex evaluation space. In this work, we catalog the possibilities for negative sampling and introduce novel visualization methods that can yield insight into prediction performance and the dynamics of temporal networks. We leverage these visualization tools to investigate the effect of negative sampling on predictive performance, at the node and edge level. We validate empirically, on datasets extracted from recent benchmarks, that the error is typically not evenly distributed across different data segments. Finally, we argue that such visualization tools can serve as powerful guides to evaluate dynamic link prediction methods at different levels.
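To make the dependence on negative samples concrete, the following is a minimal sketch and not the paper's method. It scores a toy "memorization" model under two negative-sampling strategies. The function names, the toy data, and the "historical" strategy (negatives drawn from previously seen edges) are assumptions introduced for illustration; the abstract does not list the strategies it catalogs.

```python
import random

def random_negatives(events, nodes, rng):
    """For each positive (src, dst, t), keep src and t and draw a random other destination."""
    negatives = []
    for src, dst, t in events:
        candidates = [n for n in nodes if n != src and n != dst]
        negatives.append((src, rng.choice(candidates), t))
    return negatives

def historical_negatives(events, train_edges, rng):
    """For each positive at time t, draw a training edge that is not observed at time t."""
    negatives = []
    for _, _, t in events:
        current = {(s, d) for s, d, tt in events if tt == t}
        pool = sorted(e for e in train_edges if e not in current)
        s, d = rng.choice(pool)
        negatives.append((s, d, t))
    return negatives

def auc(pos_scores, neg_scores):
    """Probability that a positive outscores a negative; ties count as 0.5."""
    wins = sum((p > n) + 0.5 * (p == n) for p in pos_scores for n in neg_scores)
    return wins / (len(pos_scores) * len(neg_scores))

# Hypothetical toy data: directed edges seen in training and timestamped test events.
nodes = list(range(6))
train_edges = {(0, 1), (1, 2), (2, 3), (3, 4)}
test_events = [(0, 1, 10), (1, 2, 11), (4, 5, 12)]

def score(src, dst, t):
    """Toy model: predicts 1.0 for any edge it has already seen, else 0.0."""
    return 1.0 if (src, dst) in train_edges else 0.0

rng = random.Random(0)
pos = [score(*e) for e in test_events]
neg_random = [score(*e) for e in random_negatives(test_events, nodes, rng)]
neg_hist = [score(*e) for e in historical_negatives(test_events, train_edges, rng)]

auc_random = auc(pos, neg_random)
auc_hist = auc(pos, neg_hist)
print(f"AUC with random negatives:     {auc_random:.3f}")
print(f"AUC with historical negatives: {auc_hist:.3f}")
```

On this toy data the same model looks strong against random negatives and weak against historical ones, because memorizing past edges is enough to reject a random pair but not a previously seen one.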

Link Prediction

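Link prediction in networks is the task of using known information, such as the network's nodes and structure, to estimate the likelihood that a link will form between two nodes that are not yet connected. It covers both the prediction of links that exist but are not yet known and the prediction of future links. Research on this problem is significant and valuable in both theory and application.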
