Network diffusion models are used to study phenomena such as disease transmission, information spread, and technology adoption. However, small amounts of mismeasurement are extremely likely in the networks constructed to operationalize these models. We show that estimates of diffusion are highly non-robust to this measurement error. First, we show that even when measurement error is vanishingly small, such that the share of missed links is close to zero, forecasts about the extent of diffusion will greatly underestimate the truth. Second, a small mismeasurement in the identity of the initial seed generates a large shift in the location of the expected diffusion path. We show that both of these results still hold when the vanishing measurement error is only local in nature. Such non-robustness in forecasting exists even under conditions where the basic reproductive number is consistently estimable. Possible solutions, such as estimating the measurement error or implementing widespread detection efforts, still face difficulties because the number of missed links is so small. Finally, we conduct Monte Carlo simulations on simulated networks and on real networks from three settings: travel data from the COVID-19 pandemic in the western US, a mobile phone marketing campaign in rural India, and an insurance experiment in China.
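To make the first result concrete, the following is a minimal illustrative sketch, not the simulation design used in the paper. It assumes a ring lattice as the observed network, a handful of random long-range links as the missed links, and a simple susceptible-infected process with per-period transmission probability `p`. The function names, network sizes, and parameter values are all hypothetical choices made for illustration.

```python
import random


def ring_lattice(n, k=1):
    """Adjacency sets for a ring in which each node links to its k nearest
    neighbours on each side. This plays the role of the observed network."""
    adj = {i: set() for i in range(n)}
    for i in range(n):
        for d in range(1, k + 1):
            j = (i + d) % n
            adj[i].add(j)
            adj[j].add(i)
    return adj


def add_random_links(adj, m, rng):
    """Return a copy of adj with m extra random links added.
    These stand in for the (hypothetical) links the observer missed."""
    n = len(adj)
    true_adj = {i: set(nbrs) for i, nbrs in adj.items()}
    added = 0
    while added < m:
        i, j = rng.randrange(n), rng.randrange(n)
        if i != j and j not in true_adj[i]:
            true_adj[i].add(j)
            true_adj[j].add(i)
            added += 1
    return true_adj


def simulate_reach(adj, seed_node, p, periods, rng):
    """Run a susceptible-infected process: in each period, every infected node
    infects each susceptible neighbour independently with probability p.
    Returns the number of infected nodes after the final period."""
    infected = {seed_node}
    for _ in range(periods):
        newly = set()
        for i in infected:
            for j in adj[i]:
                if j not in infected and rng.random() < p:
                    newly.add(j)
        infected |= newly
    return len(infected)


def mean_reach(adj, seed_node, p, periods, runs, rng):
    """Monte Carlo average of the diffusion reach over independent runs."""
    total = sum(simulate_reach(adj, seed_node, p, periods, rng) for _ in range(runs))
    return total / runs


if __name__ == "__main__":
    rng = random.Random(0)
    observed = ring_lattice(2000, k=2)           # 4000 observed links
    true_net = add_random_links(observed, 20, rng)  # 20 missed links (0.5%)
    forecast = mean_reach(observed, 0, p=0.5, periods=30, runs=50, rng=rng)
    actual = mean_reach(true_net, 0, p=0.5, periods=30, runs=50, rng=rng)
    print(f"forecast from observed network: {forecast:.1f}")
    print(f"reach on true network:          {actual:.1f}")
```

Because the observed links are a subset of the true links, a forecast built on the observed network can only miss transmission routes, never add them. How large the resulting gap is depends on the assumed network, the number of missed links, and the time horizon, so the output of this sketch should not be read as a reproduction of the paper's results.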