We consider sequential treatment regimes where each unit is exposed to combinations of interventions over time. When interventions are described by qualitative labels, such as ``close schools for a month due to a pandemic'' or ``promote this podcast to this user during this week'', it is unclear which appropriate structural assumptions allow us to generalize behavioral predictions to previously unseen combinatorial sequences. Standard black-box approaches mapping sequences of categorical variables to outputs are applicable, but they rely on poorly understood assumptions on how reliable generalization can be obtained, and may underperform under sparse sequences, temporal variability, and large action spaces. To approach that, we pose an explicit model for \emph{composition}, that is, how the effect of sequential interventions can be isolated into modules, clarifying which data conditions allow for the identification of their combined effect at different units and time steps. We show the identification properties of our compositional model, inspired by advances in causal matrix factorization methods but focusing on predictive models for novel compositions of interventions instead of matrix completion tasks and causal effect estimation. We compare our approach to flexible but generic black-box models to illustrate how structure aids prediction in sparse data conditions.
翻译:我们考虑序列处理策略,其中每个单元随时间暴露于多种干预的组合。当干预措施由定性标签描述时,例如“因疫情关闭学校一个月”或“本周向该用户推广此播客”,目前尚不清楚何种适当的结构假设能让我们将行为预测推广到先前未见过的组合序列。虽然可将分类变量序列映射到输出的标准黑盒方法适用,但这些方法依赖于对如何获得可靠泛化的理解不足的假设,并且在稀疏序列、时间变异性和大动作空间下可能表现不佳。为此,我们为“组合”提出了一个显式模型,即如何将序列干预的效果分离为模块,从而阐明哪些数据条件允许识别它们在不同单元和时间步上的组合效应。我们展示了我们组合模型的识别特性,该模型受到因果矩阵分解方法进展的启发,但专注于针对干预新组合的预测模型,而非矩阵补全任务和因果效应估计。我们将我们的方法与灵活但通用的黑盒模型进行比较,以说明结构如何在稀疏数据条件下辅助预测。