We investigate trade-offs in static and dynamic evaluation of hierarchical queries with arbitrary free variables. In the static setting, the trade-off is between the time to partially compute the query result and the delay needed to enumerate its tuples. In the dynamic setting, we additionally consider the time needed to update the query result under single-tuple inserts or deletes to the database. Our approach observes the degree of values in the database and uses different computation and maintenance strategies for high-degree (heavy) and low-degree (light) values. For the latter it partially computes the result, while for the former it computes enough information to allow for on-the-fly enumeration. We define the preprocessing time, the update time, and the enumeration delay as functions of the light/heavy threshold. By appropriately choosing this threshold, our approach recovers a number of prior results when restricted to hierarchical queries. We show that for a restricted class of hierarchical queries, our approach achieves worst-case optimal update time and enumeration delay conditioned on the Online Matrix-Vector Multiplication Conjecture.
翻译:我们研究了具有任意自由变量的层次查询在静态与动态评估中的权衡。在静态场景下,权衡体现在查询结果的部分计算时间与元组枚举所需延迟之间。在动态场景中,我们额外考虑了数据库单条元组插入或删除操作下更新查询结果所需的时间。该方法观测数据库中值的度数,对高度数(重值)与低度数(轻值)采用不同的计算与维护策略。针对后者采用部分计算结果的方式,而前者则计算足够信息以实现即时枚举。我们将预处理时间、更新时间及枚举延迟定义为轻重值阈值的函数。通过合理选择该阈值,我们的方法在限定于层次查询时能复现多项先前研究结果。我们证明,对于受限的层次查询类别,基于在线矩阵-向量乘法猜想,该方法可实现最坏情况下的最优更新时间与枚举延迟。