Operationalizing Property-Based Testing for Data-Intensive Scalable Computing Systems

While fuzzing effectively catches crashes, its shallow oracles often miss semantic drifts and optimization-related errors in data-intensive scalable computing (DISC) frameworks. Property-based testing (PBT) addresses this limitation by checking general semantic invariants across diverse workloads and inputs, rather than relying on specific expected outputs. However, systematically operationalizing PBT for DISC systems remains difficult because it requires both reusable property definitions and effective instantiation into valid workloads and data. We present DiscPBT, a property-based testing engine for Apache Spark. DiscPBT introduces eight reusable meta-properties for DISC semantic testing, spanning equivalence rewriting, data decomposition, computation decomposition, and operator-local semantic relations. To operationalize these meta-properties, DiscPBT provides reusable generators for synthesizing valid workload skeletons and input data, together with an instantiation framework that realizes each meta-property in schema-compatible contexts through compatible operators, expressions, and UDFs. Our evaluation on PySpark shows that DiscPBT achieves 1.2$\times$ higher branch coverage and 1153$\times$ greater plan diversity than CometFuzz. Across 66 concrete properties, DiscPBT reveals cross-version semantic drift as well as subtle corner-case pitfalls involving NaN and empty inputs, that are not captured by crash-based fuzzing alone. These results demonstrate the value of systematic PBT for uncovering semantic issues in DISC frameworks.

翻译：尽管模糊测试能有效捕获程序崩溃，但其浅层判断标准常忽略数据密集型可扩展计算（DISC）框架中的语义漂移及优化相关错误。属性化测试通过检验跨多样化工作负载与输入的通用语义不变性，而非依赖特定预期输出，弥补了这一局限。然而，为DISC系统系统化地实践属性化测试仍面临挑战，因为这既需要可复用的属性定义，又需将其有效实例化为合法的工作负载与数据。本文提出DiscPBT——针对Apache Spark的属性化测试引擎。DiscPBT引入八种面向DISC语义测试的可复用元属性，涵盖等价重写、数据分解、计算分解及算子局部语义关系。为实践这些元属性，DiscPBT提供可复用的生成器用于合成合法的工作骨架与输入数据，并构建实例化框架，通过兼容的算子、表达式及UDF将各元属性在模式兼容的上下文中实现。我们在PySpark上的评估表明，DiscPBT比CometFuzz实现1.2倍的分支覆盖率和1153倍的查询计划多样性。在66个具体属性测试中，DiscPBT揭示了跨版本语义漂移以及涉及NaN和空输入等细粒度边界案例缺陷——这些无法单独基于崩溃的模糊测试捕获。实验结果证明了系统化属性化测试在揭示DISC框架语义问题中的价值。

相关内容

属性

关注 2

一个具体事物，总是有许许多多的性质与关系，我们把一个事物的性质与关系，都叫作事物的属性。事物与属性是不可分的，事物都是有属性的事物，属性也都是事物的属性。一个事物与另一个事物的相同或相异，也就是一个事物的属性与另一事物的属性的相同或相异。由于事物属性的相同或相异，客观世界中就形成了许多不同的事物类。具有相同属性的事物就形成一类，具有不同属性的事物就分别地形成不同的类。

《利用模型系统工程软件测试技术增强作战飞行试验研究》最新213页

专知会员服务

26+阅读 · 2025年9月13日

重新审视测试时扩展：一项综述与面向多样性的高效推理方法

专知会员服务

10+阅读 · 2025年6月8日

【NeurIPS2024】面向视觉-语言模型测试时泛化的双原型演化

专知会员服务

18+阅读 · 2024年10月17日

【普林斯顿博士论文】深度学习优化的隐性偏差：数学考察，391页pdf

专知会员服务

29+阅读 · 2024年10月4日