Distributed System Fuzzing

Grey-box fuzzing is the lightweight approach of choice for finding bugs in sequential programs. It provides a balance between efficiency and effectiveness by conducting a biased random search over the domain of program inputs using a feedback function from observed test executions. For distributed system testing, however, the state-of-practice is represented today by only black-box tools that do not attempt to infer and exploit any knowledge of the system's past behaviours to guide the search for bugs. In this work, we present Mallory: the first framework for grey-box fuzz-testing of distributed systems. Unlike popular black-box distributed system fuzzers, such as Jepsen, that search for bugs by randomly injecting network partitions and node faults or by following human-defined schedules, Mallory is adaptive. It exercises a novel metric to learn how to maximize the number of observed system behaviors by choosing different sequences of faults, thus increasing the likelihood of finding new bugs. The key enablers for our approach are the new ideas of timeline-driven testing and timeline abstraction that provide the feedback function guiding a biased random search for failures. Mallory dynamically constructs Lamport timelines of the system behaviour, abstracts these timelines into happens-before summaries, and introduces faults guided by its real-time observation of the summaries. We have evaluated Mallory on a diverse set of widely-used industrial distributed systems. Compared to the start-of-the-art black-box fuzzer Jepsen, Mallory explores more behaviours and takes less time to find bugs. Mallory discovered 22 zero-day bugs (of which 18 were confirmed by developers), including 10 new vulnerabilities, in rigorously-tested distributed systems such as Braft, Dqlite, and Redis. 6 new CVEs have been assigned.

翻译：灰盒模糊测试是用于发现顺序程序中漏洞的轻量级首选方法。它通过利用从测试执行观察中获得的反馈函数，对程序输入域进行有偏随机搜索，从而在效率与有效性之间取得平衡。然而，在当前分布式系统测试实践中，主流方法仍仅局限于黑盒工具，这些工具不会尝试推断和利用系统历史行为的知识来引导漏洞搜索。本文提出Mallory：首个面向分布式系统的灰盒模糊测试框架。与Jepsen等主流黑盒分布式系统模糊测试工具（这类工具通过随机注入网络分区和节点故障，或遵循人工定义的时间表来搜索漏洞）不同，Mallory具有自适应性。它运用一种新型度量标准，通过选择不同的故障序列来学习如何最大化观测到的系统行为数量，从而提升发现新漏洞的概率。本方法的关键支撑在于时间线驱动测试和时间线抽象这两个创新概念：前者提供引导有偏随机搜索故障的反馈函数，后者则动态构建系统行为的Lamport时间线，将其抽象为"发生先于"摘要，并依据对这些摘要的实时观测结果来注入故障。我们在多种广泛应用的工业级分布式系统上评估了Mallory。与当前最先进的黑盒模糊测试工具Jepsen相比，Mallory能探索更多系统行为，且发现漏洞所需时间更短。在Braft、Dqlite、Redis等经过严格测试的分布式系统中，Mallory共发现22个零日漏洞（其中18个已获开发者确认），包括10个新安全漏洞，并已分配6个全新CVE编号。

相关内容

黑盒

关注 1

在科学，计算和工程学中，黑盒是一种设备，系统或对象，可以根据其输入和输出（或传输特性）对其进行查看，而无需对其内部工作有任何了解。它的实现是“不透明的”（黑色）。几乎任何事物都可以被称为黑盒：晶体管，引擎，算法，人脑，机构或政府。为了使用典型的“黑匣子方法”来分析建模为开放系统的事物，仅考虑刺激/响应的行为，以推断（未知）盒子。该黑匣子系统的通常表示形式是在该方框中居中的数据流程图。黑盒的对立面是一个内部组件或逻辑可用于检查的系统，通常将其称为白盒（有时也称为“透明盒”或“玻璃盒”）。

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

105+阅读 · 2022年2月10日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日