Augmenting Greybox Fuzzing with Generative AI

Real-world programs expecting structured inputs often has a format-parsing stage gating the deeper program space. Neither a mutation-based approach nor a generative approach can provide a solution that is effective and scalable. Large language models (LLM) pre-trained with an enormous amount of natural language corpus have proved to be effective for understanding the implicit format syntax and generating format-conforming inputs. In this paper, propose ChatFuzz, a greybox fuzzer augmented by generative AI. More specifically, we pick a seed in the fuzzer's seed pool and prompt ChatGPT generative models to variations, which are more likely to be format-conforming and thus of high quality. We conduct extensive experiments to explore the best practice for harvesting the power of generative LLM models. The experiment results show that our approach improves the edge coverage by 12.77\% over the SOTA greybox fuzzer (AFL++) on 12 target programs from three well-tested benchmarks. As for vulnerability detection, \sys is able to perform similar to or better than AFL++ for programs with explicit syntax rules but not for programs with non-trivial syntax.

翻译：现实世界中期望结构化输入的程序通常具有一个格式解析阶段，这会阻碍对更深层程序空间的探索。基于变异的方法和生成式方法均无法提供既有效又可扩展的解决方案。通过海量自然语言语料库预训练的大型语言模型（LLM）已被证明能够有效理解隐式格式语法并生成符合格式的输入。本文提出ChatFuzz——一种由生成式人工智能增强的灰盒模糊测试工具。具体而言，我们从模糊测试工具的种子池中选取一个种子，并提示ChatGPT生成式模型产生变异，这些变异更可能符合格式要求，因此具有高质量。我们进行了大量实验，以探索利用生成式LLM模型的最佳实践。实验结果表明，在三个经过充分测试的基准测试中的12个目标程序上，我们的方法相较于最先进的灰盒模糊测试工具（AFL++）将边覆盖率提升了12.77%。在漏洞检测方面，对于具有显式语法规则的程序，ChatFuzz的表现与AFL++相当或更优；但对于具有非平凡语法的程序，其性能则有限。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日