On the Effectiveness of Adversarial Samples against Ensemble Learning-based Windows PE Malware Detectors

Recently, there has been a growing focus and interest in applying machine learning (ML) to the field of cybersecurity, particularly in malware detection and prevention. Several research works on malware analysis have been proposed, offering promising results for both academic and practical applications. In these works, the use of Generative Adversarial Networks (GANs) or Reinforcement Learning (RL) can aid malware creators in crafting metamorphic malware that evades antivirus software. In this study, we propose a mutation system to counteract ensemble learning-based detectors by combining GANs and an RL model, overcoming the limitations of the MalGAN model. Our proposed FeaGAN model is built based on MalGAN by incorporating an RL model called the Deep Q-network anti-malware Engines Attacking Framework (DQEAF). The RL model addresses three key challenges in performing adversarial attacks on Windows Portable Executable malware, including format preservation, executability preservation, and maliciousness preservation. In the FeaGAN model, ensemble learning is utilized to enhance the malware detector's evasion ability, with the generated adversarial patterns. The experimental results demonstrate that 100\% of the selected mutant samples preserve the format of executable files, while certain successes in both executability preservation and maliciousness preservation are achieved, reaching a stable success rate.

翻译：近年来，将机器学习（ML）应用于网络安全领域，特别是恶意软件检测与防御方面，日益受到关注和重视。已有多项关于恶意软件分析的研究工作被提出，在学术和实际应用中都取得了令人鼓舞的成果。在这些工作中，生成对抗网络（GAN）或强化学习（RL）的使用能够帮助恶意软件制作者制造出可逃避反病毒软件的变体恶意软件。本研究中，我们提出了一种结合GAN和RL模型的变异系统，以对抗基于集成学习的检测器，从而克服了MalGAN模型的局限性。我们提出的FeaGAN模型基于MalGAN构建，并整合了一个名为深度Q网络反恶意软件引擎攻击框架（DQEAF）的RL模型。该RL模型解决了对Windows可移植可执行文件进行对抗攻击时面临的三个关键挑战：格式保持、可执行性保持以及恶意性保持。在FeaGAN模型中，通过利用集成学习，使用生成的对抗模式来增强恶意软件检测器的逃逸能力。实验结果表明，100%选定的变异样本能够保持可执行文件的格式，同时在可执行性保持和恶意性保持方面也取得了一定的成功，达到了稳定的成功率。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日