A Survey on Malware Detection with Graph Representation Learning - 专知论文

会员服务 ·

0

软件 · 攻击 · 基于图形 · 表示 · 综述 ·

2023 年 3 月 28 日

A Survey on Malware Detection with Graph Representation Learning

翻译：面向图表示学习的恶意软件检测综述

Tristan Bilot,Nour El Madhoun,Khaldoun Al Agha,Anis Zouaoui

from arxiv, Preprint, submitted to ACM Computing Surveys on March 2023. For any suggestions or improvements, please contact me directly by e-mail

Malware detection has become a major concern due to the increasing number and complexity of malware. Traditional detection methods based on signatures and heuristics are used for malware detection, but unfortunately, they suffer from poor generalization to unknown attacks and can be easily circumvented using obfuscation techniques. In recent years, Machine Learning (ML) and notably Deep Learning (DL) achieved impressive results in malware detection by learning useful representations from data and have become a solution preferred over traditional methods. More recently, the application of such techniques on graph-structured data has achieved state-of-the-art performance in various domains and demonstrates promising results in learning more robust representations from malware. Yet, no literature review focusing on graph-based deep learning for malware detection exists. In this survey, we provide an in-depth literature review to summarize and unify existing works under the common approaches and architectures. We notably demonstrate that Graph Neural Networks (GNNs) reach competitive results in learning robust embeddings from malware represented as expressive graph structures, leading to an efficient detection by downstream classifiers. This paper also reviews adversarial attacks that are utilized to fool graph-based detection methods. Challenges and future research directions are discussed at the end of the paper.

翻译：恶意软件检测因恶意软件数量与复杂性的增加而备受关注。基于签名和启发式的传统检测方法被用于恶意软件检测，但遗憾的是，这些方法对未知攻击的泛化能力较差，且易被混淆技术规避。近年来，机器学习（ML）尤其是深度学习（DL）通过从数据中学习有效表征，在恶意软件检测中取得了显著成果，并成为优于传统方法的解决方案。最近，此类技术在图表征数据上的应用已在多个领域达到最先进性能，并在从恶意软件中学习更鲁棒的表征方面展现出前景。然而，目前尚无聚焦于基于图的深度学习用于恶意软件检测的文献综述。本综述通过深入文献回顾，总结并统一了现有工作在常用方法与架构下的研究成果。我们特别证明了图神经网络（GNN）在从以表达性图结构表示的恶意软件中学习鲁棒嵌入方面具有竞争力，从而通过下游分类器实现高效检测。本文还综述了用于欺骗基于图检测方法的对抗攻击。最后讨论了面临的挑战与未来研究方向。

0

相关内容

软件（中国大陆及香港用语，台湾作软体，英文：Software）是一系列按照特定顺序组织的计算机数据和指令的集合。一般来讲软件被划分为编程语言、系统软件、应用软件和介于这两者之间的中间件。软件就是程序加文档的集合体。

【图机器学习进展与趋势@ICML2022】Graph Machine Learning @ ICML 2022

【图机器学习进展与趋势@ICML2022】Graph Machine Learning @ ICML 2022

专知会员服务

40+阅读 · 2022年7月25日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【中科院自动化所】深度图生成方法及应用综述，A Survey on Deep Graph Generation: Methods and Applications

【中科院自动化所】深度图生成方法及应用综述，A Survey on Deep Graph Generation: Methods and Applications

专知会员服务

24+阅读 · 2022年3月15日

【新加破南洋理工】点云的无监督表示学习综述，Unsupervised Representation Learning for Point Clouds: A Survey

【新加破南洋理工】点云的无监督表示学习综述，Unsupervised Representation Learning for Point Clouds: A Survey

专知会员服务

29+阅读 · 2022年3月2日

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

最新6篇ICLR2021篇图神经网络论文推荐

专知会员服务

57+阅读 · 2021年1月26日

最新《图神经网络知识图谱补全综述论文》A Survey on Graph Neural Networks for Knowledge Graph Completion

最新《图神经网络知识图谱补全综述论文》A Survey on Graph Neural Networks for Knowledge Graph Completion

专知会员服务

137+阅读 · 2020年7月29日

【论文推荐】深度学习中的异常实例检测:综述，Anomalous Instance Detection in Deep Learning: A Survey

【论文推荐】深度学习中的异常实例检测:综述，Anomalous Instance Detection in Deep Learning: A Survey

专知会员服务

97+阅读 · 2020年3月17日

【论文推荐】基于机器学习的5G网络异常检测，Machine Learning based Anomaly Detection for 5G Networks

【论文推荐】基于机器学习的5G网络异常检测，Machine Learning based Anomaly Detection for 5G Networks

专知会员服务

36+阅读 · 2020年3月12日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

【KDD2020-Tutorial】深度学习异常检测，180页ppt

【KDD2020-Tutorial】深度学习异常检测，180页ppt

专知

49+阅读 · 2020年8月28日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

图神经网络（Graph Neural Networks，GNN）综述

图神经网络（Graph Neural Networks，GNN）综述

极市平台

104+阅读 · 2019年11月27日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【论文】图上的表示学习综述

【论文】图上的表示学习综述

机器学习研究会

15+阅读 · 2017年9月24日

【推荐】GAN架构入门综述(资源汇总)

【推荐】GAN架构入门综述(资源汇总)

机器学习研究会

10+阅读 · 2017年9月3日

基于网络活动分析的窃密木马检测技术研究

国家自然科学基金

0+阅读 · 2015年12月31日

在线社交网络上恶意网址的实时预警

国家自然科学基金

2+阅读 · 2014年12月31日

计算可靠且可组合安全的复杂密码协议符号化分析方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

恶意软件静态分析与检测关键技术研究

国家自然科学基金

0+阅读 · 2013年12月31日

Nrf2-ARE信号通路在氢气干预新生儿坏死性小肠结肠炎中的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

自适应软件设计模式组合理论研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于API的静态插桩技术与Android平台恶意代码检测

国家自然科学基金

0+阅读 · 2012年12月31日

P2P僵尸网络检测关键技术研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于多主体认知逻辑模型检测的Web服务组合验证

国家自然科学基金

0+阅读 · 2011年12月31日

基于抽象的软件符号模型检测研究

国家自然科学基金

0+阅读 · 2011年12月31日

Survey of Malware Analysis through Control Flow Graph using Machine Learning

Arxiv

0+阅读 · 2023年5月15日

Towards Large-Scale Small Object Detection: Survey and Benchmarks

Arxiv

41+阅读 · 2022年7月28日

A Survey on Data Augmentation for Text Classification

A Survey on Data Augmentation for Text Classification

Arxiv

16+阅读 · 2021年7月7日

Graph Learning: A Survey

Arxiv

58+阅读 · 2021年5月3日

A Survey on Multi-Task Learning

Arxiv

32+阅读 · 2021年3月29日

Deep Graph Structure Learning for Robust Representations: A Survey

Arxiv

21+阅读 · 2021年3月4日

Image/Video Deep Anomaly Detection: A Survey

Arxiv

16+阅读 · 2021年3月2日

Adversarial Attacks and Defenses in Images, Graphs and Text: A Review

Adversarial Attacks and Defenses in Images, Graphs and Text: A Review

Arxiv

17+阅读 · 2019年10月9日

Deep Learning on Graphs: A Survey

Arxiv

53+阅读 · 2018年12月11日

Deep Learning for Generic Object Detection: A Survey

Deep Learning for Generic Object Detection: A Survey

Arxiv

14+阅读 · 2018年9月6日

VIP会员

文章信息

相关主题

最新内容

深入解读 Palantir AIP：全球最具争议的人工智能平台究竟如何运作

深入解读 Palantir AIP：全球最具争议的人工智能平台究竟如何运作

专知会员服务

7+阅读 · 6月20日

ICML 2026 | 多任务贝叶斯上下文学习：让 Transformer 在测试时显式适应新先验

ICML 2026 | 多任务贝叶斯上下文学习：让 Transformer 在测试时显式适应新先验

专知会员服务

4+阅读 · 6月19日

ACL 2026综述 | 大规模手语数据集：资源、基准与标注标准

ACL 2026综述 | 大规模手语数据集：资源、基准与标注标准

专知会员服务

6+阅读 · 6月19日

ICML 2026 Spotlight | SmoothSMoE：解析稀疏 MoE 路由不连续

ICML 2026 Spotlight | SmoothSMoE：解析稀疏 MoE 路由不连续

专知会员服务

6+阅读 · 6月18日

综述 | 周期表视角下的大模型推理：范式、方法与失败模式

综述 | 周期表视角下的大模型推理：范式、方法与失败模式

专知会员服务

7+阅读 · 6月18日

《廉价自杀式无人机战争的军事战略影响：乌克兰和伊朗案例研究》

《廉价自杀式无人机战争的军事战略影响：乌克兰和伊朗案例研究》

专知会员服务

11+阅读 · 6月18日

《面向反无人机作战的联邦式可解释射频–光电/红外情报融合：边缘人工智能优化、电子战韧性及分布式监视验证》

《面向反无人机作战的联邦式可解释射频–光电/红外情报融合：边缘人工智能优化、电子战韧性及分布式监视验证》

专知会员服务

11+阅读 · 6月18日

ICML 2026 | FR3D：解耦自车运动的未来动态三维重建世界模型

ICML 2026 | FR3D：解耦自车运动的未来动态三维重建世界模型

专知会员服务

7+阅读 · 6月17日

【伯克利博士论文】迈向可扩展与自我演进的大语言模型智能体

【伯克利博士论文】迈向可扩展与自我演进的大语言模型智能体

专知会员服务

11+阅读 · 6月17日

学习数据的几何：形状空间分析数学综述

学习数据的几何：形状空间分析数学综述

专知会员服务

8+阅读 · 6月17日

《现代防空系统综述：架构、传感器、拦截器及新兴威胁环境对基础设施受限防御环境的影响》2026最新长综述

《现代防空系统综述：架构、传感器、拦截器及新兴威胁环境对基础设施受限防御环境的影响》2026最新长综述

专知会员服务

18+阅读 · 6月17日

定向能反无人机系统最新发展动态

定向能反无人机系统最新发展动态

专知会员服务

9+阅读 · 6月17日

从燃煤战舰到算法战争：水面指挥的永恒要求

从燃煤战舰到算法战争：水面指挥的永恒要求

专知会员服务

6+阅读 · 6月17日

《短程弹道再入飞行器拦截时间中的一项异常现象》

《短程弹道再入飞行器拦截时间中的一项异常现象》

专知会员服务

8+阅读 · 6月17日

《基于回归方法与任务上下文的对抗环境动态战术网络报文优先级排序》

《基于回归方法与任务上下文的对抗环境动态战术网络报文优先级排序》

专知会员服务

8+阅读 · 6月17日

相关VIP内容

【图机器学习进展与趋势@ICML2022】Graph Machine Learning @ ICML 2022

【图机器学习进展与趋势@ICML2022】Graph Machine Learning @ ICML 2022

专知会员服务

40+阅读 · 2022年7月25日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【中科院自动化所】深度图生成方法及应用综述，A Survey on Deep Graph Generation: Methods and Applications

【中科院自动化所】深度图生成方法及应用综述，A Survey on Deep Graph Generation: Methods and Applications

专知会员服务

24+阅读 · 2022年3月15日

【新加破南洋理工】点云的无监督表示学习综述，Unsupervised Representation Learning for Point Clouds: A Survey

【新加破南洋理工】点云的无监督表示学习综述，Unsupervised Representation Learning for Point Clouds: A Survey

专知会员服务

29+阅读 · 2022年3月2日

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

最新6篇ICLR2021篇图神经网络论文推荐

专知会员服务

57+阅读 · 2021年1月26日

最新《图神经网络知识图谱补全综述论文》A Survey on Graph Neural Networks for Knowledge Graph Completion

最新《图神经网络知识图谱补全综述论文》A Survey on Graph Neural Networks for Knowledge Graph Completion

专知会员服务

137+阅读 · 2020年7月29日

【论文推荐】深度学习中的异常实例检测:综述，Anomalous Instance Detection in Deep Learning: A Survey

【论文推荐】深度学习中的异常实例检测:综述，Anomalous Instance Detection in Deep Learning: A Survey

专知会员服务

97+阅读 · 2020年3月17日

【论文推荐】基于机器学习的5G网络异常检测，Machine Learning based Anomaly Detection for 5G Networks

【论文推荐】基于机器学习的5G网络异常检测，Machine Learning based Anomaly Detection for 5G Networks

专知会员服务

36+阅读 · 2020年3月12日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

热门VIP内容

开通专知VIP会员享更多权益服务

深入解读 Palantir AIP：全球最具争议的人工智能平台究竟如何运作

ICML 2026 | 多任务贝叶斯上下文学习：让 Transformer 在测试时显式适应新先验

ACL 2026综述 | 大规模手语数据集：资源、基准与标注标准

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

【KDD2020-Tutorial】深度学习异常检测，180页ppt

【KDD2020-Tutorial】深度学习异常检测，180页ppt

专知

49+阅读 · 2020年8月28日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

图神经网络（Graph Neural Networks，GNN）综述

图神经网络（Graph Neural Networks，GNN）综述

极市平台

104+阅读 · 2019年11月27日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【论文】图上的表示学习综述

【论文】图上的表示学习综述

机器学习研究会

15+阅读 · 2017年9月24日

【推荐】GAN架构入门综述(资源汇总)

【推荐】GAN架构入门综述(资源汇总)

机器学习研究会

10+阅读 · 2017年9月3日

相关论文

Survey of Malware Analysis through Control Flow Graph using Machine Learning

Arxiv

0+阅读 · 2023年5月15日

Towards Large-Scale Small Object Detection: Survey and Benchmarks

Arxiv

41+阅读 · 2022年7月28日

A Survey on Data Augmentation for Text Classification

A Survey on Data Augmentation for Text Classification

Arxiv

16+阅读 · 2021年7月7日

Graph Learning: A Survey

Arxiv

58+阅读 · 2021年5月3日

A Survey on Multi-Task Learning

Arxiv

32+阅读 · 2021年3月29日

Deep Graph Structure Learning for Robust Representations: A Survey

Arxiv

21+阅读 · 2021年3月4日

Image/Video Deep Anomaly Detection: A Survey

Arxiv

16+阅读 · 2021年3月2日

Adversarial Attacks and Defenses in Images, Graphs and Text: A Review

Adversarial Attacks and Defenses in Images, Graphs and Text: A Review

Arxiv

17+阅读 · 2019年10月9日

Deep Learning on Graphs: A Survey

Arxiv

53+阅读 · 2018年12月11日

Deep Learning for Generic Object Detection: A Survey

Deep Learning for Generic Object Detection: A Survey

Arxiv

14+阅读 · 2018年9月6日

相关基金

基于网络活动分析的窃密木马检测技术研究

国家自然科学基金

0+阅读 · 2015年12月31日

在线社交网络上恶意网址的实时预警

国家自然科学基金

2+阅读 · 2014年12月31日

计算可靠且可组合安全的复杂密码协议符号化分析方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

恶意软件静态分析与检测关键技术研究

国家自然科学基金

0+阅读 · 2013年12月31日

Nrf2-ARE信号通路在氢气干预新生儿坏死性小肠结肠炎中的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

自适应软件设计模式组合理论研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于API的静态插桩技术与Android平台恶意代码检测

国家自然科学基金

0+阅读 · 2012年12月31日

P2P僵尸网络检测关键技术研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于多主体认知逻辑模型检测的Web服务组合验证

国家自然科学基金

0+阅读 · 2011年12月31日

基于抽象的软件符号模型检测研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员