In a context of malicious software detection, machine learning (ML) is widely used to generalize to new malware. However, it has been demonstrated that ML models can be fooled or may have generalization problems on malware that has never been seen. We investigate the possible benefits of quantum algorithms for classification tasks. We implement two models of Quantum Machine Learning algorithms, and we compare them to classical models for the classification of a dataset composed of malicious and benign executable files. We try to optimize our algorithms based on methods found in the literature, and analyze our results in an exploratory way, to identify the most interesting directions to explore for the future.
翻译:在恶意软件检测的背景下,机器学习(ML)被广泛用于对新型恶意软件进行泛化。然而,已有研究表明,机器学习模型可能被欺骗,或在针对未见过的恶意软件时出现泛化问题。我们探索了量子算法在分类任务中的潜在优势。我们实现了两种量子机器学习算法模型,并将其与经典模型在包含恶意与良性可执行文件的数据集分类任务上进行了比较。我们尝试基于文献中的方法优化算法,并以探索性方式分析结果,以确定未来最值得研究的方向。