AutoML4ETC: Automated Neural Architecture Search for Real-World Encrypted Traffic Classification

Deep learning (DL) has been successfully applied to encrypted network traffic classification in experimental settings. However, in production use, it has been shown that a DL classifier's performance inevitably decays over time. Re-training the model on newer datasets has been shown to only partially improve its performance. Manually re-tuning the model architecture to meet the performance expectations on newer datasets is time-consuming and requires domain expertise. We propose AutoML4ETC, a novel tool to automatically design efficient and high-performing neural architectures for encrypted traffic classification. We define a novel, powerful search space tailored specifically for the early classification of encrypted traffic using packet header bytes. We show that with different search strategies over our search space, AutoML4ETC generates neural architectures that outperform the state-of-the-art encrypted traffic classifiers on several datasets, including public benchmark datasets and real-world TLS and QUIC traffic collected from the Orange mobile network. In addition to being more accurate, AutoML4ETC's architectures are significantly more efficient and lighter in terms of the number of parameters. Finally, we make AutoML4ETC publicly available for future research.

翻译：深度学习（DL）在实验场景中已成功应用于加密网络流量分类。然而，在实际生产环境中，DL分类器的性能会不可避免地随时间衰减。研究表明，仅在新数据集上重新训练模型只能部分改善其性能。针对新数据集手动调整模型架构以满足性能预期既耗时又需要领域专业知识。我们提出AutoML4ETC这一新型工具，可自动设计高效且性能优越的神经架构用于加密流量分类。我们针对基于数据包头字节的加密流量早期分类任务，定义了一个新颖且功能强大的搜索空间。实验表明，通过在该搜索空间上采用不同搜索策略，AutoML4ETC生成的神经架构在多个数据集（包括公开基准数据集以及从Orange移动网络采集的真实世界TLS和QUIC流量）上均优于现有最先进的加密流量分类器。除精度更高外，AutoML4ETC生成的架构在参数数量上显著更高效、更轻量。最后，我们将AutoML4ETC公开供未来研究使用。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

专知会员服务

55+阅读 · 2020年3月8日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日