Click-through rate (CTR) prediction is one of the fundamental tasks for online advertising and recommendation. While multi-layer perceptron (MLP) serves as a core component in many deep CTR prediction models, it has been widely recognized that applying a vanilla MLP network alone is inefficient in learning multiplicative feature interactions. As such, many two-stream interaction models (e.g., DeepFM and DCN) have been proposed by integrating an MLP network with another dedicated network for enhanced CTR prediction. As the MLP stream learns feature interactions implicitly, existing research focuses mainly on enhancing explicit feature interactions in the complementary stream. In contrast, our empirical study shows that a well-tuned two-stream MLP model that simply combines two MLPs can even achieve surprisingly good performance, which has never been reported before by existing work. Based on this observation, we further propose feature selection and interaction aggregation layers that can be easily plugged to make an enhanced two-stream MLP model, FinalMLP. In this way, it not only enables differentiated feature inputs but also effectively fuses stream-level interactions across two streams. Our evaluation results on four open benchmark datasets as well as an online A/B test in our industrial system show that FinalMLP achieves better performance than many sophisticated two-stream CTR models. Our source code will be available at MindSpore/models and FuxiCTR/model_zoo.
翻译:点击率(CTR)预测是在线广告与推荐的基础任务之一。虽然多层感知机(MLP)是许多深度CTR预测模型的核心组件,但普遍认为单独使用原始MLP网络在学习乘法特征交互方面效率较低。为此,研究者提出了诸多双流交互模型(如DeepFM和DCN),通过将MLP网络与另一专用网络集成以增强CTR预测。由于MLP流隐式学习特征交互,现有研究主要侧重于增强互补流中的显式特征交互。然而,我们的实证研究表明,一个经过良好调优的、仅简单组合两个MLP的双流MLP模型竟能取得令人惊讶的优异性能——这一现象此前从未被文献报道过。基于此发现,我们进一步提出可即插即用的特征选择与交互聚合层,构建出增强型双流MLP模型FinalMLP。该模型不仅能够实现差异化的特征输入,还能有效融合双流间的流级交互。在四个公开基准数据集及工业系统的在线A/B测试中,FinalMLP的性能优于诸多复杂的双流CTR模型。我们的源代码将在MindSpore/models和FuxiCTR/model_zoo中开放。