Rep3Net: An Approach Exploiting Multimodal Representation for Molecular Bioactivity Prediction

Accurate prediction of compound potency accelerates early-stage drug discovery by prioritizing candidates for experimental testing. However, many quantitative structure-activity relationship (QSAR) approaches to this task are constrained by their choice of molecular representation: handcrafted descriptors capture global properties but miss local topology, graph neural networks encode structure but often lack broader chemical context, and SMILES-based language models provide contextual patterns learned from large corpora but are seldom combined with structural features. To exploit these complementary signals, we introduce Rep3Net, a unified multimodal architecture that fuses RDKit molecular descriptors, graph-derived features from a residual graph-convolutional backbone, and ChemBERTa SMILES embeddings. We evaluate Rep3Net on a curated ChEMBL subset for human PARP1 using five-fold cross-validation. Rep3Net attains an MSE of $0.83\pm0.06$, an RMSE of $0.91\pm0.03$, an $R^{2}$ of $0.43\pm0.01$, and Pearson and Spearman correlations of $0.66\pm0.01$ and $0.67\pm0.01$, respectively, substantially improving over several strong GNN baselines. In addition, Rep3Net achieves a favorable latency-to-parameter trade-off thanks to a single-layer GCN backbone and parallel frozen encoders. Ablations show that graph topology, ChemBERTa semantics, and handcrafted descriptors each contribute complementary information, with full fusion providing the largest error reduction. These results demonstrate that multimodal representation fusion can improve potency prediction for PARP1 and provide a scalable framework for virtual screening in early-stage drug discovery.
