Bayesian optimization (BO) is a principled approach to molecular design tasks. In this paper we explain three pitfalls of BO which can cause poor empirical performance: an incorrect prior width, over-smoothing, and inadequate acquisition function maximization. We show that with these issues addressed, even a basic BO setup is able to achieve the highest overall performance on the PMO benchmark for molecule design (Gao et al, 2022). These results suggest that BO may benefit from more attention in the machine learning for molecules community.
翻译:贝叶斯优化(BO)是一种基于原理的分子设计方法。本文阐述了导致BO实证性能不佳的三个常见问题:先验宽度设置不当、过度平滑现象以及采集函数最大化不足。研究表明,在解决这些问题后,即使是基础BO框架也能在PMO分子设计基准测试(Gao等人,2022)中取得最佳综合性能。这些结果表明,BO方法值得分子机器学习领域给予更多关注。