通过斯托卡迭热性目标增强改进分子设计 (Improving Molecular Design by Stochastic Iterative Target Augmentation) - 专知论文

会员服务 ·

0

生成模型 · MoDELS · 预测器/决策函数 · SimPLe · 重赋权 ·

2021 年 8 月 15 日

Improving Molecular Design by Stochastic Iterative Target Augmentation

翻译：通过斯托卡迭热性目标增强改进分子设计

Kevin Yang,Wengong Jin,Kyle Swanson,Regina Barzilay,Tommi Jaakkola

from arxiv, ICML 2020

Generative models in molecular design tend to be richly parameterized, data-hungry neural models, as they must create complex structured objects as outputs. Estimating such models from data may be challenging due to the lack of sufficient training data. In this paper, we propose a surprisingly effective self-training approach for iteratively creating additional molecular targets. We first pre-train the generative model together with a simple property predictor. The property predictor is then used as a likelihood model for filtering candidate structures from the generative model. Additional targets are iteratively produced and used in the course of stochastic EM iterations to maximize the log-likelihood that the candidate structures are accepted. A simple rejection (re-weighting) sampler suffices to draw posterior samples since the generative model is already reasonable after pre-training. We demonstrate significant gains over strong baselines for both unconditional and conditional molecular design. In particular, our approach outperforms the previous state-of-the-art in conditional molecular design by over 10% in absolute gain. Finally, we show that our approach is useful in other domains as well, such as program synthesis.

翻译：分子设计中的生成模型往往具有丰富的参数、数据饥饿的神经模型,因为它们必须创造出复杂的结构物体作为产出。从数据中估算这些模型可能由于缺乏足够的培训数据而具有挑战性。在本文件中,我们建议为迭代创建更多的分子目标采取惊人有效的自我培训方法。我们首先先将基因模型与简单的属性预测器一起进行基因测试。然后,财产预测器作为从基因模型中过滤候选结构的可能模型使用。在随机电离层过程中,还反复生成和使用更多的目标,以最大限度地扩大候选结构被接受的对日志相似性。一个简单的拒绝(重新加权)取样器足以绘制外表样,因为基因模型在培训前已经很合理。我们在无条件和有条件的分子设计方面都展示了强大的基线所取得的巨大收益。特别是,我们的方法超越了在有条件分子设计中的先前状态,通过10%以上获得绝对收益。最后,我们表明我们的方法在其它领域也是有用的,例如合成程序。

0

相关内容

生成模型

在机器学习中，生成模型可以用来直接对数据建模（例如根据某个变量的概率密度函数进行数据采样），也可以用来建立变量间的条件概率分布。条件概率分布可以由生成模型根据贝叶斯定理形成。

人工智能的理论及实践知识图谱，160页pdf

人工智能的理论及实践知识图谱，160页pdf

专知会员服务

104+阅读 · 2021年6月30日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【Google】平滑对抗训练，Smooth Adversarial Training

【Google】平滑对抗训练，Smooth Adversarial Training

专知会员服务

49+阅读 · 2020年7月4日

知识图谱推理，50页ppt，Salesforce首席科学家Richard Socher

知识图谱推理，50页ppt，Salesforce首席科学家Richard Socher

专知会员服务

111+阅读 · 2020年6月10日

【论文推荐】Stochastic Graph Neural Networks，随机图神经网络

【论文推荐】Stochastic Graph Neural Networks，随机图神经网络

专知会员服务

69+阅读 · 2020年6月6日

【CVPR2020-台大】透视眼：学会透过障碍物看东西，Learning to See Through Obstructions

【CVPR2020-台大】透视眼：学会透过障碍物看东西，Learning to See Through Obstructions

专知会员服务

27+阅读 · 2020年4月3日

【MIT】图神经网络的泛化与表示极限，《Generalization and Representational Limits of Graph Neural Networks》

【MIT】图神经网络的泛化与表示极限，《Generalization and Representational Limits of Graph Neural Networks》

专知会员服务

46+阅读 · 2020年2月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Training Deep Networks from Zero to Hero: avoiding pitfalls and going beyond

Arxiv

0+阅读 · 2021年10月13日

Expectigrad: Fast Stochastic Optimization with Robust Convergence Properties

Arxiv

0+阅读 · 2021年10月12日

Adapting Stepsizes by Momentumized Gradients Improves Optimization and Generalization

Arxiv

0+阅读 · 2021年10月10日

IH-GAN: A Conditional Generative Model for Implicit Surface-Based Inverse Design of Cellular Structures

Arxiv

0+阅读 · 2021年10月10日

IMF: Iterative Max-Flow for Node Localizability Detection in Barycentric Linear Localization

Arxiv

0+阅读 · 2021年10月8日

Stochastic Iterative Graph Matching

Arxiv

6+阅读 · 2021年6月4日

NSCaching: Simple and Efficient Negative Sampling for Knowledge Graph Embedding

NSCaching: Simple and Efficient Negative Sampling for Knowledge Graph Embedding

Arxiv

7+阅读 · 2019年1月18日

Aleatoric uncertainty estimation with test-time augmentation for medical image segmentation with convolutional neural networks

Aleatoric uncertainty estimation with test-time augmentation for medical image segmentation with convolutional neural networks

Arxiv

7+阅读 · 2018年7月20日

Test-time augmentation with uncertainty estimation for deep learning-based medical image segmentation

Test-time augmentation with uncertainty estimation for deep learning-based medical image segmentation

Arxiv

4+阅读 · 2018年7月19日

Adaptive strategy for superpixel-based region-growing image segmentation

Arxiv

4+阅读 · 2018年3月17日

VIP会员

文章信息

相关主题

预测器/决策函数

相关VIP内容

人工智能的理论及实践知识图谱，160页pdf

人工智能的理论及实践知识图谱，160页pdf

专知会员服务

104+阅读 · 2021年6月30日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【Google】平滑对抗训练，Smooth Adversarial Training

【Google】平滑对抗训练，Smooth Adversarial Training

专知会员服务

49+阅读 · 2020年7月4日

知识图谱推理，50页ppt，Salesforce首席科学家Richard Socher

知识图谱推理，50页ppt，Salesforce首席科学家Richard Socher

专知会员服务

111+阅读 · 2020年6月10日

【论文推荐】Stochastic Graph Neural Networks，随机图神经网络

【论文推荐】Stochastic Graph Neural Networks，随机图神经网络

专知会员服务

69+阅读 · 2020年6月6日

【CVPR2020-台大】透视眼：学会透过障碍物看东西，Learning to See Through Obstructions

【CVPR2020-台大】透视眼：学会透过障碍物看东西，Learning to See Through Obstructions

专知会员服务

27+阅读 · 2020年4月3日

【MIT】图神经网络的泛化与表示极限，《Generalization and Representational Limits of Graph Neural Networks》

【MIT】图神经网络的泛化与表示极限，《Generalization and Representational Limits of Graph Neural Networks》

专知会员服务

46+阅读 · 2020年2月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

智能体记忆深度剖析：评价指标与系统局限性的分类体系及实证分析

《可信人工智能赋能系统的支柱》

【CMU博士论文】可靠轨迹预测的分层基石：数据、评估与方法

人工智能赋能边缘与自主系统：美陆军现代化进程聚焦威胁探测与战术边缘情报

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

相关论文

Training Deep Networks from Zero to Hero: avoiding pitfalls and going beyond

Arxiv

0+阅读 · 2021年10月13日

Expectigrad: Fast Stochastic Optimization with Robust Convergence Properties

Arxiv

0+阅读 · 2021年10月12日

Adapting Stepsizes by Momentumized Gradients Improves Optimization and Generalization

Arxiv

0+阅读 · 2021年10月10日

IH-GAN: A Conditional Generative Model for Implicit Surface-Based Inverse Design of Cellular Structures

Arxiv

0+阅读 · 2021年10月10日

IMF: Iterative Max-Flow for Node Localizability Detection in Barycentric Linear Localization

Arxiv

0+阅读 · 2021年10月8日

Stochastic Iterative Graph Matching

Arxiv

6+阅读 · 2021年6月4日

NSCaching: Simple and Efficient Negative Sampling for Knowledge Graph Embedding

NSCaching: Simple and Efficient Negative Sampling for Knowledge Graph Embedding

Arxiv

7+阅读 · 2019年1月18日

Aleatoric uncertainty estimation with test-time augmentation for medical image segmentation with convolutional neural networks

Aleatoric uncertainty estimation with test-time augmentation for medical image segmentation with convolutional neural networks

Arxiv

7+阅读 · 2018年7月20日

Test-time augmentation with uncertainty estimation for deep learning-based medical image segmentation

Test-time augmentation with uncertainty estimation for deep learning-based medical image segmentation

Arxiv

4+阅读 · 2018年7月19日

Adaptive strategy for superpixel-based region-growing image segmentation

Arxiv

4+阅读 · 2018年3月17日

微信扫码咨询专知VIP会员