Target specific peptide design using latent space approximate trajectory collector

February 2, 2023

Tong Lin, Sijie Chen, Ruchira Basu, Dehu Pei, Xiaolin Cheng, Levent Burak Kara

Despite the prevalence and many successes of deep learning in de novo molecular design, generating peptides that target specific proteins remains an unsolved problem. A main barrier is the scarcity of high-quality training data. To tackle this issue, we propose a novel machine-learning-based peptide design architecture called Latent Space Approximate Trajectory Collector (LSATC). It consists of a series of samplers placed along an optimization trajectory on a highly non-convex energy landscape; these samplers approximate the distributions of peptides with desired properties in a latent space. The process involves little human intervention and can be implemented end to end. We demonstrate the model by designing peptide extensions targeting Beta-catenin, a key nuclear effector protein involved in canonical Wnt signalling. Compared with a random sampler, LSATC samples peptides with $36\%$ lower binding scores in a $16$ times smaller interquartile range (IQR) and $284\%$ less hydrophobicity with a $1.4$ times smaller IQR. LSATC also largely outperforms other common generative models. Finally, we used a clustering algorithm to select 4 peptides from the 100 LSATC-designed peptides for experimental validation. The results confirm that all four peptides extended by LSATC show improved Beta-catenin binding by at least $20.0\%$, and two of them show a $3$-fold increase in binding affinity compared to the base peptide.
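The idea of collecting samplers along an optimization trajectory can be illustrated with a toy sketch. This is not the authors' implementation. It assumes a hypothetical analytic energy function in place of a learned binding score, plain gradient descent as the optimizer, and isotropic Gaussian samplers centered on the last few trajectory points. All function names and parameters (`energy`, `collect_trajectory`, `sample_from_trajectory`, `tail`, `sigma`) are illustrative. The sketch then compares the median and IQR of the energies against a uniform random sampler, mirroring the kind of comparison reported in the abstract.

```python
import math
import random
import statistics


def energy(z):
    """Toy non-convex energy over a latent vector (hypothetical stand-in for a binding score)."""
    return sum(x * x + 2.0 * (1.0 - math.cos(3.0 * x)) for x in z)


def grad(z):
    """Analytic gradient of the toy energy."""
    return [2.0 * x + 6.0 * math.sin(3.0 * x) for x in z]


def collect_trajectory(z0, steps=50, lr=0.02):
    """Run gradient descent from z0 and keep every visited latent point."""
    z = list(z0)
    traj = [list(z)]
    for _ in range(steps):
        z = [x - lr * g for x, g in zip(z, grad(z))]
        traj.append(list(z))
    return traj


def sample_from_trajectory(traj, n, sigma, rng, tail=10):
    """Draw n latent samples from Gaussians centered on the last `tail` trajectory points."""
    centers = traj[-tail:]
    samples = []
    for _ in range(n):
        center = rng.choice(centers)
        samples.append([rng.gauss(x, sigma) for x in center])
    return samples


def iqr(values):
    """Interquartile range: third quartile minus first quartile."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    return q3 - q1


rng = random.Random(0)
dim = 8  # assumed latent dimensionality for the toy example

z0 = [rng.uniform(-3.0, 3.0) for _ in range(dim)]
traj = collect_trajectory(z0)

# Samples guided by the trajectory versus a uniform random baseline.
guided = sample_from_trajectory(traj, n=200, sigma=0.05, rng=rng)
baseline = [[rng.uniform(-3.0, 3.0) for _ in range(dim)] for _ in range(200)]

guided_e = [energy(z) for z in guided]
baseline_e = [energy(z) for z in baseline]

print("guided   median / IQR:", statistics.median(guided_e), iqr(guided_e))
print("baseline median / IQR:", statistics.median(baseline_e), iqr(baseline_e))
```

In a real pipeline the latent points would be decoded back into peptide sequences and scored by a docking or property predictor. Here the energy is evaluated directly on the latent vectors to keep the example self-contained.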
