Papers
Topics
Authors
Recent
Search
2000 character limit reached

Accelerating Inference in Molecular Diffusion Models with Latent Representations of Protein Structure

Published 22 Nov 2023 in q-bio.BM and cs.LG | (2311.13466v2)

Abstract: Diffusion generative models have emerged as a powerful framework for addressing problems in structural biology and structure-based drug design. These models operate directly on 3D molecular structures. Due to the unfavorable scaling of graph neural networks (GNNs) with graph size as well as the relatively slow inference speeds inherent to diffusion models, many existing molecular diffusion models rely on coarse-grained representations of protein structure to make training and inference feasible. However, such coarse-grained representations discard essential information for modeling molecular interactions and impair the quality of generated structures. In this work, we present a novel GNN-based architecture for learning latent representations of molecular structure. When trained end-to-end with a diffusion model for de novo ligand design, our model achieves comparable performance to one with an all-atom protein representation while exhibiting a 3-fold reduction in inference time.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (21)
  1. Structure-based Drug Design with Equivariant Diffusion Models, June 2023. URL http://arxiv.org/abs/2210.13695. arXiv:2210.13695 [cs, q-bio].
  2. DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking, February 2023. URL http://arxiv.org/abs/2210.01776. arXiv:2210.01776 [physics, q-bio].
  3. Equivariant 3D-Conditional Diffusion Models for Molecular Linker Design, October 2022. URL http://arxiv.org/abs/2210.05274. arXiv:2210.05274 [cs, q-bio].
  4. DiffHopp: A Graph Diffusion Model for Novel Drug Design via Scaffold Hopping, August 2023. URL http://arxiv.org/abs/2308.07416. arXiv:2308.07416 [q-bio].
  5. AlphaFold2 versus experimental structures: evaluation on G protein-coupled receptors. Acta Pharmacologica Sinica, 44(1):1–7, January 2023. ISSN 1745-7254. doi: 10.1038/s41401-022-00938-y. URL https://www.nature.com/articles/s41401-022-00938-y. Number: 1 Publisher: Nature Publishing Group.
  6. How good are AlphaFold models for docking-based virtual screening? iScience, 26(1):105920, January 2023. ISSN 2589-0042. doi: 10.1016/j.isci.2022.105920.
  7. How accurately can one predict drug binding modes using AlphaFold models? eLife, 12, August 2023. doi: 10.7554/eLife.89386. URL https://elifesciences.org/reviewed-preprints/89386. Publisher: eLife Sciences Publications Limited.
  8. Denoising Diffusion Probabilistic Models, December 2020. URL http://arxiv.org/abs/2006.11239. Number: arXiv:2006.11239 arXiv:2006.11239 [cs, stat].
  9. Variational Diffusion Models, June 2022. URL http://arxiv.org/abs/2107.00630. Number: arXiv:2107.00630 arXiv:2107.00630 [cs, stat].
  10. Equivariant Diffusion for Molecule Generation in 3D. Technical Report arXiv:2203.17003, arXiv, March 2022. URL http://arxiv.org/abs/2203.17003. arXiv:2203.17003 [cs, q-bio, stat] type: article.
  11. Independent SE(3)-Equivariant Models for End-to-End Rigid Protein Docking, March 2022. URL http://arxiv.org/abs/2111.07786. Number: arXiv:2111.07786 arXiv:2111.07786 [cs].
  12. E(n) Equivariant Graph Neural Networks, February 2022. URL http://arxiv.org/abs/2102.09844. Number: arXiv:2102.09844 arXiv:2102.09844 [cs, stat].
  13. Learning from Protein Structure with Geometric Vector Perceptrons, May 2021a. URL http://arxiv.org/abs/2009.01411. arXiv:2009.01411 [cs, q-bio, stat].
  14. Equivariant Graph Neural Networks for 3D Macromolecular Structure, July 2021b. URL http://arxiv.org/abs/2106.03843. arXiv:2106.03843 [cs, q-bio].
  15. On the Expressive Power of Geometric Graph Neural Networks, June 2023. URL http://arxiv.org/abs/2301.09308. arXiv:2301.09308 [cs, math, stat].
  16. Graph Attention Networks, February 2018. URL http://arxiv.org/abs/1710.10903. arXiv:1710.10903 [cs, stat].
  17. POT: Python Optimal Transport. Journal of Machine Learning Research, 22(78):1–8, 2021. ISSN 1533-7928. URL http://jmlr.org/papers/v22/20-451.html.
  18. Binding MOAD (Mother Of All Databases). Proteins: Structure, Function, and Bioinformatics, 60(3):333–340, 2005. ISSN 1097-0134. doi: 10.1002/prot.20512. URL https://onlinelibrary.wiley.com/doi/abs/10.1002/prot.20512. _eprint: https://onlinelibrary.wiley.com/doi/pdf/10.1002/prot.20512.
  19. Announcing the worldwide Protein Data Bank. Nature Structural & Molecular Biology, 10(12):980–980, December 2003. ISSN 1545-9985. doi: 10.1038/nsb1203-980. URL https://www.nature.com/articles/nsb1203-980. Number: 12 Publisher: Nature Publishing Group.
  20. AutoDock Vina 1.2.0: New Docking Methods, Expanded Force Field, and Python Bindings. Journal of Chemical Information and Modeling, 61(8):3891–3898, August 2021. ISSN 1549-9596. doi: 10.1021/acs.jcim.1c00203. URL https://doi.org/10.1021/acs.jcim.1c00203. Publisher: American Chemical Society.
  21. RDKit. URL http://www.rdkit.org/.
Citations (3)

Summary

No one has generated a summary of this paper yet.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Authors (2)

Collections

Sign up for free to add this paper to one or more collections.

Tweets

Sign up for free to view the 1 tweet with 2 likes about this paper.