Accelerating Inference in Molecular Diffusion Models with Latent Representations of Protein Structure
Abstract: Diffusion generative models have emerged as a powerful framework for addressing problems in structural biology and structure-based drug design. These models operate directly on 3D molecular structures. Due to the unfavorable scaling of graph neural networks (GNNs) with graph size as well as the relatively slow inference speeds inherent to diffusion models, many existing molecular diffusion models rely on coarse-grained representations of protein structure to make training and inference feasible. However, such coarse-grained representations discard essential information for modeling molecular interactions and impair the quality of generated structures. In this work, we present a novel GNN-based architecture for learning latent representations of molecular structure. When trained end-to-end with a diffusion model for de novo ligand design, our model achieves comparable performance to one with an all-atom protein representation while exhibiting a 3-fold reduction in inference time.
- Structure-based Drug Design with Equivariant Diffusion Models, June 2023. URL http://arxiv.org/abs/2210.13695. arXiv:2210.13695 [cs, q-bio].
- DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking, February 2023. URL http://arxiv.org/abs/2210.01776. arXiv:2210.01776 [physics, q-bio].
- Equivariant 3D-Conditional Diffusion Models for Molecular Linker Design, October 2022. URL http://arxiv.org/abs/2210.05274. arXiv:2210.05274 [cs, q-bio].
- DiffHopp: A Graph Diffusion Model for Novel Drug Design via Scaffold Hopping, August 2023. URL http://arxiv.org/abs/2308.07416. arXiv:2308.07416 [q-bio].
- AlphaFold2 versus experimental structures: evaluation on G protein-coupled receptors. Acta Pharmacologica Sinica, 44(1):1–7, January 2023. ISSN 1745-7254. doi: 10.1038/s41401-022-00938-y. URL https://www.nature.com/articles/s41401-022-00938-y. Number: 1 Publisher: Nature Publishing Group.
- How good are AlphaFold models for docking-based virtual screening? iScience, 26(1):105920, January 2023. ISSN 2589-0042. doi: 10.1016/j.isci.2022.105920.
- How accurately can one predict drug binding modes using AlphaFold models? eLife, 12, August 2023. doi: 10.7554/eLife.89386. URL https://elifesciences.org/reviewed-preprints/89386. Publisher: eLife Sciences Publications Limited.
- Denoising Diffusion Probabilistic Models, December 2020. URL http://arxiv.org/abs/2006.11239. Number: arXiv:2006.11239 arXiv:2006.11239 [cs, stat].
- Variational Diffusion Models, June 2022. URL http://arxiv.org/abs/2107.00630. Number: arXiv:2107.00630 arXiv:2107.00630 [cs, stat].
- Equivariant Diffusion for Molecule Generation in 3D. Technical Report arXiv:2203.17003, arXiv, March 2022. URL http://arxiv.org/abs/2203.17003. arXiv:2203.17003 [cs, q-bio, stat] type: article.
- Independent SE(3)-Equivariant Models for End-to-End Rigid Protein Docking, March 2022. URL http://arxiv.org/abs/2111.07786. Number: arXiv:2111.07786 arXiv:2111.07786 [cs].
- E(n) Equivariant Graph Neural Networks, February 2022. URL http://arxiv.org/abs/2102.09844. Number: arXiv:2102.09844 arXiv:2102.09844 [cs, stat].
- Learning from Protein Structure with Geometric Vector Perceptrons, May 2021a. URL http://arxiv.org/abs/2009.01411. arXiv:2009.01411 [cs, q-bio, stat].
- Equivariant Graph Neural Networks for 3D Macromolecular Structure, July 2021b. URL http://arxiv.org/abs/2106.03843. arXiv:2106.03843 [cs, q-bio].
- On the Expressive Power of Geometric Graph Neural Networks, June 2023. URL http://arxiv.org/abs/2301.09308. arXiv:2301.09308 [cs, math, stat].
- Graph Attention Networks, February 2018. URL http://arxiv.org/abs/1710.10903. arXiv:1710.10903 [cs, stat].
- POT: Python Optimal Transport. Journal of Machine Learning Research, 22(78):1–8, 2021. ISSN 1533-7928. URL http://jmlr.org/papers/v22/20-451.html.
- Binding MOAD (Mother Of All Databases). Proteins: Structure, Function, and Bioinformatics, 60(3):333–340, 2005. ISSN 1097-0134. doi: 10.1002/prot.20512. URL https://onlinelibrary.wiley.com/doi/abs/10.1002/prot.20512. _eprint: https://onlinelibrary.wiley.com/doi/pdf/10.1002/prot.20512.
- Announcing the worldwide Protein Data Bank. Nature Structural & Molecular Biology, 10(12):980–980, December 2003. ISSN 1545-9985. doi: 10.1038/nsb1203-980. URL https://www.nature.com/articles/nsb1203-980. Number: 12 Publisher: Nature Publishing Group.
- AutoDock Vina 1.2.0: New Docking Methods, Expanded Force Field, and Python Bindings. Journal of Chemical Information and Modeling, 61(8):3891–3898, August 2021. ISSN 1549-9596. doi: 10.1021/acs.jcim.1c00203. URL https://doi.org/10.1021/acs.jcim.1c00203. Publisher: American Chemical Society.
- RDKit. URL http://www.rdkit.org/.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.