FABind+: Enhancing Molecular Docking through Improved Pocket Prediction and Pose Generation
Abstract: Molecular docking is a pivotal process in drug discovery. While traditional techniques rely on extensive sampling and simulation governed by physical principles, these methods are often slow and costly. The advent of deep learning-based approaches has shown significant promise, offering increases in both accuracy and efficiency. Building upon the foundational work of FABind, a model designed with a focus on speed and accuracy, we present FABind+, an enhanced iteration that largely boosts the performance of its predecessor. We identify pocket prediction as a critical bottleneck in molecular docking and propose a novel methodology that significantly refines pocket prediction, thereby streamlining the docking process. Furthermore, we introduce modifications to the docking module to enhance its pose generation capabilities. In an effort to bridge the gap with conventional sampling/generative methods, we incorporate a simple yet effective sampling technique coupled with a confidence model, requiring only minor adjustments to the regression framework of FABind. Experimental results and analysis reveal that FABind+ remarkably outperforms the original FABind, achieves competitive state-of-the-art performance, and delivers insightful modeling strategies. This demonstrates FABind+ represents a substantial step forward in molecular docking and drug discovery. Our code is in https://github.com/QizhiPei/FABind.
- The impact of large language models on scientific discovery: a preliminary study using gpt-4. arXiv preprint arXiv:2311.07361, 2023.
- Scheduled sampling for sequence prediction with recurrent neural networks. Advances in neural information processing systems, 28, 2015.
- Virtual ligand screening against escherichia coli dihydrofolate reductase: improving docking enrichment using physics-based methods. SLAS Discovery, 10(7):675–681, 2005.
- Generative chemistry: drug discovery with deep learning generative models. Journal of Molecular Modeling, 27:1–18, 2021.
- Mapping of protein binding sites using clustering algorithms-development of a pharmacophore based drug discovery tool. Journal of Molecular Graphics and Modelling, 115:108228, 2022.
- Rcsb protein data bank: powerful new tools for exploring 3d structures of biological macromolecules for basic and applied research and education in fundamental biology, biomedicine, biotechnology, bioengineering and energy sciences. Nucleic acids research, 49(D1):D437–D451, 2021.
- Predicting protein ligand binding sites by combining evolutionary sequence conservation and 3d structure. PLoS computational biology, 5(12):e1000585, 2009.
- Sequence-based prediction of protein interaction sites with an integrative method. Bioinformatics, 25(5):585–591, 2009.
- Molecular design in drug discovery: a comprehensive review of deep generative models. Briefings in bioinformatics, 22(6):bbab344, 2021.
- A (sub) graph isomorphism algorithm for matching large graphs. IEEE transactions on pattern analysis and machine intelligence, 26(10):1367–1372, 2004.
- The discovery of binding modes requires rethinking docking generalization. In NeurIPS 2023 Generative AI and Biology (GenBio) Workshop, 2023a.
- Diffdock: Diffusion steps, twists, and turns for molecular docking. In International Conference on Learning Representations (ICLR 2023), 2023b.
- Performance and structural coverage of the latest, in-development alphafold model. 2023.
- Glide: a new approach for rapid, accurate docking and scoring. 1. method and assessment of docking accuracy. Journal of medicinal chemistry, 47(7):1739–1749, 2004.
- Deep docking: a deep learning platform for augmentation of structure based drug discovery. ACS central science, 6(6):939–949, 2020.
- Diffdock-site: A novel paradigm for enhanced protein-ligand predictions through binding site identification. In NeurIPS 2023 Generative AI and Biology (GenBio) Workshop, 2023.
- Physics-based methods for studying protein-ligand interactions. Current Opinion in Drug Discovery and Development, 10(3):325, 2007.
- Molecular mechanics methods for predicting protein–ligand binding. Physical Chemistry Chemical Physics, 8(44):5166–5177, 2006.
- Huber, P. J. Robust estimation of a location parameter. In Breakthroughs in statistics: Methodology and distribution, pp. 492–518. Springer, 1992.
- Deepsite: protein-binding site predictor using 3d-convolutional neural networks. Bioinformatics, 33(19):3036–3042, 2017.
- Development and validation of a genetic algorithm for flexible docking. Journal of molecular biology, 267(3):727–748, 1997.
- Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.
- Lessons learned in empirical scoring with smina from the csar 2011 benchmarking exercise. Journal of chemical information and modeling, 53(8):1893–1904, 2013.
- P2rank: machine learning based tool for rapid and accurate prediction of ligand binding sites from protein structure. Journal of cheminformatics, 10:1–12, 2018.
- Landrum, G. et al. Rdkit: A software suite for cheminformatics, computational chemistry, and predictive modeling. Greg Landrum, 2013.
- Laskowski, R. A. Surfnet: a program for visualizing molecular surfaces, cavities, and intermolecular interactions. Journal of molecular graphics, 13(5):323–330, 1995.
- Fpocket: an open source platform for ligand pocket detection. BMC bioinformatics, 10(1):1–11, 2009.
- Pre-training on large-scale generated docking conformations with helixdock to unlock the potential of protein-ligand structure prediction models. arXiv preprint arXiv:2310.13913, 2023.
- Forging the basis for developing protein–ligand interaction scoring functions. Accounts of chemical research, 50(2):302–309, 2017.
- Enhancing scientific discoveries in molecular biology with deep generative models. Molecular systems biology, 16(9):e9198, 2020.
- Tankbind: Trigonometry-aware neural networks for drug-protein binding structure prediction. bioRxiv, 2022.
- Dynamicbind: Predicting ligand-specific protein-ligand complex structure with a deep equivariant generative model. 2023.
- Deep learning model for flexible and efficient protein-ligand docking. In ICLR2022 Machine Learning for Drug Discovery, 2022. URL https://openreview.net/forum?id=WNwsnE81meC.
- Gnina 1.0: molecular docking with deep learning. Journal of cheminformatics, 13(1):1–20, 2021.
- spyrmsd: symmetry-corrected rmsd calculations in python. Journal of Cheminformatics, 12(1):49, 2020.
- Molecular docking. Molecular modeling of proteins, pp. 365–382, 2008.
- Distributed automated docking of flexible ligands to proteins: parallel applications of autodock 2.4. Journal of computer-aided molecular design, 10(4):293–304, 1996.
- End-to-end protein–ligand complex structure generation with diffusion-based generative models. BMC bioinformatics, 24(1):1–18, 2023.
- Training language models to follow instructions with human feedback. Advances in Neural Information Processing Systems, 35:27730–27744, 2022.
- Software for molecular docking: a review. Biophysical reviews, 9:91–102, 2017.
- Fabind: Fast and accurate protein-ligand binding. In Thirty-seventh Conference on Neural Information Processing Systems, 2023.
- Advances in free-energy-based simulations of protein folding and ligand binding. Current opinion in structural biology, 36:25–31, 2016.
- Deepdrug3d: classification of ligand-binding pockets in proteins with a convolutional neural network. PLoS computational biology, 15(2):e1006718, 2019.
- Protein–ligand scoring with convolutional neural networks. Journal of chemical information and modeling, 57(4):942–957, 2017.
- Better informed distance geometry: using what we know to improve conformation generation. Journal of chemical information and modeling, 55(12):2562–2574, 2015.
- Matching chemistry and shape in molecular docking. Protein Engineering, Design and Selection, 6(7):723–732, 1993.
- Dropout: a simple way to prevent neural networks from overfitting. The journal of machine learning research, 15(1):1929–1958, 2014.
- Protein binding pocket dynamics. Accounts of chemical research, 49(5):809–815, 2016.
- Equibind: Geometric deep learning for drug binding structure prediction. In International Conference on Machine Learning, pp. 20503–20521. PMLR, 2022.
- Sequence-based prediction of protein–peptide binding sites using support vector machine. Journal of computational chemistry, 37(13):1223–1229, 2016.
- Moldock: a new technique for high-accuracy molecular docking. Journal of medicinal chemistry, 49(11):3315–3321, 2006.
- Autodock vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. Journal of computational chemistry, 31(2):455–461, 2010.
- Improved protein–ligand docking using gold. Proteins: Structure, Function, and Bioinformatics, 52(4):609–623, 2003.
- Pickpocket: Pocket binding prediction for specific ligands family using neural networks. bioRxiv, pp. 2020–04, 2020.
- Flexidock: Compositional diffusion models for flexible molecular docking. 2023.
- Pocketpicker: analysis of ligand binding-sites with shape descriptors. Chemistry Central Journal, 1(1):1–17, 2007.
- R-drop: Regularized dropout for neural networks. Advances in Neural Information Processing Systems, 34:10890–10905, 2021.
- Multi-scale iterative refinement towards robust and versatile molecular docking. arXiv preprint arXiv:2311.18574, 2023.
- Deepbindrg: a deep learning based method for estimating effective protein–ligand affinity. PeerJ, 7:e7362, 2019.
- Efficient and accurate large library ligand docking with karmadock. Nature Computational Science, 3(9):789–804, 2023a.
- E3bind: An end-to-end equivariant network for protein-ligand docking. arXiv preprint arXiv:2210.06069, 2022.
- Equipocket: an e (3)-equivariant geometric graph neural network for ligand binding site prediction. arXiv preprint arXiv:2302.12177, 2023b.
- Uni-mol: a universal 3d molecular representation learning framework. 2023.
- Direct molecular conformation generation. Trans. Mach. Learn. Res., 2022, 2022. URL https://openreview.net/forum?id=lCPOHiztuw.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.