A unified conformational selection and induced fit approach to protein-peptide docking - PubMed
A unified conformational selection and induced fit approach to protein-peptide docking
Mikael Trellet et al. PLoS One. 2013.
Abstract
Protein-peptide interactions are vital for the cell. They mediate, inhibit or serve as structural components in nearly 40% of all macromolecular interactions, and are often associated with diseases, making them interesting leads for protein drug design. In recent years, large-scale technologies have enabled exhaustive studies on the peptide recognition preferences for a number of peptide-binding domain families. Yet, the paucity of data regarding their molecular binding mechanisms together with their inherent flexibility makes the structural prediction of protein-peptide interactions very challenging. This leaves flexible docking as one of the few amenable computational techniques to model these complexes. We present here an ensemble, flexible protein-peptide docking protocol that combines conformational selection and induced fit mechanisms. Starting from an ensemble of three peptide conformations (extended, a-helix, polyproline-II), flexible docking with HADDOCK generates 79.4% of high quality models for bound/unbound and 69.4% for unbound/unbound docking when tested against the largest protein-peptide complexes benchmark dataset available to date. Conformational selection at the rigid-body docking stage successfully recovers the most relevant conformation for a given protein-peptide complex and the subsequent flexible refinement further improves the interface by up to 4.5 Å interface RMSD. Cluster-based scoring of the models results in a selection of near-native solutions in the top three for ∼75% of the successfully predicted cases. This unified conformational selection and induced fit approach to protein-peptide docking should open the route to the modeling of challenging systems such as disorder-order transitions taking place upon binding, significantly expanding the applicability limit of biomolecular interaction modeling by docking.
Conflict of interest statement
Competing Interests: The authors have declared that no competing interests exist.
Figures

(A) Distribution of positional backbone RMSDs between the bound form of the peptides present in the benchmark and an ideal extended conformation. These are classified into three categories (easy, medium and difficult) based on the amplitude of the conformational change upon binding. (B) Percentage of solvent accessible residues computed for all peptides in the crystal structures of the respective protein-peptide complexes.

The percentages of near-native and sub-angstrom resolution models at the various stages (rigid-body (it0), semi-flexible (it1) and water refinement (water)) are reported in the left panels and were calculated over the 400 final models generated by HADDOCK. The right panels show the percentages after water refinement as a function of the docking difficulty.

The top 400 models are selected from the 6000 models generated based on their HADDOCK score. (A) Selection details for the 19 helical peptide cases. (B) Fractions of extended cases (41) with a predominant selection (i.e. majority of the selected conformations) coming from either extended, helical or polyproline II peptides. (C) Fraction of other cases (37) with a predominant selection coming from either extended, helical or polyproline II peptides.

The percentages of near-native and sub-angstrom resolution models (see Methods) at the various stages (rigid-body (it0), semi-flexible (it1) and water refinement (water)) are reported in the left panels and were calculated over the 400 final models generated by HADDOCK. The right panels show the percentages after water refinement as a function of the docking difficulty.

A docking is defined as successful it at least one near-native model is present within the topXX selected models.

A cluster is considered near-native if one of its top four member is of near-native quality or better.

The PDB-id as well as difficulty, peptide length, rank and i-RMSD values are indicated for each case. The model selected for illustration is the acceptable model with the best rank at the end of the HADDOCK process. The model peptide is shown in purple together with the reference peptide in the crystal structure of the complex in black. Docking model and crystal structure were superimposed on backbone atoms of the protein. The protein (crystal structure) is shown in surface representation. (A) 1NX1, (B) 1CZY, (C) 1LVM and (D) 1D4T. Figure generated with PyMol .

The distributions are calculated from all generated models of the unbound/unbound docking benchmark. A negative i-RMSD difference value reflects an improvement (move toward the bound form) while a positive value indicates a deterioration of this i-RMSD. For Fnat this is reverse: a positive difference indicates an improvement. The impact of flexible refinement in torsion angle space (differences between rigid-body docking and flexible refinement (it1–it0)) is shown in A) i-RMSD diff and C) Fnat diff, and the impact of final water refinement (differences between flexible and water refinement (water-it1) is shown in B) i-RMSD diff and D) Fnat diff.

In this analysis, a docking run is defined as successful if at least one near-native model (for the selected cutoff) is generated within the pool of 400 water-refined models. Results are presented for both bound/unbound (97, black) and unbound/unbound (62, gray) cases.

Percentage of cases with sub-angstrom and near-native models quality assessed by ligand-interface RMSD. The results are given for the whole bound/unbound benchmark and the ‘non-helical’ subset, as reported by FlexPepDock.
Similar articles
-
Hierarchical Flexible Peptide Docking by Conformer Generation and Ensemble Docking of Peptides.
Zhou P, Li B, Yan Y, Jin B, Wang L, Huang SY. Zhou P, et al. J Chem Inf Model. 2018 Jun 25;58(6):1292-1302. doi: 10.1021/acs.jcim.8b00142. Epub 2018 May 21. J Chem Inf Model. 2018. PMID: 29738247
-
Information-Driven, Ensemble Flexible Peptide Docking Using HADDOCK.
Geng C, Narasimhan S, Rodrigues JP, Bonvin AM. Geng C, et al. Methods Mol Biol. 2017;1561:109-138. doi: 10.1007/978-1-4939-6798-8_8. Methods Mol Biol. 2017. PMID: 28236236
-
Kurkcuoglu Z, Bonvin AMJJ. Kurkcuoglu Z, et al. Proteins. 2020 Feb;88(2):292-306. doi: 10.1002/prot.25802. Epub 2019 Sep 3. Proteins. 2020. PMID: 31441121 Free PMC article.
-
Peptide docking and structure-based characterization of peptide binding: from knowledge to know-how.
London N, Raveh B, Schueler-Furman O. London N, et al. Curr Opin Struct Biol. 2013 Dec;23(6):894-902. doi: 10.1016/j.sbi.2013.07.006. Epub 2013 Oct 15. Curr Opin Struct Biol. 2013. PMID: 24138780 Review.
-
Understanding the challenges of protein flexibility in drug design.
Antunes DA, Devaurs D, Kavraki LE. Antunes DA, et al. Expert Opin Drug Discov. 2015 Dec;10(12):1301-13. doi: 10.1517/17460441.2015.1094458. Epub 2015 Sep 28. Expert Opin Drug Discov. 2015. PMID: 26414598 Review.
Cited by
-
Fully Blind Docking at the Atomic Level for Protein-Peptide Complex Structure Prediction.
Yan C, Xu X, Zou X. Yan C, et al. Structure. 2016 Oct 4;24(10):1842-1853. doi: 10.1016/j.str.2016.07.021. Epub 2016 Sep 15. Structure. 2016. PMID: 27642160 Free PMC article.
-
Lee H, Heo L, Lee MS, Seok C. Lee H, et al. Nucleic Acids Res. 2015 Jul 1;43(W1):W431-5. doi: 10.1093/nar/gkv495. Epub 2015 May 12. Nucleic Acids Res. 2015. PMID: 25969449 Free PMC article.
-
Modelling peptide-protein complexes: docking, simulations and machine learning.
Mondal A, Chang L, Perez A. Mondal A, et al. QRB Discov. 2022 Sep 19;3:e17. doi: 10.1017/qrd.2022.14. eCollection 2022. QRB Discov. 2022. PMID: 37529282 Free PMC article.
-
Bayarsaikhan B, Zsidó BZ, Börzsei R, Hetényi C. Bayarsaikhan B, et al. Int J Mol Sci. 2024 May 29;25(11):5945. doi: 10.3390/ijms25115945. Int J Mol Sci. 2024. PMID: 38892133 Free PMC article.
-
Nuclear export receptor CRM1 recognizes diverse conformations in nuclear export signals.
Fung HY, Fu SC, Chook YM. Fung HY, et al. Elife. 2017 Mar 10;6:e23961. doi: 10.7554/eLife.23961. Elife. 2017. PMID: 28282025 Free PMC article.
References
-
- Petsalaki E, Russell RB (2008) Peptide-mediated interactions in biological systems: new discoveries and applications. Curr Opin Biotechnol 19: 344–350. - PubMed
-
- Bordner AJ, Abagyan R (2006) Ab initio prediction of peptide-MHC binding geometry for diverse class I MHC allotypes. Proteins 63: 512–526. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
This work was supported by the Netherlands Organization for Scientific Research (www.nwo.nl) (VICI grant 700.56.442 to A.M.J.J.B.) and the WeNMR project (European FP7 e-Infrastructure grant, contract no. 261572, www.wenmr.eu). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
LinkOut - more resources
Full Text Sources
Other Literature Sources