Ab initio modeling of small proteins by iterative TASSER simulations - PubMed
- ️Mon Jan 01 2007
Ab initio modeling of small proteins by iterative TASSER simulations
Sitao Wu et al. BMC Biol. 2007.
Abstract
Background: Predicting 3-dimensional protein structures from amino-acid sequences is an important unsolved problem in computational structural biology. The problem becomes relatively easier if close homologous proteins have been solved, as high-resolution models can be built by aligning target sequences to the solved homologous structures. However, for sequences without similar folds in the Protein Data Bank (PDB) library, the models have to be predicted from scratch. Progress in the ab initio structure modeling is slow. The aim of this study was to extend the TASSER (threading/assembly/refinement) method for the ab initio modeling and examine systemically its ability to fold small single-domain proteins.
Results: We developed I-TASSER by iteratively implementing the TASSER method, which is used in the folding test of three benchmarks of small proteins. First, data on 16 small proteins (< 90 residues) were used to generate I-TASSER models, which had an average Calpha-root mean square deviation (RMSD) of 3.8A, with 6 of them having a Calpha-RMSD < 2.5A. The overall result was comparable with the all-atomic ROSETTA simulation, but the central processing unit (CPU) time by I-TASSER was much shorter (150 CPU days vs. 5 CPU hours). Second, data on 20 small proteins (< 120 residues) were used. I-TASSER folded four of them with a Calpha-RMSD < 2.5A. The average Calpha-RMSD of the I-TASSER models was 3.9A, whereas it was 5.9A using TOUCHSTONE-II software. Finally, 20 non-homologous small proteins (< 120 residues) were taken from the PDB library. An average Calpha-RMSD of 3.9A was obtained for the third benchmark, with seven cases having a Calpha-RMSD < 2.5A.
Conclusion: Our simulation results show that I-TASSER can consistently predict the correct folds and sometimes high-resolution models for small single-domain proteins. Compared with other ab initio modeling methods such as ROSETTA and TOUCHSTONE II, the average performance of I-TASSER is either much better or is similar within a lower computational time. These data, together with the significant performance of automated I-TASSER server (the Zhang-Server) in the 'free modeling' section of the recent Critical Assessment of Structure Prediction (CASP)7 experiment, demonstrate new progresses in automated ab initio model generation. The I-TASSER server is freely available for academic users http://zhang.bioinformatics.ku.edu/I-TASSER.
Figures
![Figure 1](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e911/1878469/f9371abe57b6/1741-7007-5-17-1.gif)
Flowchart of I-TASSER method for protein structure prediction.
![Figure 2](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e911/1878469/fcd8c31a3478/1741-7007-5-17-2.gif)
Examples of I-TASSER models from three independent benchmark sets. The green color is for I-TASSER models and blue for the native structures. (A–C) are from benchmark I (Bradley et al [13]); (D–F) are from benchmark II (Zhang et al [12]); and (G–I) are from benchmark III, selected directly from the PDB library. Column 1 contains the high-resolution models with a Cα-RMSD ≤ 1.5Å; column 2 contains the medium-resolution models with a Cα-RMSD of 1.5–5Å; column 3 contains the low-resolution models with a Cα-RMSD > 5Å. The Cα-RMSD value for the examples are: (A) 1ogwA_ (1.1Å), (B) 1di2A_ (2.3Å), (C) 1dcjA_(10.0Å), (D) 1cy5A (1.5Å), (E) 1pgx (3.1Å), (F) 1gnuA (8.2Å), (G) 1cqkA (1.5Å), (H) 1gyvA (3.3Å), (I) 1no5A(10.5Å). The pictures were generated using PyMOL software [45].
![Figure 3](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e911/1878469/e774eccca48c/1741-7007-5-17-3.gif)
Comparison of I-TASSER models with the PPA threading alignment results. (A) Cα-RMSD to native of the I-TASSER models versus Cα-RMSD to native of the best threading alignment over the same aligned regions. (B) TM-score of the I-TASSER models versus TM-score of the best threading alignments.
Similar articles
-
Analysis of TASSER-based CASP7 protein structure prediction results.
Zhou H, Pandit SB, Lee SY, Borreguero J, Chen H, Wroblewska L, Skolnick J. Zhou H, et al. Proteins. 2007;69 Suppl 8:90-7. doi: 10.1002/prot.21649. Proteins. 2007. PMID: 17705276
-
Integration of QUARK and I-TASSER for Ab Initio Protein Structure Prediction in CASP11.
Zhang W, Yang J, He B, Walker SE, Zhang H, Govindarajoo B, Virtanen J, Xue Z, Shen HB, Zhang Y. Zhang W, et al. Proteins. 2016 Sep;84 Suppl 1(Suppl 1):76-86. doi: 10.1002/prot.24930. Epub 2015 Sep 23. Proteins. 2016. PMID: 26370505 Free PMC article.
-
I-TASSER server for protein 3D structure prediction.
Zhang Y. Zhang Y. BMC Bioinformatics. 2008 Jan 23;9:40. doi: 10.1186/1471-2105-9-40. BMC Bioinformatics. 2008. PMID: 18215316 Free PMC article.
-
Zhou X, Zheng W, Li Y, Pearce R, Zhang C, Bell EW, Zhang G, Zhang Y. Zhou X, et al. Nat Protoc. 2022 Oct;17(10):2326-2353. doi: 10.1038/s41596-022-00728-0. Epub 2022 Aug 5. Nat Protoc. 2022. PMID: 35931779 Review.
-
AI-Driven Deep Learning Techniques in Protein Structure Prediction.
Chen L, Li Q, Nasif KFA, Xie Y, Deng B, Niu S, Pouriyeh S, Dai Z, Chen J, Xie CY. Chen L, et al. Int J Mol Sci. 2024 Aug 1;25(15):8426. doi: 10.3390/ijms25158426. Int J Mol Sci. 2024. PMID: 39125995 Free PMC article. Review.
Cited by
-
Raza A, Saeed A, Ibrar A, Muddassar M, Khan AA, Iqbal J. Raza A, et al. ISRN Pharmacol. 2012;2012:707932. doi: 10.5402/2012/707932. Epub 2012 Aug 16. ISRN Pharmacol. 2012. PMID: 22966467 Free PMC article.
-
Toward the solution of the protein structure prediction problem.
Pearce R, Zhang Y. Pearce R, et al. J Biol Chem. 2021 Jul;297(1):100870. doi: 10.1016/j.jbc.2021.100870. Epub 2021 Jun 11. J Biol Chem. 2021. PMID: 34119522 Free PMC article. Review.
-
Fragment-free approach to protein folding using conditional neural fields.
Zhao F, Peng J, Xu J. Zhao F, et al. Bioinformatics. 2010 Jun 15;26(12):i310-7. doi: 10.1093/bioinformatics/btq193. Bioinformatics. 2010. PMID: 20529922 Free PMC article.
-
Patino-Lopez G, Aravind L, Dong X, Kruhlak MJ, Ostap EM, Shaw S. Patino-Lopez G, et al. J Biol Chem. 2010 Mar 19;285(12):8675-86. doi: 10.1074/jbc.M109.086959. Epub 2010 Jan 12. J Biol Chem. 2010. PMID: 20071333 Free PMC article.
-
I-TASSER: a unified platform for automated protein structure and function prediction.
Roy A, Kucukural A, Zhang Y. Roy A, et al. Nat Protoc. 2010 Apr;5(4):725-38. doi: 10.1038/nprot.2010.5. Epub 2010 Mar 25. Nat Protoc. 2010. PMID: 20360767 Free PMC article.
References
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources