Prediction of protein secondary structure content for the twilight zone sequences - PubMed
- ️Mon Jan 01 2007
. 2007 Nov 15;69(3):486-98.
doi: 10.1002/prot.21527.
Affiliations
- PMID: 17623861
- DOI: 10.1002/prot.21527
Prediction of protein secondary structure content for the twilight zone sequences
Leila Homaeian et al. Proteins. 2007.
Abstract
Secondary protein structure carries information about local structural arrangements, which include three major conformations: alpha-helices, beta-strands, and coils. Significant majority of successful methods for prediction of the secondary structure is based on multiple sequence alignment. However, multiple alignment fails to provide accurate results when a sequence comes from the twilight zone, that is, it is characterized by low (<30%) homology. To this end, we propose a novel method for prediction of secondary structure content through comprehensive sequence representation, called PSSC-core. The method uses a multiple linear regression model and introduces a comprehensive feature-based sequence representation to predict amount of helices and strands for sequences from the twilight zone. The PSSC-core method was tested and compared with two other state-of-the-art prediction methods on a set of 2187 twilight zone sequences. The results indicate that our method provides better predictions for both helix and strand content. The PSSC-core is shown to provide statistically significantly better results when compared with the competing methods, reducing the prediction error by 5-7% for helix and 7-9% for strand content predictions. The proposed feature-based sequence representation uses a comprehensive set of physicochemical properties that are custom-designed for each of the helix and strand content predictions. It includes composition and composition moment vectors, frequency of tetra-peptides associated with helical and strand conformations, various property-based groups like exchange groups, chemical groups of the side chains and hydrophobic group, auto-correlations based on hydrophobicity, side-chain masses, hydropathy, and conformational patterns for beta-sheets. The PSSC-core method provides an alternative for predicting the secondary structure content that can be used to validate and constrain results of other structure prediction methods. At the same time, it also provides useful insight into design of successful protein sequence representations that can be used in developing new methods related to prediction of different aspects of the secondary protein structure.
(c) 2007 Wiley-Liss, Inc.
Similar articles
-
Ruan J, Wang K, Yang J, Kurgan LA, Cios K. Ruan J, et al. Artif Intell Med. 2005 Sep-Oct;35(1-2):19-35. doi: 10.1016/j.artmed.2005.02.006. Artif Intell Med. 2005. PMID: 16081261
-
Prediction of protein structural class for the twilight zone sequences.
Kurgan L, Chen K. Kurgan L, et al. Biochem Biophys Res Commun. 2007 Jun 1;357(2):453-60. doi: 10.1016/j.bbrc.2007.03.164. Epub 2007 Apr 5. Biochem Biophys Res Commun. 2007. PMID: 17433260
-
Tubulin secondary structure analysis, limited proteolysis sites, and homology to FtsZ.
de Pereda JM, Leynadier D, Evangelio JA, Chacón P, Andreu JM. de Pereda JM, et al. Biochemistry. 1996 Nov 12;35(45):14203-15. doi: 10.1021/bi961357b. Biochemistry. 1996. PMID: 8916905
-
Predicting the conformation of proteins from sequences. Progress and future progress.
Benner SA. Benner SA. J Mol Recognit. 1995 Jan-Apr;8(1-2):9-28. doi: 10.1002/jmr.300080104. J Mol Recognit. 1995. PMID: 7598957 Review.
-
Prediction of protein structure from amino acid sequence.
Sternberg MJ. Sternberg MJ. Anticancer Drug Des. 1986 Nov;1(3):169-78. Anticancer Drug Des. 1986. PMID: 3329910 Review.
Cited by
-
Fold homology detection using sequence fragment composition profiles of proteins.
Solis AD, Rackovsky SR. Solis AD, et al. Proteins. 2010 Oct;78(13):2745-56. doi: 10.1002/prot.22788. Proteins. 2010. PMID: 20635424 Free PMC article.
-
Mizianty MJ, Kurgan L. Mizianty MJ, et al. BMC Bioinformatics. 2009 Dec 13;10:414. doi: 10.1186/1471-2105-10-414. BMC Bioinformatics. 2009. PMID: 20003388 Free PMC article.
-
Wu Z, Basu S, Wu X, Kurgan L. Wu Z, et al. Protein Sci. 2023 Jan;32(1):e4544. doi: 10.1002/pro.4544. Protein Sci. 2023. PMID: 36519304 Free PMC article.
-
Shekhawat U, Roy Chowdhury Chakravarty A. Shekhawat U, et al. J Biol Phys. 2022 Dec;48(4):399-414. doi: 10.1007/s10867-022-09615-x. Epub 2022 Nov 23. J Biol Phys. 2022. PMID: 36422744 Free PMC article.
-
On the relation between the predicted secondary structure and the protein size.
Kurgan L. Kurgan L. Protein J. 2008 Jun;27(4):234-9. doi: 10.1007/s10930-008-9129-0. Protein J. 2008. PMID: 18299971
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources