Intrinsically semi-disordered state and its role in induced folding and protein aggregation - PubMed
Intrinsically semi-disordered state and its role in induced folding and protein aggregation
Tuo Zhang et al. Cell Biochem Biophys. 2013.
Abstract
Intrinsically disordered proteins (IDPs) refer to those proteins without fixed three-dimensional structures under physiological conditions. Although experiments suggest that the conformations of IDPs can vary from random coils, semi-compact globules, to compact globules with different contents of secondary structures, computational efforts to separate IDPs into different states are not yet successful. Recently, we developed a neural-network-based disorder prediction technique SPINE-D that was ranked as one of the top performing techniques for disorder prediction in the biannual meeting of critical assessment of structure prediction techniques (CASP 9, 2010). Here, we further analyze the results from SPINE-D prediction by defining a semi-disordered state that has about 50% predicted probability to be disordered or ordered. This semi-disordered state is partially collapsed with intermediate levels of predicted solvent accessibility and secondary structure content. The relative difference in compositions between semi-disordered and fully disordered regions is highly correlated with amyloid aggregation propensity (a correlation coefficient of 0.86 if excluding four charged residues and proline, 0.73 if not). In addition, we observed that some semi-disordered regions participate in induced folding, and others play key roles in protein aggregation. More specifically, a semi-disordered region is amyloidogenic in fully unstructured proteins (such as alpha-synuclein and Sup35) but prone to local unfolding that exposes the hydrophobic core to aggregation in structured globular proteins (such as SOD1 and lysozyme). A transition from full disorder to semi-disorder at about 30-40 Qs is observed in the poly-Q (poly-glutamine) tract of huntingtin. The accuracy of using semi-disorder to predict binding-induced folding and aggregation is compared with several methods trained for the purpose. These results indicate the usefulness of three-state classification (order, semi-disorder, and full-disorder) in distinguishing nonfolding from induced-folding and aggregation-resistant from aggregation-prone IDPs and in locating weakly stable, locally unfolding, and potentially aggregation regions in structured proteins. A comparison with five representative disorder-prediction methods showed that SPINE-D is the only method with a clear separation of semi-disorder from ordered and fully disordered states.
Figures
![Fig. 1](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c298/3838602/3fdfb0bc0b81/12013_2013_9638_Fig1_HTML.gif)
The distribution of disorder probability predicted by SPINE-D at residue level before (a) and after scaling (b) and at long segment level (>30 amino acid residues) (c) for three datasets (DX4080, SL477, and Control703). The insert in (a) shows the fine detail around the disorder probability of 0.06. The negative control set (stable monomeric proteins) does not have a peak for fully disordered residues or regions, indicating the usefulness of separating semi-disorder from full disorder
![Fig. 2](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c298/3838602/e942d25fd125/12013_2013_9638_Fig2_HTML.gif)
Ordered (green), semi-disordered (blue), and fully disordered (red) regions in term of fraction of exposed residues (x-axis) and fraction of residues with secondary structures (y-axis) based on SPINE-D results of the DX4080 dataset. A residue is defined as exposed if its predicted solvent accessibility is greater than 25 %. Secondary structures and solvent accessibility are predicted by SPINE-X and Real-SPINE 3, respectively (Color figure online)
![Fig. 3](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c298/3838602/0b6f17599606/12013_2013_9638_Fig3_HTML.gif)
(a) The Gibbs triangle diagram of the fractions of residues in three states (ordered, semi-disordered, fully disordered residues) for all proteins in the three datasets as labeled. Each protein is a point and its position is determined by three fractions of residues. (b) Disorder probability profiles with zero ordered residues (f o = 0) for the chain A of the PDB ID 2qt4 (2qt4A) in the control set, for DP00179 (chain B in PDB ID 1DPJ) in SL477, and for chain C of PDB ID 2prg (2prgC) in DX4080. The semi-disordered regions correspond to structured regions (horizontal lines) stabilized by disulfide bonds (2qt4A), by binding-induced folding (DP00179 and 2prgC). Only one structured region of 2prgC bound with its target is visible in this figure. The gray area indicates the region defined as semi-disordered
![Fig. 4](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c298/3838602/78ff4ea4664e/12013_2013_9638_Fig4_HTML.gif)
Structured regions (blue bar) by induced folding of disordered proteins are compared with their semi-disordered regions (probability profile within the gray region) in eight additional proteins with predicted f o = 0 in the DX4080 dataset (PDB IDs as labeled). Only one structured region (1ytvN) corresponds to a fully disordered region at the N-terminal end of chain N of 1ytv. But it is semi-disordered after removing the terminal effect (dashed line). The N-terminal region of chain G of 1mey (consensus zinc finger) does not have coordinates but the same region in identical chains C and F does. Thus, the whole chain G made of mostly the semi-disordered state can be labeled as structured from residue 1 to 85 after binding with DNA in a trimeric form (Color figure online)
![Fig. 5](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c298/3838602/57104da4fcb6/12013_2013_9638_Fig5_HTML.gif)
Transition of the polyglutamine tract of huntingtin from a fully disordered to a partially semi-disordered state. Fraction of glutamines (Qs) in a semi-disordered state (fQ in red) and the average disorder probability (P, in blue) in the poly-Q region as a function of the number of glutamines in the poly-Q tract of huntingtin (Color figure online)
![Fig. 6](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c298/3838602/da688d5c0747/12013_2013_9638_Fig6_HTML.gif)
Semi-disordered state in unstructured (alpha-synuclein and Sup35) and structured proteins (SOD1 and human lysozyme). Predicted disordered probability profiles (P in red) compared with compaction ratios for three different regions at normal pH from combined NMR experiments and replica exchange molecular dynamics simulations of alpha-synuclein (in blue) (a), the measured Cys accessibility profile (scaled by the largest accessibility of 82.2 %, in blue) of yeast Sup 35 (b), root mean squared distance (RMSD) from native by molecular dynamics simulations of SOD1 (c), and the unstructured regions in a partially unfolded state detected by H/D exchange (blue) and the fibril core region from proteolysis (orange) (d). In (c), open regions in blue line correspond locally unfolded regions of SOD1. RMSD values are rescaled and shifted to facilitate comparison. The gray bar indicates the region defined as semi-disordered in disorder probability (0.4 ≤ P ≤ 0.7) (Color figure online)
![Fig. 7](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c298/3838602/f4106c7b6240/12013_2013_9638_Fig7_HTML.gif)
The disorder probability profile of acylphosphate from hyperthermophilic archaeon Sulfolobus solfataricus (Sso AcP) predicted from SPINE-D (red line). The semi-disordered residues from 1 to 12 at the N-terminal after removing terminal effect agree with the unstructured region for the first 12 residues from the NMR experiment (PDB #1Y9O) (blue) (Color figure online)
![Fig. 8](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c298/3838602/697be14808ca/12013_2013_9638_Fig8_HTML.gif)
Strong positive correlation between amyloid aggregation propensity at pH 7 and relative difference in compositions of amino acid residue types between semi-disordered and fully disordered regions [(Crsd−Crfd)/Crfd, green squares] or between ordered and semi-disordered regions [(Cro−Crsd)/Crsd, blue circles] generated from the DX4080 dataset. Cro,Crsd, and Crfdare compositions of amino acid residues for ordered, semi-disordered, and fully disordered states, respectively. Above and below zero of [(Crsd−Crfd)/Crfd] or [(Cro−Crsd)/Crsd] indicates enrichment or depletion relative to fully disordered regions or semi-disordered regions, respectively
![Fig. 9](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c298/3838602/f605dac34267/12013_2013_9638_Fig9_HTML.gif)
(a) Distributions of predicted disorder probabilities for five on-line servers (Disopred2, IU-long, MD, MFDp, and Dispro) in addition to SPINE-D as labeled. (b) Disorder probability of yeast Sup 35 predicted by the above five methods in addition to SPINE-D as labeled. No methods except SPINE-D (in red) separated a collapsed N-terminal region and an extended C-terminal region in agreement with the experimental Cys accessibility data (in black) (Color figure online)
Similar articles
-
Vitali M, Rigamonti V, Natalello A, Colzani B, Avvakumova S, Brocca S, Santambrogio C, Narkiewicz J, Legname G, Colombo M, Prosperi D, Grandori R. Vitali M, et al. Biochim Biophys Acta Gen Subj. 2018 Jul;1862(7):1556-1564. doi: 10.1016/j.bbagen.2018.03.026. Epub 2018 Apr 2. Biochim Biophys Acta Gen Subj. 2018. PMID: 29621630
-
Intrinsic Disorder and Semi-disorder Prediction by SPINE-D.
Zhang T, Faraggi E, Li Z, Zhou Y. Zhang T, et al. Methods Mol Biol. 2017;1484:159-174. doi: 10.1007/978-1-4939-6406-2_12. Methods Mol Biol. 2017. PMID: 27787826
-
Single-molecule fluorescence studies of intrinsically disordered proteins.
Ferreon AC, Moran CR, Gambin Y, Deniz AA. Ferreon AC, et al. Methods Enzymol. 2010;472:179-204. doi: 10.1016/S0076-6879(10)72010-3. Methods Enzymol. 2010. PMID: 20580965
-
Intrinsic disorder in proteins associated with neurodegenerative diseases.
Uversky VN. Uversky VN. Front Biosci (Landmark Ed). 2009 Jun 1;14(14):5188-238. doi: 10.2741/3594. Front Biosci (Landmark Ed). 2009. PMID: 19482612 Review.
-
Juanes-Gusano D, Santos M, Reboto V, Alonso M, Rodríguez-Cabello JC. Juanes-Gusano D, et al. J Pept Sci. 2022 Jan;28(1):e3362. doi: 10.1002/psc.3362. Epub 2021 Sep 20. J Pept Sci. 2022. PMID: 34545666 Review.
Cited by
-
Andrich K, Bieschke J. Andrich K, et al. Adv Exp Med Biol. 2015;863:139-61. doi: 10.1007/978-3-319-18365-7_7. Adv Exp Med Biol. 2015. PMID: 26092630 Free PMC article.
-
Digested disorder, Quarterly intrinsic disorder digest (October-November-December, 2013).
DeForte S, Reddy KD, Uversky VN. DeForte S, et al. Intrinsically Disord Proteins. 2015 Mar 9;3(1):e984569. doi: 10.4161/21690707.2014.984569. eCollection 2015. Intrinsically Disord Proteins. 2015. PMID: 28293487 Free PMC article.
-
Coskuner-Weber O, Mirzanli O, Uversky VN. Coskuner-Weber O, et al. Biophys Rev. 2022 Jun 8;14(3):679-707. doi: 10.1007/s12551-022-00968-0. eCollection 2022 Jun. Biophys Rev. 2022. PMID: 35791387 Free PMC article. Review.
-
SPOT-Disorder2: Improved Protein Intrinsic Disorder Prediction by Ensembled Deep Learning.
Hanson J, Paliwal KK, Litfin T, Zhou Y. Hanson J, et al. Genomics Proteomics Bioinformatics. 2019 Dec;17(6):645-656. doi: 10.1016/j.gpb.2019.01.004. Epub 2020 Mar 13. Genomics Proteomics Bioinformatics. 2019. PMID: 32173600 Free PMC article.
-
Misfolding of amyloidogenic proteins and their interactions with membranes.
Relini A, Marano N, Gliozzi A. Relini A, et al. Biomolecules. 2013 Dec 27;4(1):20-55. doi: 10.3390/biom4010020. Biomolecules. 2013. PMID: 24970204 Free PMC article. Review.
References
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases
Miscellaneous