Improved amino acid flexibility parameters - PubMed
Improved amino acid flexibility parameters
David K Smith et al. Protein Sci. 2003 May.
Abstract
Protein molecules exhibit varying degrees of flexibility throughout their three-dimensional structures, with some segments showing little mobility while others may be so disordered as to be unresolvable by techniques such as X-ray crystallography. Atomic displacement parameters, or B-factors, from X-ray crystallographic studies give an experimentally determined indication of the degree of mobility in a protein structure. To provide better estimators of amino acid flexibility, we have examined B-factors from a large set of high-resolution crystal structures. Because of the differences among structures, it is necessary to normalize the B-factors. However, many proteins have segments of unusually high mobility, which must be accounted for before normalization can be performed. Accordingly, a median-based method from quality control studies was used to identify outliers. After removal of outliers from, and normalization of, each protein chain, the B-factors were collected for each amino acid in the set. It was found that the distribution of normalized B-factors followed a Gumbel, or extreme value distribution, and the location parameter, or mode, of this distribution was used as an estimator of flexibility for the amino acid. These new parameters have a higher correlation with experimentally determined B-factors than parameters from earlier methods.
Figures
![Figure 1.](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16ac/2323876/ce4cc25f50ee/61237-22f1_F1TT.gif)
Raw and normalized B-factors for 1FNA (Dickinson et al. 1994). (A) The experimentally observed B-factors for 1FNA with the Z = 3.0 (upper) and M = 3.5 (lower) cut-off lines indicated. (B) Normalized B-factors after outliers at Z ≥ 3.0 removed (gray circles), and after outliers at M ≥ 3.5 removed (triangles). The lack of variation in the normalized B-factors can be seen if outliers are not removed properly.
![Figure 2.](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16ac/2323876/8b41d103e6be/61237-22f2_L1TT.gif)
Plots of the normalized B-factors for each amino acid, counted in bins of 0.2 normalized units, with the Y axis giving the percentage of the B-factors in each bin. The best fit to the Gumbel distribution and the parameters of the fit (λ–location, δ–scale) for each amino acid are shown. Axes scales are the same in all plots.
![Figure 3.](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16ac/2323876/3e68b6bd5fe0/61237-22f3_L1TT.gif)
The average autocorrelation function, over lags, or residue separations of 0 to 20, of the B-factors in a chain for the entire dataset. For residue separations of 1 to 7, the coefficients were 0.78, 0.61, 0.46, 0.34, 0.23, 0.15, and 0.09, respectively.
![Figure 4.](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16ac/2323876/da5fbe83e469/61237-22f4_L4TT.gif)
The distribution of the B-factors, as observed (black); as modeled by a mixture of three Gaussian functions (green; maximum-likelihood estimates, w = [0.04, 0.27, 0.69]; μ = [5.31, 1.29, −0.45]; σ = [4.19, 0.98, 0.58]); and as modeled by two Gumbel distributions, one with λ = −0.52; δ = 0.93 (red, maximum-likelihood estimate) and another with λ = −0.52; δ = 0.78 (blue, least-squares fit). The observed distribution was smoothed and the area under the curve was normalized to 1.
![Figure 5.](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16ac/2323876/8c572ce4e565/61237-22f5_L4TT.gif)
Comparison of the parameters developed here (G) with those of Vihinen et al. (1994; V) and those of Karplus and Schulz (1985; KS). The parameters by each method are shown for two flexible neighbors (red circles), one flexible and one rigid neighbor (green squares), two rigid neighbors (blue triangles), and for the complete data set (magenta diamonds).
![Figure 6.](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16ac/2323876/145ec3c77d99/61237-22f6_L1TT.gif)
Means, over the 290 (solid lines) and 196 (dashed lines) protein chain sets, of the correlation coefficients of the flexibility parameters, in sliding windows of lengths 1 to 13, with the experimental B-factors. (Circles) The parameters developed here, (triangles) those of Vihinen et al. (1994), and (squares) those of Karplus and Schulz (1985).
Similar articles
-
Eyal E, Chennubhotla C, Yang LW, Bahar I. Eyal E, et al. Bioinformatics. 2007 Jul 1;23(13):i175-84. doi: 10.1093/bioinformatics/btm186. Bioinformatics. 2007. PMID: 17646294
-
Protein thermal stability: insights from atomic displacement parameters (B values).
Parthasarathy S, Murthy MR. Parthasarathy S, et al. Protein Eng. 2000 Jan;13(1):9-13. doi: 10.1093/protein/13.1.9. Protein Eng. 2000. PMID: 10679524
-
Schrauber H, Eisenhaber F, Argos P. Schrauber H, et al. J Mol Biol. 1993 Mar 20;230(2):592-612. doi: 10.1006/jmbi.1993.1172. J Mol Biol. 1993. PMID: 8464066
-
Atomic displacement parameters in structural biology.
Carugo O. Carugo O. Amino Acids. 2018 Jul;50(7):775-786. doi: 10.1007/s00726-018-2574-y. Epub 2018 May 11. Amino Acids. 2018. PMID: 29752562 Review.
Cited by
-
Bavi N, Cortes DM, Cox CD, Rohde PR, Liu W, Deitmer JW, Bavi O, Strop P, Hill AP, Rees D, Corry B, Perozo E, Martinac B. Bavi N, et al. Nat Commun. 2016 Jun 22;7:11984. doi: 10.1038/ncomms11984. Nat Commun. 2016. PMID: 27329693 Free PMC article.
-
Exploring Flexibility and Folding Patterns Throughout Time in Voltage Sensors.
García-Morales A, Balleza D. García-Morales A, et al. J Mol Evol. 2023 Dec;91(6):819-836. doi: 10.1007/s00239-023-10140-1. Epub 2023 Nov 13. J Mol Evol. 2023. PMID: 37955698
-
Library of disordered patterns in 3D protein structures.
Lobanov MY, Furletova EI, Bogatyreva NS, Roytberg MA, Galzitskaya OV. Lobanov MY, et al. PLoS Comput Biol. 2010 Oct 14;6(10):e1000958. doi: 10.1371/journal.pcbi.1000958. PLoS Comput Biol. 2010. PMID: 20976197 Free PMC article.
-
Balleza D, Carrillo E, Gómez-Lagunas F. Balleza D, et al. Pflugers Arch. 2015 Oct;467(10):2069-79. doi: 10.1007/s00424-014-1647-3. Epub 2014 Nov 15. Pflugers Arch. 2015. PMID: 25398373
-
Pang YP. Pang YP. Heliyon. 2016 Sep 20;2(9):e00161. doi: 10.1016/j.heliyon.2016.e00161. eCollection 2016 Sep. Heliyon. 2016. PMID: 27699282 Free PMC article.
References
-
- Altman, R.B., Hughes, C., Zhao, D., and Jardetsky, O. 1994. Compositional characteristics of relatively disordered regions in proteins. Prot. Pept. Lett. 1 120–127.
-
- Altschul, S.F. and Erickson, B.W. 1988. Significance levels for biological sequence comparison using non-linear similarity functions. Bull. Math. Biol. 50 77–92. - PubMed
-
- Altschul, S.F., Gish, W., Miller, W., Myers, E.W., and Lipman, D.J. 1990. Basic local alignment search tool. J. Mol. Biol. 215 403–410. - PubMed
-
- Box, G.E.P., Jenkins, G.M., and Reinsel, G.C. 1994. Time series analysis. Forecasting and control, 3rd ed. Prentice Hall, Englewood Cliffs, NJ.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources