pubmed.ncbi.nlm.nih.gov

The 3D profile method for identifying fibril-forming segments of proteins - PubMed

  • ️Sun Jan 01 2006

The 3D profile method for identifying fibril-forming segments of proteins

Michael J Thompson et al. Proc Natl Acad Sci U S A. 2006.

Abstract

Based on the crystal structure of the cross-beta spine formed by the peptide NNQQNY, we have developed a computational approach for identifying those segments of amyloidogenic proteins that themselves can form amyloid-like fibrils. The approach builds on experiments showing that hexapeptides are sufficient for forming amyloid-like fibrils. Each six-residue peptide of a protein of interest is mapped onto an ensemble of templates, or 3D profile, generated from the crystal structure of the peptide NNQQNY by small displacements of one of the two intermeshed beta-sheets relative to the other. The energy of each mapping of a sequence to the profile is evaluated by using ROSETTADESIGN, and the lowest energy match for a given peptide to the template library is taken as the putative prediction. If the energy of the putative prediction is lower than a threshold value, a prediction of fibril formation is made. This method can reach an accuracy of approximately 80% with a P value of approximately 10(-12) when a conservative energy threshold is used to separate peptides that form fibrils from those that do not. We see enrichment for positive predictions in a set of fibril-forming segments of amyloid proteins, and we illustrate the method with applications to proteins of interest in amyloid research.

PubMed Disclaimer

Conflict of interest statement

Conflict of interest statement: No conflicts declared.

Figures

Fig. 1.
Fig. 1.

Schematic representation of the 3D profile method with

rosettadesign

energy for detecting fibril-forming segments. From the crystal structure of the NNQQNY peptide (Upper Left), a set of near-native templates is generated by translations of one of the two β-sheets relative to the other, along three orthogonal directions, as shown (Center). A sequence of interest (Lower Left) is scanned by sliding a window of six residues and mapping each peptide onto the templates in the ensemble. Each mapping of sequence to template is evaluated energetically with

rosettadesign

. Finally, a putative prediction is made by taking the best-scoring (lowest energy) fit between peptide and template (Right). The putative prediction is accepted as a prediction if its energy is lower than the threshold energy.

Fig. 2.
Fig. 2.

Prediction performance of the 3D profile method with

rosettadesign

energy for predicting fibril-forming sequences from the AmylHex database, shown as receiver-operator characteristic curves. The percentage of correct predictions is shown as a function of percentage of wrong predictions, as the energy threshold is raised from a very low value (good energetic fit) to a very high (poor energetic fit). The curve with circles shows the fit of hexapeptides to the NNQQNY crystal structure; the curve with squares shows the fit by using the entire near-native ensemble (variations of the NNQQNY structure). The diagonal line shows how a random predictor would perform. The curve plotted with triangles (read off the right y axis) traces the probability that the results at each point on the curve plotted with squares could have been obtained by chance. The two minima of the probability curve are indicated by the black and gray lines.

Fig. 3.
Fig. 3.

Enrichment of fibril-forming sequences predicted in the AmylFrag database of sequences compared with a reference set of sequences. The curves plotted with squares and circles show the fractions of predictions with energies below the threshold energy on the x axis obtained for the AmylFrag and control sets of peptides, respectively. The triangle curve (read off the right y axis) gives the ratio of these two fractions, Fa for AmylFrag and Fb for background, as a measure of enrichment. Energy thresholds <−19.5 do not provide enrichment, whereas an energy threshold of −25.5 provides substantial enrichment.

Fig. 4.
Fig. 4.

Application of the 3D profile method with

rosettadesign

energy for detecting fibril-forming segments of proteins known to form fibrils. (a) Lysozyme. The vertical bars represent the lowest energy template matches for each hexapeptide, with the black bars indicating a known fibril-forming segment of the protein (21). Notice that the hexamer predicted to have the lowest energy is within the known fibril-forming segment. Gray and black horizontal lines indicate the permissive and conservative thresholds, respectively, taken from Fig. 1. (b) Myoglobin. The gray vertical bars represent the lowest energy template matches for segments of the protein where there are no experimental data. The black bars indicate the segment of the protein that forms fibrils in isolation (23), and these include the segments with the lowest energies. Gray and black horizontal lines indicate the permissive and conservative thresholds, respectively. (c) Abeta(1–42). The vertical bars represent the lowest energy template matches for each hexapeptide with those colored gray representing residue positions for which there is no experimental data. Black and gray horizontal lines indicate the permissive and conservative thresholds, respectively. Black vertical bars indicate those segments of the peptide for which there is experimental evidence of fibril formation or ordered β-structure. The experimental evidence for fibril formation of various segments is shown above the plot with the hatched boxes representing NMR/EPR (6, 27, 28), the cross-hatched box representing proline-scanning mutagenesis (29), and the solid lines representing positive fibrillization assays (26, 30). Notice that the lowest energy segment is the C-terminal segment known to be important for fibril formation. (d) Tau. The vertical bars represent the lowest energy predictions for each hexapeptide. Gray bars represent positions where there is no experimental data. Black bars indicate the segment of the protein (PHF43) implicated in aggregation, including the position of the known amyloidogenic peptide VQIVYK (4). Notice that this segment is one of the lowest energy segments. Gray and black horizontal lines indicate the permissive and conservative thresholds, respectively.

Similar articles

Cited by

References

    1. Guijarro J. I., Sunde M., Jones J. A., Campbell I. D., Dobson C. M. Proc. Natl. Acad. Sci. USA. 1998;95:4224–4228. - PMC - PubMed
    1. Chiti F., Webster P., Taddei N., Clark A., Stefani M., Ramponi G., Dobson C. M. Proc. Natl. Acad. Sci. USA. 1999;96:3590–3594. - PMC - PubMed
    1. Fandrich M., Fletcher M. A., Dobson C. M. Nature. 2001;410:165–166. - PubMed
    1. von Bergen M., Friedhoff P., Biernat J., Heberle J., Mandelkow E. M., Mandelkow E. Proc. Natl. Acad. Sci. USA. 2000;97:5129–5134. - PMC - PubMed
    1. Tenidis K., Waldner M., Bernhagen J., Fischle W., Bergmann M., Weber M., Merkle M. L., Voelter W., Brunner H., Kapurniotu A. J. Mol. Biol. 2000;295:1055–1071. - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources