Detecting homology of distantly related proteins with consensus sequences - PubMed
- ️Thu Jan 01 1987
Comparative Study
. 1987 Dec 20;198(4):567-77.
doi: 10.1016/0022-2836(87)90200-2.
Affiliations
- PMID: 3430622
- DOI: 10.1016/0022-2836(87)90200-2
Comparative Study
Detecting homology of distantly related proteins with consensus sequences
L Patthy. J Mol Biol. 1987.
Abstract
A simple protocol is described that is suitable for the detection of distantly related members of a protein family. In this procedure, similarity to a consensus sequence is used to distinguish chance similarity from similarity due to common ancestry. The consensus sequence is constructed from the sequences of established members of a protein family and it incorporates features characteristic of the protein fold of this family: conserved residues, the pattern of variable and conserved segments, preferred location of gaps etc. The database is searched with the consensus sequence, using the unitary matrix or log odds matrix for scoring the alignments, with variable gap penalty. The advantage of the method is that it weights key residues, ignores sequence similarity in variable segments (thus partially eliminating "background noise" coming from chance similarity), distinguishes gaps disrupting conserved segments from those occurring in positions known to be tolerant of gap events. The utility of the method was demonstrated in the case of the protein family homologous with the internal repeats of complement B as well as the internal repeats identified in fibroblast proteoglycan PG40. The consensus sequence method succeeded in finding some new members of these protein families that could not be detected by earlier methods of sequence comparison.
Similar articles
-
Prediction of surface loops of protein-folds from multiple alignments of homologous sequences.
Patthy L. Patthy L. Acta Biochim Biophys Hung. 1989;24(1-2):3-13. Acta Biochim Biophys Hung. 1989. PMID: 2481916
-
A symmetric-iterated multiple alignment of protein sequences.
Brocchieri L, Karlin S. Brocchieri L, et al. J Mol Biol. 1998 Feb 13;276(1):249-64. doi: 10.1006/jmbi.1997.1527. J Mol Biol. 1998. PMID: 9514731
-
Local multiple alignment by consensus matrix.
Alexandrov NN. Alexandrov NN. Comput Appl Biosci. 1992 Aug;8(4):339-45. doi: 10.1093/bioinformatics/8.4.339. Comput Appl Biosci. 1992. PMID: 1498689
-
Rapid and sensitive sequence comparison with FASTP and FASTA.
Pearson WR. Pearson WR. Methods Enzymol. 1990;183:63-98. doi: 10.1016/0076-6879(90)83007-v. Methods Enzymol. 1990. PMID: 2156132
-
Kuan J, Saier MH Jr. Kuan J, et al. Crit Rev Biochem Mol Biol. 1993;28(3):209-33. doi: 10.3109/10409239309086795. Crit Rev Biochem Mol Biol. 1993. PMID: 8325039 Review.
Cited by
-
Miguel Llinás and the Structure of the Kringle Fold.
Patthy L. Patthy L. Protein J. 2021 Aug;40(4):450-453. doi: 10.1007/s10930-021-09981-w. Epub 2021 Mar 31. Protein J. 2021. PMID: 33791899 Free PMC article. No abstract available.
-
A large family of bacterial activator proteins.
Henikoff S, Haughn GW, Calvo JM, Wallace JC. Henikoff S, et al. Proc Natl Acad Sci U S A. 1988 Sep;85(18):6602-6. doi: 10.1073/pnas.85.18.6602. Proc Natl Acad Sci U S A. 1988. PMID: 3413113 Free PMC article.
-
Powerful fusion: PSI-BLAST and consensus sequences.
Przybylski D, Rost B. Przybylski D, et al. Bioinformatics. 2008 Sep 15;24(18):1987-93. doi: 10.1093/bioinformatics/btn384. Epub 2008 Aug 4. Bioinformatics. 2008. PMID: 18678588 Free PMC article.
-
A protein alignment scoring system sensitive at all evolutionary distances.
Altschul SF. Altschul SF. J Mol Evol. 1993 Mar;36(3):290-300. doi: 10.1007/BF00160485. J Mol Evol. 1993. PMID: 8483166
-
Ribosome-binding protein p34 is a member of the leucine-rich-repeat-protein superfamily.
Ohsumi T, Ichimura T, Sugano H, Omata S, Isobe T, Kuwano R. Ohsumi T, et al. Biochem J. 1993 Sep 1;294 ( Pt 2)(Pt 2):465-72. doi: 10.1042/bj2940465. Biochem J. 1993. PMID: 7690545 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous