Protein identification with N and C-terminal sequence tags in proteome projects - PubMed
- ️Thu Jan 01 1998
. 1998 May 8;278(3):599-608.
doi: 10.1006/jmbi.1998.1726.
E Gasteiger, L Tonella, K Ou, M Tyler, J C Sanchez, A A Gooley, B J Walsh, A Bairoch, R D Appel, K L Williams, D F Hochstrasser
Affiliations
- PMID: 9600841
- DOI: 10.1006/jmbi.1998.1726
Protein identification with N and C-terminal sequence tags in proteome projects
M R Wilkins et al. J Mol Biol. 1998.
Abstract
Genome sequences are available for increasing numbers of organisms. The proteomes (protein complement expressed by the genome) of many such organisms are being studied with two-dimensional (2D) gel electrophoresis. Here we have investigated the application of short N-terminal and C-terminal sequence tags to the identification of proteins separated on 2D gels. The theoretical N and C termini of 15, 519 proteins, representing all SWISS-PROT entries for the organisms Mycoplasma genitalium, Bacillus subtilis, Escherichia coli, Saccharomyces cerevisiae and human, were analysed. Sequence tags were found to be surprisingly specific, with N-terminal tags of four amino acid residues found to be unique for between 43% and 83% of proteins, and C-terminal tags of four amino acid residues unique for between 74% and 97% of proteins, depending on the species studied. Sequence tags of five amino acid residues were found to be even more specific. To utilise this specificity of sequence tags for protein identification, we created a world-wide web-accessible protein identification program, TagIdent (http://www.expasy.ch/www/tools.html), which matches sequence tags of up to six amino acid residues as well as estimated protein pI and mass against proteins in the SWISS-PROT database. We demonstrate the utility of this identification approach with sequence tags generated from 91 different E. coli proteins purified by 2D gel electrophoresis. Fifty-one proteins were unambiguously identified by virtue of their sequence tags and estimated pI and mass, and a further 11 proteins identified when sequence tags were combined with protein amino acid composition data. We conlcude that the TagIdent identification approach is best suited to the identification of proteins from prokaryotes whose complete genome sequences are available. The approach is less well suited to proteins from eukaryotes, as many eukaryotic proteins are not amenable to sequencing via Edman degradation, and tag protein identification cannot be unambiguous unless an organism's complete sequence is available.
Copyright 1998 Academic Press Limited.
Similar articles
-
Wilkins MR, Gasteiger E, Wheeler CH, Lindskog I, Sanchez JC, Bairoch A, Appel RD, Dunn MJ, Hochstrasser DF. Wilkins MR, et al. Electrophoresis. 1998 Dec;19(18):3199-206. doi: 10.1002/elps.1150191824. Electrophoresis. 1998. PMID: 9932815
-
Rapid protein identification using N-terminal "sequence tag" and amino acid analysis.
Wilkins MR, Ou K, Appel RD, Sanchez JC, Yan JX, Golaz O, Farnsworth V, Cartier P, Hochstrasser DF, Williams KL, Gooley AA. Wilkins MR, et al. Biochem Biophys Res Commun. 1996 Apr 25;221(3):609-13. doi: 10.1006/bbrc.1996.0643. Biochem Biophys Res Commun. 1996. PMID: 8630008
-
'98 Escherichia coli SWISS-2DPAGE database update.
Tonella L, Walsh BJ, Sanchez JC, Ou K, Wilkins MR, Tyler M, Frutiger S, Gooley AA, Pescaru I, Appel RD, Yan JX, Bairoch A, Hoogland C, Morch FS, Hughes GJ, Williams KL, Hochstrasser DF. Tonella L, et al. Electrophoresis. 1998 Aug;19(11):1960-71. doi: 10.1002/elps.1150191114. Electrophoresis. 1998. PMID: 9740056
-
Comprehensive mass spectrometric analysis of the 20S proteasome complex.
Huang L, Burlingame AL. Huang L, et al. Methods Enzymol. 2005;405:187-236. doi: 10.1016/S0076-6879(05)05009-3. Methods Enzymol. 2005. PMID: 16413316 Review.
-
Link AJ, Robison K, Church GM. Link AJ, et al. Electrophoresis. 1997 Aug;18(8):1259-313. doi: 10.1002/elps.1150180807. Electrophoresis. 1997. PMID: 9298646 Review.
Cited by
-
Proteome evaluation of human cystic echinococcosis sera using two dimensional gel electrophoresis.
Sadjjadi FS, Rezaie-Tavirani M, Ahmadi NA, Sadjjadi SM, Zali H. Sadjjadi FS, et al. Gastroenterol Hepatol Bed Bench. 2018 Winter;11(1):75-82. Gastroenterol Hepatol Bed Bench. 2018. PMID: 29564069 Free PMC article.
-
Novel network biomarkers profile based coronary artery disease risk stratification in Asian Indians.
Vangala RK, Ravindran V, Kamath K, Rao VS, Sridhara H. Vangala RK, et al. Adv Biomed Res. 2013 Jul 30;2:59. doi: 10.4103/2277-9175.115805. eCollection 2013. Adv Biomed Res. 2013. PMID: 24223374 Free PMC article.
-
Kikuchi J, Furukawa Y, Hayashi N. Kikuchi J, et al. Mol Biotechnol. 2003 Mar;23(3):203-12. doi: 10.1385/MB:23:3:203. Mol Biotechnol. 2003. PMID: 12665691
-
Leng J, Wang H, Zhang L, Zhang J, Wang H, Cai T, Yao J, Guo Y. Leng J, et al. J Am Soc Mass Spectrom. 2011 Jul;22(7):1204-13. doi: 10.1007/s13361-011-0129-5. Epub 2011 Apr 15. J Am Soc Mass Spectrom. 2011. PMID: 21953103
-
Dai Y, Shortreed MR, Scalf M, Frey BL, Cesnik AJ, Solntsev S, Schaffer LV, Smith LM. Dai Y, et al. J Proteome Res. 2017 Nov 3;16(11):4156-4165. doi: 10.1021/acs.jproteome.7b00516. J Proteome Res. 2017. PMID: 28968100 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases
Research Materials
Miscellaneous