Prediction of ubiquitination sites by using the composition of k-spaced amino acid pairs
Zhen Chen et al. PLoS One. 2011.
Abstract
As one of the most important reversible protein post-translational modifications, ubiquitination has been reported to be involved in many biological processes and closely implicated in various diseases. To fully decipher the molecular mechanisms of ubiquitination-related biological processes, an initial but crucial step is the recognition of ubiquitylated substrates and the corresponding ubiquitination sites. Here, a new bioinformatics tool named CKSAAP_UbSite was developed to predict ubiquitination sites from protein sequences. The key feature of CKSAAP_UbSite is that it uses the composition of k-spaced amino acid pairs surrounding a query site (i.e., any lysine in a query sequence) as input to a Support Vector Machine (SVM). When trained and tested on the dataset of yeast ubiquitination sites (Radivojac et al., Proteins, 2010, 78: 365-380), a 100-fold cross-validation on a 1:1 ratio of positive and negative samples revealed that the accuracy and Matthews correlation coefficient (MCC) of CKSAAP_UbSite reached 73.40% and 0.4694, respectively. CKSAAP_UbSite has also been extensively benchmarked and exhibits better performance than some existing predictors, suggesting that it can serve as a useful tool for the community. Currently, CKSAAP_UbSite is freely accessible at http://protein.cau.edu.cn/cksaap_ubsite/. Moreover, we also found that the sequence patterns around ubiquitination sites are not conserved across different species. To ensure a reasonable prediction performance, the application of the current CKSAAP_UbSite should be limited to the proteome of yeast.
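To make the encoding idea concrete, the following is a minimal sketch of how the composition of k-spaced amino acid pairs around a lysine might be computed. It is not the authors' implementation. The abstract does not state the window size, the range of k, or how sequence ends are handled, so the `flank` and `k_max` parameters, the `X` padding character, the normalisation by the number of pairs in the window, and the function names `extract_window` and `cksaap` are all assumptions made for illustration.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# All 400 ordered pairs of the 20 standard amino acids, in a fixed order.
PAIRS = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]


def extract_window(sequence, site, flank):
    """Return the residues within `flank` positions of 0-based index `site`.

    Positions beyond either end of the sequence are padded with 'X'
    (an assumed convention, not taken from the abstract).
    """
    if sequence[site] != "K":
        raise ValueError("query site must be a lysine (K)")
    left = sequence[max(0, site - flank):site]
    right = sequence[site + 1:site + 1 + flank]
    return left.rjust(flank, "X") + "K" + right.ljust(flank, "X")


def cksaap(window, k_max):
    """Encode a window as pair compositions for k = 0..k_max.

    For each k, every pair of residues separated by exactly k other
    residues is counted, and the counts are divided by the number of
    such pairs in the window. The result has 400 * (k_max + 1) values.
    """
    features = []
    for k in range(k_max + 1):
        counts = dict.fromkeys(PAIRS, 0)
        total = len(window) - k - 1  # number of k-spaced pairs in the window
        for i in range(total):
            pair = window[i] + window[i + k + 1]
            if pair in counts:  # pairs that include padding are skipped
                counts[pair] += 1
        features.extend(counts[p] / total for p in PAIRS)
    return features


# Hypothetical toy sequence; real windows would be considerably longer.
window = extract_window("MKAAK", site=1, flank=2)
features = cksaap(window, k_max=1)
print(window, len(features))
```

Each lysine in a query sequence would yield one such vector, which could then be passed to an SVM or any other classifier that accepts fixed-length numeric input.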
Conflict of interest statement
Competing Interests: The authors have declared that no competing interests exist.
Figures
![Figure 1](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e77/3146527/759a44d9a2dc/pone.0022930.g001.gif)
The performance of CKSAAP_UbSite and the binary encoding scheme was assessed through a 100-fold cross-validation strategy.
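For readers unfamiliar with the two reported metrics, here is a short sketch of how accuracy and MCC are computed from the confusion-matrix counts of one cross-validation run. The counts in the usage lines are invented for illustration and are not taken from the paper, and the function names are hypothetical.

```python
import math


def accuracy(tp, tn, fp, fn):
    """Fraction of all samples that were classified correctly."""
    return (tp + tn) / (tp + tn + fp + fn)


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient, ranging from -1 to 1.

    Returns 0.0 when any marginal total is zero, a common convention
    for the otherwise undefined case.
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0


# Made-up counts for a balanced (1:1) test set of 20 samples.
print(accuracy(tp=8, tn=7, fp=3, fn=2))
print(mcc(tp=8, tn=7, fp=3, fn=2))
```

On a balanced set such as the 1:1 sampling described in the abstract, accuracy is a fair summary by itself; MCC additionally accounts for how errors are split between false positives and false negatives.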
![Figure 2](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e77/3146527/206a5e2de799/pone.0022930.g002.gif)
The composition of each residue pair is represented by a radial vector whose length is proportional to the composition concerned.
![Figure 3](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e77/3146527/128b70199546/pone.0022930.g003.gif)
These two logos were prepared using the web server http://www.twosamplelogo.org/ and only residues significantly enriched and depleted surrounding ubiquitination sites (t-test, P<0.05) are shown.
![Figure 4](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e77/3146527/9b1671df72cb/pone.0022930.g004.gif)