Genome-wide search for novel human uORFs and N-terminal protein extensions using ribosomal footprinting
Claudia Fritsch et al. Genome Res. 2012 Nov.
Abstract
So far, the annotation of translation initiation sites (TISs) has been based mostly upon bioinformatics rather than experimental evidence. We adapted ribosomal footprinting to puromycin-treated cells to generate a transcriptome-wide map of TISs in a human monocytic cell line. A neural network was trained on the ribosomal footprints observed at previously annotated AUG translation initiation codons (TICs), and used for the ab initio prediction of TISs in 5062 transcripts with sufficient sequence coverage. Functional interpretation suggested 2994 novel upstream open reading frames (uORFs) in the 5' UTR, 1406 uORFs overlapping with the coding sequence, and 546 N-terminal protein extensions. The TIS detection method was validated on the basis of previously published alternative TISs and uORFs. Among primates, TICs in newly annotated TISs were significantly more conserved than control codons, both for AUGs and near-cognate codons. The transcriptome-wide map of novel candidate TISs derived as part of the study will shed further light on the way in which human proteome diversity is influenced by alternative translation initiation and regulation.
Figures

Enrichment of THP-1 cell ribosomal footprint data for TISs, following puromycin treatment. (A) Polysome profile of the TPP1 gene in control and puromycin-treated THP-1 cells. (B) Pooled read coverage for the 500 most highly expressed genes. Transcript-specific coverage values were normalized to the total number of reads for each gene and the transcript length was scaled to 1000 bp for all RefSeq sequences. (C) Pooled read coverage around the annotated AUG TICs of the 500 most highly expressed genes in puromycin-treated cells.
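The pooling described for panel B can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the function names `scale_coverage` and `pool_coverage` are hypothetical, and the simple integer binning used to rescale each transcript to 1000 positions is an assumption, since the legend does not state how the length scaling was done.

```python
def scale_coverage(coverage, n_bins=1000):
    """Normalize per-position read counts by the transcript total and
    rescale the transcript to n_bins positions (assumed binning scheme)."""
    total = sum(coverage)
    if total == 0:
        return [0.0] * n_bins
    length = len(coverage)
    scaled = [0.0] * n_bins
    for pos, count in enumerate(coverage):
        # Map the original position onto the scaled coordinate system.
        bin_index = min(pos * n_bins // length, n_bins - 1)
        scaled[bin_index] += count / total
    return scaled


def pool_coverage(transcripts, n_bins=1000):
    """Sum the normalized, length-scaled coverage over several transcripts."""
    pooled = [0.0] * n_bins
    for coverage in transcripts:
        for i, value in enumerate(scale_coverage(coverage, n_bins)):
            pooled[i] += value
    return pooled
```

Because each transcript is divided by its own read total before pooling, highly expressed genes do not dominate the pooled profile.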

Algorithm used for the functional classification of neural network-predicted TISs as either “annotated TIS,” “N-terminal protein extension,” “upstream ORF (uORF),” or “CDS-overlapping uORF.” The respective AUG or non-cognate codon was searched for in a ±3-bp window around the merged positive TIS signal emitted by the neural network. For each transcript, the depicted algorithm is applied until no further network-predicted TISs are available for classification.
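A rough sketch of this kind of classification is given below. It is not the algorithm from the figure, whose exact decision rules are not spelled out in the legend. The assumptions are: sequences use the DNA alphabet; the accepted codons are AUG plus its nine single-nucleotide variants (commonly called near-cognate codons); AUG is preferred over other codons within the window, with distance to the signal as tie-break; and the category is decided from the reading frame and the first in-frame stop codon relative to the annotated CDS start. All function names are hypothetical.

```python
# Assumed codon set: ATG plus its nine single-nucleotide variants.
START_CODONS = {"ATG", "CTG", "GTG", "TTG", "ACG", "AAG", "AGG", "ATA", "ATC", "ATT"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_start_codon(seq, signal_pos, window=3):
    """Return the position of a start codon within +/- window of the signal, or None."""
    candidates = []
    first = max(0, signal_pos - window)
    last = min(len(seq) - 3, signal_pos + window)
    for pos in range(first, last + 1):
        codon = seq[pos:pos + 3]
        if codon in START_CODONS:
            # Assumed tie-break: ATG first, then the smallest distance to the signal.
            candidates.append((codon != "ATG", abs(pos - signal_pos), pos))
    return min(candidates)[2] if candidates else None


def first_stop(seq, start):
    """Return the position of the first in-frame stop codon after start, or None."""
    for pos in range(start, len(seq) - 2, 3):
        if seq[pos:pos + 3] in STOP_CODONS:
            return pos
    return None


def classify_tis(seq, start, cds_start):
    """Assign a predicted start position to one of the four categories (assumed rules)."""
    if start == cds_start:
        return "annotated TIS"
    if start > cds_start:
        return "unclassified"  # downstream sites are outside the four categories
    stop = first_stop(seq, start)
    if stop is not None and stop + 3 <= cds_start:
        return "uORF"  # ORF terminates within the 5' UTR
    if (cds_start - start) % 3 == 0:
        return "N-terminal protein extension"  # in frame with the CDS, no stop before it
    return "CDS-overlapping uORF"  # out of frame and running into the CDS
```

For example, with `seq = "CCATGAAATAAGGATGGCCTAA"` and an annotated CDS start at position 13, a signal at position 3 resolves to the ATG at position 2, which `classify_tis` labels as a uORF because its stop codon lies upstream of the CDS.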

Codon usage and functional classification of neural network-defined TISs: (A) Distribution of the number of putative TISs per transcript. (B) Functional classification of putative TISs. (C) TIC usage in putative TISs, either including AUG (upper row) or for near-cognate codons only (bottom row). (D) Average codon frequency over all three reading frames in the analyzed 5′ UTRs, considering either all possible codons (left), the 10 TIS-relevant codons identified in our study only (middle), or near-cognate codons only (right).

Gene-based examples for the annotation of TIS for the AMD1 (A) and TOP2A (B) genes in the presented data set: Screenshots from the online resource (the annotation tracks at http://gengastro.1med.uni-kiel.de/suppl/footprint/) are provided. The network-identified TISs are marked in gray and are numbered consecutively along the genome assembly for each RefSeq sequence. The classification results of TIS according to the algorithm in Figure 2 are noted in red. In addition to the markings provided in the online resource, the open reading frames are highlighted with red arrows for uORFs, a blue arrow for the N-terminal protein extension of TOP2A, and green arrows for the annotated CDS of the two genes. (A) Previously known uORF at network-predicted TIS NM_001634:1. An additional internal network-predicted TIS is present in this uORF and was thus not annotated independently as noted in the results. (B) A novel uORF at network-predicted TIS NM_001067:3 and a novel N-terminal protein extension at network-predicted TIS NM_001067:2 are shown. The annotated AUG TISs are detected in both genes.

Primate conservation analysis of TICs at neural network-predicted TISs. For each functional category and codon type, the difference in mean Conservation Score in nine primate species (with 95% confidence interval) is depicted between case and control TICs. For comparison, the difference in mean Conservation Score for the annotated AUG TICs (open box) is also included for each category. Numbers below boxes refer to the number of predicted TIS falling into the respective TIS by TIC category. TICs showing statistically significant conservation after Bonferroni correction (31 tests, P < 0.0016) are marked by an asterisk. Further details of the primate conservation analysis are provided in Supplemental Table 3.
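The significance threshold quoted in the legend is consistent with a standard Bonferroni correction. The short check below assumes a family-wise error rate of 0.05, which the legend does not state explicitly; the function name is hypothetical.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold under a Bonferroni correction."""
    return alpha / n_tests


# Assumed family-wise alpha of 0.05 across the 31 tests named in the legend.
threshold = bonferroni_threshold(0.05, 31)
print(round(threshold, 4))  # 0.0016
```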