Small Open Reading Frames, How to Find Them and Determine Their Function - PubMed
- ️Sat Jan 01 2022
Review
Small Open Reading Frames, How to Find Them and Determine Their Function
Preeti Madhav Kute et al. Front Genet. 2022.
Abstract
Advances in genomics and molecular biology have revealed an abundance of small open reading frames (sORFs) across all types of transcripts. While these sORFs are often assumed to be non-functional, many have been implicated in physiological functions and a significant number of sORFs have been described in human diseases. Thus, sORFs may represent a hidden repository of functional elements that could serve as therapeutic targets. Unlike protein-coding genes, it is not necessarily the encoded peptide of an sORF that enacts its function, sometimes simply the act of translating an sORF might have a regulatory role. Indeed, the most studied sORFs are located in the 5'UTRs of coding transcripts and can have a regulatory impact on the translation of the downstream protein-coding sequence. However, sORFs have also been abundantly identified in non-coding RNAs including lncRNAs, circular RNAs and ribosomal RNAs suggesting that sORFs may be diverse in function. Of the many different experimental methods used to discover sORFs, the most commonly used are ribosome profiling and mass spectrometry. These can confirm interactions between transcripts and ribosomes and the production of a peptide, respectively. Extensions to ribosome profiling, which also capture scanning ribosomes, have further made it possible to see how sORFs impact the translation initiation of mRNAs. While high-throughput techniques have made the identification of sORFs less difficult, defining their function, if any, is typically more challenging. Together, the abundance and potential function of many of these sORFs argues for the necessity of including sORFs in gene annotations and systematically characterizing these to understand their potential functional roles. In this review, we will focus on the high-throughput methods used in the detection and characterization of sORFs and discuss techniques for validation and functional characterization.
Keywords: SEPs; computational tools; mass spectrometry; ribosome profiling; sORFs.
Copyright © 2022 Kute, Soukarieh, Tjeldnes, Trégouët and Valen.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures

Examples of small ORFs in coding (A) and non-coding (B) transcripts. Start and Stop indicate the initiation and termination sites of the coding sequence (CDS). uORF, upstream open reading frame fully located in the 5′UTR; uStart, upstream start site; uStop, upstream stop site; uoORF, upstream overlapping open reading frame; intStart, internal start site; intORF, internal open reading frame; intStop, internal stop site; dStart, downstream Start site; dStop, downstream stop site; sORF, small open reading frame; lncRNA, long non-coding RNA; circRNA, circular RNA.

Overview of the commonly used techniques to identify and characterize sORFs and their encoded peptides. Novel sORFs and their products can be detected by the prediction algorithms using bioinformatic approaches, by generating peptide databases using improved mass spectrometry-based assays and by using ribosome profiling and related sequencing techniques to obtain translationally active transcripts. The predicted SEPs can be validated by various assays such as reporter-based overexpression, epitope tagging etc. Loss of function assays could be done to assess the cellular function of these SEPs.

Profiling and sequencing of translating transcripts. A254 profiles shown before (A) and after digestion with ribonucleases (B,C). The fractions used for further processing are highlighted, polysomes in purple, 80S in orange and 40S in green. (D) The process of library preparation for next generation sequencing. Size selection of ∼30 nt is done for ribosome profiling and ribosome complex profiling sequencing and libraries are prepared from the size selected small RNAs, whereas for polysome profiling, libraries are prepared from total RNA. Meta-coverage shown for reads obtained from polysome profiling sequencing (E), for ribosome profiling (F) and for ribosome complex profiling [(G) top: 40S, bottom: 80S].
Similar articles
-
Mining for missed sORF-encoded peptides.
Yin X, Jing Y, Xu H. Yin X, et al. Expert Rev Proteomics. 2019 Mar;16(3):257-266. doi: 10.1080/14789450.2019.1571919. Epub 2019 Feb 13. Expert Rev Proteomics. 2019. PMID: 30669886 Review.
-
Olexiouk V, Menschaert G. Olexiouk V, et al. Curr Protoc Bioinformatics. 2019 Mar;65(1):e68. doi: 10.1002/cpbi.68. Epub 2018 Nov 28. Curr Protoc Bioinformatics. 2019. PMID: 30485709
-
Perdikopanis N, Giannakakis A, Kavakiotis I, Hatzigeorgiou AG. Perdikopanis N, et al. Biology (Basel). 2024 Jul 26;13(8):563. doi: 10.3390/biology13080563. Biology (Basel). 2024. PMID: 39194501 Free PMC article.
-
Leong AZ, Lee PY, Mohtar MA, Syafruddin SE, Pung YF, Low TY. Leong AZ, et al. J Biomed Sci. 2022 Mar 17;29(1):19. doi: 10.1186/s12929-022-00802-5. J Biomed Sci. 2022. PMID: 35300685 Free PMC article. Review.
-
Laczkovich I, Mangano K, Shao X, Hockenberry AJ, Gao Y, Mankin A, Vázquez-Laslop N, Federle MJ. Laczkovich I, et al. mBio. 2022 Aug 30;13(4):e0124722. doi: 10.1128/mbio.01247-22. Epub 2022 Jul 19. mBio. 2022. PMID: 35852327 Free PMC article.
Cited by
-
Pervasive translation of small open reading frames in plant long non-coding RNAs.
Sruthi KB, Menon A, P A, Vasudevan Soniya E. Sruthi KB, et al. Front Plant Sci. 2022 Oct 24;13:975938. doi: 10.3389/fpls.2022.975938. eCollection 2022. Front Plant Sci. 2022. PMID: 36352887 Free PMC article. Review.
-
Common and Rare 5'UTR Variants Altering Upstream Open Reading Frames in Cardiovascular Genomics.
Soukarieh O, Meguerditchian C, Proust C, Aïssi D, Eyries M, Goyenvalle A, Trégouët DA. Soukarieh O, et al. Front Cardiovasc Med. 2022 Mar 21;9:841032. doi: 10.3389/fcvm.2022.841032. eCollection 2022. Front Cardiovasc Med. 2022. PMID: 35387445 Free PMC article. Review.
-
Noncanonical microprotein regulation of immunity.
Nichols C, Do-Thi VA, Peltier DC. Nichols C, et al. Mol Ther. 2024 Sep 4;32(9):2905-2929. doi: 10.1016/j.ymthe.2024.05.021. Epub 2024 May 11. Mol Ther. 2024. PMID: 38734902 Review.
-
Rovelet-Lecrux A, Bonnevalle A, Quenez O, Delcroix W, Cassinari K, Richard AC, Boland A, Deleuze JF, Goizet C, Rucar A, Verny C, Nguyen K, Lecourtois M, Nicolas G. Rovelet-Lecrux A, et al. Eur J Hum Genet. 2024 Jul;32(7):779-785. doi: 10.1038/s41431-024-01580-4. Epub 2024 Mar 4. Eur J Hum Genet. 2024. PMID: 38433263
-
Quest for Orthologs in the Era of Biodiversity Genomics.
Langschied F, Bordin N, Cosentino S, Fuentes-Palacios D, Glover N, Hiller M, Hu Y, Huerta-Cepas J, Coelho LP, Iwasaki W, Majidian S, Manzano-Morales S, Persson E, Richards TA, Gabaldón T, Sonnhammer E, Thomas PD, Dessimoz C, Ebersberger I. Langschied F, et al. Genome Biol Evol. 2024 Oct 9;16(10):evae224. doi: 10.1093/gbe/evae224. Genome Biol Evol. 2024. PMID: 39404012 Free PMC article. Review.
References
Publication types
LinkOut - more resources
Full Text Sources
Other Literature Sources