A short ORF-encoded transcriptional regulator - PubMed
- ️Fri Jan 01 2021
. 2021 Jan 26;118(4):e2021943118.
doi: 10.1073/pnas.2021943118.
Insha Ahmad 1 , Yeonjin Ko 1 , Yuxiang Zhang 1 , Thomas F Martinez 3 , Jolene K Diedrich 3 , Qian Chu 3 , James J Moresco 3 , Michael A Erb 1 , Alan Saghatelian 4 , Peter G Schultz 5 , Michael J Bollong 5
Affiliations
- PMID: 33468658
- PMCID: PMC7848545
- DOI: 10.1073/pnas.2021943118
A short ORF-encoded transcriptional regulator
Minseob Koh et al. Proc Natl Acad Sci U S A. 2021.
Abstract
Recent technological advances have expanded the annotated protein coding content of mammalian genomes, as hundreds of previously unidentified, short open reading frame (ORF)-encoded peptides (SEPs) have now been found to be translated. Although several studies have identified important physiological roles for this emerging protein class, a general method to define their interactomes is lacking. Here, we demonstrate that genetic incorporation of the photo-crosslinking noncanonical amino acid AbK into SEP transgenes allows for the facile identification of SEP cellular interaction partners using affinity-based methods. From a survey of seven SEPs, we report the discovery of short ORF-encoded histone binding protein (SEHBP), a conserved microprotein that interacts with chromatin-associated proteins, localizes to discrete genomic loci, and induces a robust transcriptional program when overexpressed in human cells. This work affords a straightforward method to help define the physiological roles of SEPs and demonstrates its utility by identifying SEHBP as a short ORF-encoded transcription factor.
Keywords: expanded genetic code; photo-crosslinking; short open reading frame-encoded peptide; transcriptional regulation.
Conflict of interest statement
The authors declare no competing interest.
Figures

A genetically encoded photo-crosslinker identifies cellular interactors of unexplored SEPs. (A) Schematic depicting the strategy used in this work to introduce the photo-crosslinking amino acid AbK (structure within inset circle) into SEP transgenes and identify their protein targets in cells. (B) Heatmap of the relative spectral count enrichment corresponding to protein preys identified from AP-MS experiments with the indicated SEP baits in the presence or absence of UV (mean of three biological replicates).

SEHBP is a translated 5′ ORF that is conserved in mammals. (A) Schematic depicting the position of the SEHBP and ZNF689 ORFs within transcript NM_138447. (B) Multiple sequence alignment depicting the sequence (Top), conservation (Middle), and consensus (Bottom) of the SEHBP amino acid sequence across the indicated mammalian species. (C) Representative A-site plots (Ribo-seq) from the indicated cell lines with the ORF of SEHBP shown in yellow.

SEHBP interacts with Histone H2B in cells. (A) Relative spectral counts of the indicated anti-FLAG immunoprecipitated proteins after 48-h expression of SEHBP-AbK-FLAG in HEK293T cells followed by exposure to UV (n = 3; mean ± SD). (B) Representative confocal microscopy-derived images of SEHBP-eGFP localization (green) from HEK293T cells after 48 h of SEHBP-eGFP expression followed by fixation and exposure to Hoechst 33342. (C) Representative Western blotting analysis for the indicated proteins after 48-h expression of SEHBP-FLAG followed by subcellular fractionization. (D) Western blotting for endogenous H2B content after anti-FLAG immunoprecipitation from HEK293T cells expressing SEHBP-FLAG. (E) Western blotting analysis for SEBHP-FLAG content after anti-H2B immunoprecipitation from HEK293T cells expressing SEHBP-FLAG. (F) Sensorgram plot from biolayer interferometry experiments demonstrating the interaction between recombinant H2B and immobilized SEHBP.

SEHBP modulates transcription at distinct loci. (A) Rank-ordered heat map depicting the log2-fold change of mRNA levels in response to SEHBP overexpression relative to a vector control in RNA-seq experiments using HEK293T cells (three biological replicates). (B) Pie chart depicting the percentage of active transcripts significantly changed by SEHBP overexpression (P < 0.05). (C) Histogram comparing the normalized enrichment scores of all MSigDB gene sets using GSEA of RNA-seq data in response to SEHBP overexpression. Plots depicting the P values of representative up-regulated (D) and down-regulated (E) gene sets from RNA-seq analysis of HEK293T cells expressing SEHBP. (F) Rank-ordered heat map of ChIP-seq derived reads of SEHBP-based enrichment at 8,900 distinct loci (Left) and 1,786 TSSs (Right).
Similar articles
-
Photo-lysine captures proteins that bind lysine post-translational modifications.
Yang T, Li XM, Bao X, Fung YM, Li XD. Yang T, et al. Nat Chem Biol. 2016 Feb;12(2):70-2. doi: 10.1038/nchembio.1990. Epub 2015 Dec 21. Nat Chem Biol. 2016. PMID: 26689789
-
Improved Identification and Analysis of Small Open Reading Frame Encoded Polypeptides.
Ma J, Diedrich JK, Jungreis I, Donaldson C, Vaughan J, Kellis M, Yates JR 3rd, Saghatelian A. Ma J, et al. Anal Chem. 2016 Apr 5;88(7):3967-75. doi: 10.1021/acs.analchem.6b00191. Epub 2016 Mar 24. Anal Chem. 2016. PMID: 27010111 Free PMC article.
-
Identification of phosphorylated small ORF-encoded peptides in Hep3B cells by LC/MS/MS.
Peng M, Zhou Y, Wan C. Peng M, et al. J Proteomics. 2024 Jul 15;303:105214. doi: 10.1016/j.jprot.2024.105214. Epub 2024 May 31. J Proteomics. 2024. PMID: 38823442
-
Identification and characterization of sORF-encoded polypeptides.
Chu Q, Ma J, Saghatelian A. Chu Q, et al. Crit Rev Biochem Mol Biol. 2015 Mar-Apr;50(2):134-41. doi: 10.3109/10409238.2015.1016215. Epub 2015 Apr 10. Crit Rev Biochem Mol Biol. 2015. PMID: 25857697 Free PMC article. Review.
-
Small Open Reading Frame-Encoded Micro-Peptides: An Emerging Protein World.
Dong X, Zhang K, Xun C, Chu T, Liang S, Zeng Y, Liu Z. Dong X, et al. Int J Mol Sci. 2023 Jun 23;24(13):10562. doi: 10.3390/ijms241310562. Int J Mol Sci. 2023. PMID: 37445739 Free PMC article. Review.
Cited by
-
Discovering microproteins: making the most of ribosome profiling data.
Chothani S, Ho L, Schafer S, Rackham O. Chothani S, et al. RNA Biol. 2023 Jan;20(1):943-954. doi: 10.1080/15476286.2023.2279845. Epub 2023 Nov 27. RNA Biol. 2023. PMID: 38013207 Free PMC article. Review.
-
Szafron LA, Iwanicka-Nowicka R, Podgorska A, Bonna AM, Sobiczewski P, Kupryjanczyk J, Szafron LM. Szafron LA, et al. Int J Mol Sci. 2024 Jul 9;25(14):7531. doi: 10.3390/ijms25147531. Int J Mol Sci. 2024. PMID: 39062774 Free PMC article.
-
Chen Y, Cao X, Loh KH, Slavoff SA. Chen Y, et al. Biochem Soc Trans. 2023 Jun 28;51(3):1071-1082. doi: 10.1042/BST20221074. Biochem Soc Trans. 2023. PMID: 37171061 Free PMC article. Review.
-
Detection of host cell microprotein impurities in antibody drug products.
Tzani I, Castro-Rivadeneyra M, Kelly P, Strasser L, Zhang L, Clynes M, Karger BL, Barron N, Bones J, Clarke C. Tzani I, et al. Nat Commun. 2024 Oct 4;15(1):8605. doi: 10.1038/s41467-024-51870-0. Nat Commun. 2024. PMID: 39366928 Free PMC article.
-
Tollis S, Singh J, Palou R, Thattikota Y, Ghazal G, Coulombe-Huntington J, Tang X, Moore S, Blake D, Bonneil E, Royer CA, Thibault P, Tyers M. Tollis S, et al. PLoS Biol. 2022 Mar 3;20(3):e3001548. doi: 10.1371/journal.pbio.3001548. eCollection 2022 Mar. PLoS Biol. 2022. PMID: 35239649 Free PMC article.
References
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases