Identification of endogenous retroviral reading frames in the human genome - PubMed
- ️Thu Jan 01 2004
Identification of endogenous retroviral reading frames in the human genome
Palle Villesen et al. Retrovirology. 2004.
Abstract
Background: Human endogenous retroviruses (HERVs) comprise a large class of repetitive retroelements. Most HERVs are ancient and invaded our genome at least 25 million years ago, except for the evolutionary young HERV-K group. The far majority of the encoded genes are degenerate due to mutational decay and only a few non-HERV-K loci are known to retain intact reading frames. Additional intact HERV genes may exist, since retroviral reading frames have not been systematically annotated on a genome-wide scale.
Results: By clustering of hits from multiple BLAST searches using known retroviral sequences we have mapped 1.1% of the human genome as retrovirus related. The coding potential of all identified HERV regions were analyzed by annotating viral open reading frames (vORFs) and we report 7836 loci as verified by protein homology criteria. Among 59 intact or almost-intact viral polyproteins scattered around the human genome we have found 29 envelope genes including two novel gammaretroviral types. One encodes a protein similar to a recently discovered zebrafish retrovirus (ZFERV) while another shows partial, C-terminal, homology to Syncytin (HERV-W/FRD).
Conclusions: This compilation of HERV sequences and their coding potential provide a useful tool for pursuing functional analysis such as RNA expression profiling and effects of viral proteins, which may, in turn, reveal a role for HERVs in human health and disease. All data are publicly available through a database at http://www.retrosearch.dk.
Figures

A: Genomic organization of simple retroviruses when present as a provirus (DNA) integrated in the host genome. The regulatory long terminal repeats (LTRs) flank the internal three major genes gag, pol and env. A fourth gene pro is present between gag and pol for some retroviruses, while part of either gag or pol in others. B: Individual BLAST hits (white and yellow boxes) on either strand of the human genome were clustered into HERV regions (blue boxes) or discarded by using a score function. Finally, only HERV regions with at least one retroviral ORF were kept (see Materials and Methods). In the example illustrated HERV ID 5715 was presumably inserted into an existing HERV locus with the opposite orientation. HERV ID 5715 is located in the first intron of the CD48 gene (antisense direction) and is also known as HERV-K18 or IDDMK1,222. C: HERV ID 5715 with graphical vORF annotation. Putative LTR structures are indicated and all ORFs (stop-codon to stop-codon fragments above 62 aa) are mapped and annotated by homology criteria

Number of HERV regions located inside genes, and their orientation relative to the gene. The expected number assumes a random genomic distribution.

Genomic distribution of all Gag (red) and Env (blue) ORFs above 500 aa and Pol (green) ORFs above 700 aa. Right-pointing triangles denote intact ORFs, while left-pointing triangles denote ORFs that are almost-intact besides a single stop codon or frame-shift mutation.
Similar articles
-
Flockerzi A, Burkhardt S, Schempp W, Meese E, Mayer J. Flockerzi A, et al. J Virol. 2005 Mar;79(5):2941-9. doi: 10.1128/JVI.79.5.2941-2949.2005. J Virol. 2005. PMID: 15709013 Free PMC article.
-
Status of HERV in human cells: expression and coding capacity of human proviruses.
Mayer J. Mayer J. Dev Biol (Basel). 2001;106:439-41; discussion 465-75. Dev Biol (Basel). 2001. PMID: 11761259
-
Digital expression profiles of human endogenous retroviral families in normal and cancerous tissues.
Stauffer Y, Theiler G, Sperisen P, Lebedev Y, Jongeneel CV. Stauffer Y, et al. Cancer Immun. 2004 Feb 11;4:2. Cancer Immun. 2004. PMID: 14871062
-
Gholami Barzoki M, Shatizadeh Malekshahi S, Heydarifard Z, Mahmodi MJ, Soltanghoraee H. Gholami Barzoki M, et al. Mol Biol Rep. 2023 Sep;50(9):7901-7907. doi: 10.1007/s11033-023-08658-0. Epub 2023 Jul 8. Mol Biol Rep. 2023. PMID: 37421503 Review.
-
Human endogenous retroviruses and multiple sclerosis: innocent bystanders or disease determinants?
Antony JM, Deslauriers AM, Bhat RK, Ellestad KK, Power C. Antony JM, et al. Biochim Biophys Acta. 2011 Feb;1812(2):162-76. doi: 10.1016/j.bbadis.2010.07.016. Epub 2010 Aug 6. Biochim Biophys Acta. 2011. PMID: 20696240 Free PMC article. Review.
Cited by
-
Making sense of how HIV kills infected CD4 T cells: implications for HIV cure.
Cummins NW, Badley AD. Cummins NW, et al. Mol Cell Ther. 2014 Jul 3;2:20. doi: 10.1186/2052-8426-2-20. eCollection 2014. Mol Cell Ther. 2014. PMID: 26056587 Free PMC article. Review.
-
Grandi N, Cadeddu M, Blomberg J, Tramontano E. Grandi N, et al. Retrovirology. 2016 Sep 9;13(1):67. doi: 10.1186/s12977-016-0301-x. Retrovirology. 2016. PMID: 27613107 Free PMC article.
-
HERVs, immunity, and autoimmunity: understanding the connection.
Greenig M. Greenig M. PeerJ. 2019 Apr 5;7:e6711. doi: 10.7717/peerj.6711. eCollection 2019. PeerJ. 2019. PMID: 30984482 Free PMC article.
-
Matsuzawa A, Lee J, Nakagawa S, Itoh J, Takahashi Ueda M, Mitsuhashi S, Kochi Y, Kaneko-Ishino T, Ishino F. Matsuzawa A, et al. Int J Mol Sci. 2021 Apr 26;22(9):4504. doi: 10.3390/ijms22094504. Int J Mol Sci. 2021. PMID: 33925887 Free PMC article.
-
Human DNA methylomes at base resolution show widespread epigenomic differences.
Lister R, Pelizzola M, Dowen RH, Hawkins RD, Hon G, Tonti-Filippini J, Nery JR, Lee L, Ye Z, Ngo QM, Edsall L, Antosiewicz-Bourget J, Stewart R, Ruotti V, Millar AH, Thomson JA, Ren B, Ecker JR. Lister R, et al. Nature. 2009 Nov 19;462(7271):315-22. doi: 10.1038/nature08514. Epub 2009 Oct 14. Nature. 2009. PMID: 19829295 Free PMC article.
References
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials