Comprehensive discovery of endogenous Argonaute binding sites in Caenorhabditis elegans - PubMed
Comprehensive discovery of endogenous Argonaute binding sites in Caenorhabditis elegans
Dimitrios G Zisoulis et al. Nat Struct Mol Biol. 2010 Feb.
Abstract
MicroRNAs (miRNAs) regulate gene expression by guiding Argonaute proteins to specific target mRNA sequences. Identification of bona fide miRNA target sites in animals is challenging because of uncertainties regarding the base-pairing requirements between miRNA and target as well as the location of functional binding sites within mRNAs. Here we present the results of a comprehensive strategy aimed at isolating endogenous mRNA target sequences bound by the Argonaute protein ALG-1 in C. elegans. Using cross-linking and ALG-1 immunoprecipitation coupled with high-throughput sequencing (CLIP-seq), we identified extensive ALG-1 interactions with specific 3' untranslated region (UTR) and coding exon sequences and discovered features that distinguish miRNA complex binding sites in 3' UTRs from those in other genic regions. Furthermore, our analyses revealed a striking enrichment of Argonaute binding sites in genes important for miRNA function, suggesting an autoregulatory role that may confer robustness to the miRNA pathway.
Figures
![Figure 1](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6a28/2834287/64612c8177ee/nihms163935f1.gif)
MicroRNA targets identified by ALG-1 CLIP-seq in L4-stage worms. Graphical depictions of the number and location of reads from alg-1(−) (red upper tracks) and WT (blue lower tracks) from three biological replicates, CLIP-derived clusters (solid rectangular boxes over the reads) and putative miRNA binding sites in the 3′ UTR of mRNA transcripts (LCS, let-7 complementary sequence; LCE, lin-4 complementary element). (a) lin-41 3′ UTR. Of the six predicted LCSs, LCS1 and LCS2 (open boxes) have experimentally validated let-7 sites–. (b) lin-14 3′ UTR. Deletion of all the predicted LCE sites or of LCEs 1–5 results in misregulation of lin-14 expression,. (c) hbl-1 3′ UTR,,. The sites for let-7 miRNA (LCSs 1–8) binding have been predicted but not experimentally tested. (d) daf-12 3′ UTR. Deletion of the predicted LCSs 1–4 or 5–8 in reporter constructs leads to misregulation of reporter gene expression.
![Figure 2](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6a28/2834287/489892635f90/nihms163935f2.gif)
Relative position of ALG-1 binding sites across protein-coding genes. (a) Distribution of WT (blue) and alg-1(−) (red) clusters across a composite mRNA length. Cluster position is depicted as a percentage of the gene region, from the beginning to the end of spliced transcripts. (b) Distribution of WT (blue) and alg-1(−) (red) clusters across the 3′ UTR region. Cluster position is depicted as a percentage of the region from the annotated translational stop codon to the end of transcripts, as defined by our reannotation of C. elegans genes.
![Figure 3](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6a28/2834287/bdc2eb0763ef/nihms163935f3.gif)
Attributes enriched in ALG-1 binding sites within 3′ UTRs. (a) Box plots of the conservation levels measured as the fraction of perfectly conserved nucleotides between genome-wide alignments of C. elegans and HG004659 and GM084317 brenneri in CLIP-derived clusters (CDCs) and randomly derived clusters (RDCs). CDCs are significantly more conserved than RDCs as assessed by the Kolgomorov-Smirnov two-sample test (P < 10−36). (b) Box plots of RNA accessibility, measured as the average probability of being unpaired, of CDC and RDC and their corresponding flanking sequences (100 nt upstream or downstream). CDCs and flanking sequences are significantly more accessible than RDCs in the same locations as assessed by the Kolgomorov-Smirnov two-sample test (P < 10−10). (c) The ten most enriched k-mers (k = 5, 6, 7) within or 100 nt upstream or downstream of CDCs, compared to RDCs, are shown along with the range of Z-scores for the specific categories. (d–g). The number of conserved hexamers within CDCs (solid line) and RDCs (dashed line) that base-pair to miRNA or scrambled miRNA regions (dotted line), allowing for zero (orange) or only one G•U base pair (black). Error bars in dashed and dotted lines represent the s.d. among ten independent sets of RDCs and scrambled miRNAs, respectively. Hexamers within 3′-UTR CDCs and RDCs (d) or coding-exon CDCs and RDCs (e) that base-pair to cloned miRNAs or shuffled versions of cloned miRNAs. Hexamers within 3′-UTR CDCs and RDCs that base-pair to the let-7 or shuffled let-7 miRNA (f) and lin-4 or shuffled lin-4 miRNA (g). Regions of the miRNA(s) that have statistically enriched numbers of complementary hexamers within CDCs when compared to RDCs or shuffled miRNAs are denoted by * (P < 0.01) and ** (P < 10−6) as measured by a Z-test.
![Figure 4](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6a28/2834287/8f815968a000/nihms163935f4.gif)
Relationship between ALG-1 binding and mRNA expression levels. (a) Effects of ALG-1 binding on mRNA levels. Box plots representing the differential expression (as a t-statistic) of genes from biological replicate microarray experiments comparing alg-1(−) to WT L4-stage worms. Genes are divided into those that contained no CDCs and those that contained CDCs only within 3′ UTRs or coding exons. Compared to genes with no CDCs or coding-exon CDCs, genes with 3′-UTR CDCs are significantly more upregulated in alg-1(−) relative to WT as assayed by the Wilcoxon rank-sum test (P < 10−4). (b) Functional enrichment of genes that have CDCs only within 3′ UTR or coding exons that are up- or downregulated in alg-1(−) worms using significantly enriched (P < 0.05 in at least one row; Holm-Bonferroni corrected) functional categories defined by the C. elegans Topomap algorithm. The intensity on the heat-map denotes −log10(p value). Genes represented by these functional categories can be divided in a matrix (right) depending on the location of the CDCs (3′ UTRs or coding exons), and whether the genes are up- or downregulated in the alg-1(−) mutants relative to WT worms. Several categories occupy multiple cells in the matrix, for example “Cell structure,” “Collagen,” “Cell adhesion,” “Protein expression,” “RNA binding” and “Germ line–enriched.” (c) UCSC Genome Browser view depicting clusters in the 3′ UTR of the alg-1 gene (blue, WT clusters; red, alg-1(−) clusters, none present) and the predicted miRNA binding sites by the various algorithms.
Similar articles
-
Zhang L, Ding L, Cheung TH, Dong MQ, Chen J, Sewell AK, Liu X, Yates JR 3rd, Han M. Zhang L, et al. Mol Cell. 2007 Nov 30;28(4):598-613. doi: 10.1016/j.molcel.2007.09.014. Mol Cell. 2007. PMID: 18042455 Free PMC article.
-
Pairing beyond the Seed Supports MicroRNA Targeting Specificity.
Broughton JP, Lovci MT, Huang JL, Yeo GW, Pasquinelli AE. Broughton JP, et al. Mol Cell. 2016 Oct 20;64(2):320-333. doi: 10.1016/j.molcel.2016.09.004. Epub 2016 Oct 6. Mol Cell. 2016. PMID: 27720646 Free PMC article.
-
Brown KC, Svendsen JM, Tucci RM, Montgomery BE, Montgomery TA. Brown KC, et al. Nucleic Acids Res. 2017 Sep 6;45(15):9093-9107. doi: 10.1093/nar/gkx536. Nucleic Acids Res. 2017. PMID: 28645154 Free PMC article.
-
Roles of microRNAs in the Caenorhabditis elegans nervous system.
Meng L, Chen L, Li Z, Wu ZX, Shan G. Meng L, et al. J Genet Genomics. 2013 Sep 20;40(9):445-52. doi: 10.1016/j.jgg.2013.07.002. Epub 2013 Aug 7. J Genet Genomics. 2013. PMID: 24053946 Review.
-
The microRNAs of Caenorhabditis elegans.
Kaufman EJ, Miska EA. Kaufman EJ, et al. Semin Cell Dev Biol. 2010 Sep;21(7):728-37. doi: 10.1016/j.semcdb.2010.07.001. Epub 2010 Jul 15. Semin Cell Dev Biol. 2010. PMID: 20637886 Review.
Cited by
-
Pervasive and dynamic protein binding sites of the mRNA transcriptome in Saccharomyces cerevisiae.
Freeberg MA, Han T, Moresco JJ, Kong A, Yang YC, Lu ZJ, Yates JR, Kim JK. Freeberg MA, et al. Genome Biol. 2013 Feb 14;14(2):R13. doi: 10.1186/gb-2013-14-2-r13. Genome Biol. 2013. PMID: 23409723 Free PMC article.
-
TarBase 6.0: capturing the exponential growth of miRNA targets with experimental support.
Vergoulis T, Vlachos IS, Alexiou P, Georgakilas G, Maragkakis M, Reczko M, Gerangelos S, Koziris N, Dalamagas T, Hatzigeorgiou AG. Vergoulis T, et al. Nucleic Acids Res. 2012 Jan;40(Database issue):D222-9. doi: 10.1093/nar/gkr1161. Epub 2011 Dec 1. Nucleic Acids Res. 2012. PMID: 22135297 Free PMC article.
-
Regulation of circadian rhythm and sleep by miR-375-timeless interaction in Drosophila.
Xia X, Fu X, Du J, Wu B, Zhao X, Zhu J, Zhao Z. Xia X, et al. FASEB J. 2020 Dec;34(12):16536-16551. doi: 10.1096/fj.202001107R. Epub 2020 Oct 20. FASEB J. 2020. PMID: 33078445 Free PMC article.
-
Haecker I, Gay LA, Yang Y, Hu J, Morse AM, McIntyre LM, Renne R. Haecker I, et al. PLoS Pathog. 2012;8(8):e1002884. doi: 10.1371/journal.ppat.1002884. Epub 2012 Aug 23. PLoS Pathog. 2012. PMID: 22927820 Free PMC article.
-
Nuclear and cytoplasmic poly(A) binding proteins (PABPs) favor distinct transcripts and isoforms.
Nicholson-Shaw AL, Kofman ER, Yeo GW, Pasquinelli AE. Nicholson-Shaw AL, et al. Nucleic Acids Res. 2022 May 6;50(8):4685-4702. doi: 10.1093/nar/gkac263. Nucleic Acids Res. 2022. PMID: 35438785 Free PMC article.
References
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases