pubmed.ncbi.nlm.nih.gov

Rfam 12.0: updates to the RNA families database - PubMed

. 2015 Jan;43(Database issue):D130-7.

doi: 10.1093/nar/gku1063. Epub 2014 Nov 11.

Affiliations

Rfam 12.0: updates to the RNA families database

Eric P Nawrocki et al. Nucleic Acids Res. 2015 Jan.

Abstract

The Rfam database (available at http://rfam.xfam.org) is a collection of non-coding RNA families represented by manually curated sequence alignments, consensus secondary structures and annotation gathered from corresponding Wikipedia, taxonomy and ontology resources. In this article, we detail updates and improvements to the Rfam data and website for the Rfam 12.0 release. We describe the upgrade of our search pipeline to use Infernal 1.1 and demonstrate its improved homology detection ability by comparison with the previous version. The new pipeline is easier for users to apply to their own data sets, and we illustrate its ability to annotate RNAs in genomic and metagenomic data sets of various sizes. Rfam has been expanded to include 260 new families, including the well-studied large subunit ribosomal RNA family, and for the first time includes information on short sequence- and structure-based RNA motifs present within families.

© The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.

Number of Rfam family matches for each of the 34 RMfam motifs.

Figure 2.
Figure 2.

Overview of the motif page for RM00022, the Terminator1 motif, on the Rfam 12.0 website. As in family and clan pages, tabs on the left-hand side allow the user to access different information for each motif.

Figure 3.
Figure 3.

Screenshot of the secondary structure representation for the RsmY RNA family (RF00195) with the annotation for the CsrA binding motif (RM00005) overlaid. Positions in red indicate that all the seed sequences at that position are found to contain the motif while other colours represent fewer sequences having matches at that position. The CsrA protein is a homo-dimeric, RNA binding protein. Each CsrA binds a specific RNA motif that is characterized by a short hairpin that hosts a GGA subsequence, these motifs generally occur in pairs. The CsrA-binding sRNAs, like RsmY, generally sequester excess copies of CsrA which would otherwise bind mRNAs and inhibit translation (23). Therefore, the expression of these sRNAs is a rapid way of altering expression levels for a potentially large network of proteins (24).

Similar articles

  • Rfam 11.0: 10 years of RNA families.

    Burge SW, Daub J, Eberhardt R, Tate J, Barquist L, Nawrocki EP, Eddy SR, Gardner PP, Bateman A. Burge SW, et al. Nucleic Acids Res. 2013 Jan;41(Database issue):D226-32. doi: 10.1093/nar/gks1005. Epub 2012 Nov 3. Nucleic Acids Res. 2013. PMID: 23125362 Free PMC article.

  • Non-Coding RNA Analysis Using the Rfam Database.

    Kalvari I, Nawrocki EP, Argasinska J, Quinones-Olvera N, Finn RD, Bateman A, Petrov AI. Kalvari I, et al. Curr Protoc Bioinformatics. 2018 Jun;62(1):e51. doi: 10.1002/cpbi.51. Epub 2018 Jun 5. Curr Protoc Bioinformatics. 2018. PMID: 29927072 Free PMC article.

  • Rfam: an RNA family database.

    Griffiths-Jones S, Bateman A, Marshall M, Khanna A, Eddy SR. Griffiths-Jones S, et al. Nucleic Acids Res. 2003 Jan 1;31(1):439-41. doi: 10.1093/nar/gkg006. Nucleic Acids Res. 2003. PMID: 12520045 Free PMC article.

  • Computational identification of functional RNA homologs in metagenomic data.

    Nawrocki EP, Eddy SR. Nawrocki EP, et al. RNA Biol. 2013 Jul;10(7):1170-9. doi: 10.4161/rna.25038. Epub 2013 May 20. RNA Biol. 2013. PMID: 23722291 Free PMC article. Review.

  • Customized strategies for discovering distant ncRNA homologs.

    Mosig A, Zhu L, Stadler PF. Mosig A, et al. Brief Funct Genomic Proteomic. 2009 Nov;8(6):451-60. doi: 10.1093/bfgp/elp035. Epub 2009 Sep 24. Brief Funct Genomic Proteomic. 2009. PMID: 19779009 Review.

Cited by

References

    1. Cochrane G., Alako B., Amid C., Bower L., Cerdeo-Trraga A., Cleland I., Gibson R., Goodgame N., Jang M., Kay S., et al. Facing growth in the European nucleotide archive. Nucleic Acids Res. 2013;41:D30–D35. - PMC - PubMed
    1. Nawrocki E.P., Eddy S.R. Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics. 2013;29:2933–2935. - PMC - PubMed
    1. Berman H.M., Westbrook J., Feng Z., Gilliland G., Bhat T.N., Weissig H., Shindyalov I.N., Bourne P.E. The Protein Data Bank. Nucleic Acids Res. 2000;28:235–242. - PMC - PubMed
    1. Kozomara A., Griffiths-Jones S. miRBase: annotating high confidence microRNAs using deep sequencing data. Nucleic Acids Res. 2014;42:D68–D73. - PMC - PubMed
    1. Gene Ontology Consortium. Gene Ontology annotations and resources. Nucleic Acids Res. 2013;41:D530–D535. - PMC - PubMed

Publication types

MeSH terms

Substances