From genus to phylum: large-subunit and internal transcribed spacer rRNA operon regions show similar classification accuracies influenced by database composition - PubMed
Comparative Study
From genus to phylum: large-subunit and internal transcribed spacer rRNA operon regions show similar classification accuracies influenced by database composition
Andrea Porras-Alfaro et al. Appl Environ Microbiol. 2014 Feb.
Abstract
We compared the classification accuracy of two sections of the fungal internal transcribed spacer (ITS) region, individually and combined, and the 5' section (about 600 bp) of the large-subunit rRNA (LSU), using a naive Bayesian classifier and BLASTN. A hand-curated ITS-LSU training set of 1,091 sequences and a larger training set of 8,967 ITS region sequences were used. Of the factors evaluated, database composition and quality had the largest effect on classification accuracy, followed by fragment size and use of a bootstrap cutoff to improve classification confidence. The naive Bayesian classifier and BLASTN gave similar results at higher taxonomic levels, but the classifier was faster and more accurate at the genus level when a bootstrap cutoff was used. All of the ITS and LSU sections performed well (>97.7% accuracy) at higher taxonomic ranks from kingdom to family, and differences between them were small at the genus level (within 0.66 to 1.23%). When full-length sequence sections were used, the LSU outperformed the ITS1 and ITS2 fragments at the genus level, but the ITS1 and ITS2 showed higher accuracy when smaller fragment sizes of the same length and a 50% bootstrap cutoff were used. In a comparison using the larger ITS training set, ITS1 and ITS2 had very similar accuracy classification for fragments between 100 and 200 bp. Collectively, the results show that any of the ITS or LSU sections we tested provided comparable classification accuracy to the genus level and underscore the need for larger and more diverse classification training sets.
Figures

Primer locations in the ITS region, showing the variable ITS1 and ITS2 regions and sequence length in the ITS-LSU training set. Two fragment extraction methods were utilized. Fragment sizes are shown with arrows.

Classification accuracy at each taxonomic level using different rRNA gene regions for LOOCV testing with the naive Bayesian classifier (NBC) and BLASTN approaches. Numbers are percentages of correctly classified query sequences from the ITS-LSU database. The naive Bayesian classifier was trained by full-length sequences without a bootstrap cutoff.

Classification accuracy comparison for the ITS1, ITS2, and LSU regions using the naive Bayesian classifier without a bootstrap or with a 50% bootstrap cutoff. The y axis shows the percentages of LOOCV sequences that were accurately classified. (A) ITS1 region; (B) ITS2 region; (C) LSU region. Query sequences of three lengths were extracted using the primer-anchored method.

Percentage of sequences removed from the ITS-LSU training set after applying a 50% bootstrap cutoff with the naive Bayesian classifier. The y axis shows the percentage of sequence removed. Query sequences of three sizes were extracted using the primer-anchored method.

Comparison of the full lengths of ITS1, ITS2, and entire ITS sections using the naive Bayesian classifier (NBC) or BLASTN. The classifier was used with and without a 50% bootstrap cutoff.

Accuracy comparison using the ITS1 and ITS2 sections with the naive Bayesian classifier at its default setting and a 50% bootstrap cutoff for exact-length sequences of 100 bp (A), 150 bp (B), and 200 bp (C). The y axes show percentages of accuracy (lines) and of sequences removed (bars).
Similar articles
-
Accurate, rapid taxonomic classification of fungal large-subunit rRNA genes.
Liu KL, Porras-Alfaro A, Kuske CR, Eichorst SA, Xie G. Liu KL, et al. Appl Environ Microbiol. 2012 Mar;78(5):1523-33. doi: 10.1128/AEM.06826-11. Epub 2011 Dec 22. Appl Environ Microbiol. 2012. PMID: 22194300 Free PMC article.
-
Mota-Gutierrez J, Ferrocino I, Rantsiou K, Cocolin L. Mota-Gutierrez J, et al. Int J Food Microbiol. 2019 Feb 2;290:132-140. doi: 10.1016/j.ijfoodmicro.2018.10.010. Epub 2018 Oct 10. Int J Food Microbiol. 2019. PMID: 30340111
-
Mycofier: a new machine learning-based classifier for fungal ITS sequences.
Delgado-Serrano L, Restrepo S, Bustos JR, Zambrano MM, Anzola JM. Delgado-Serrano L, et al. BMC Res Notes. 2016 Aug 11;9(1):402. doi: 10.1186/s13104-016-2203-3. BMC Res Notes. 2016. PMID: 27516337 Free PMC article.
-
Gürtler V, Grando D. Gürtler V, et al. J Microbiol Methods. 2013 Dec;95(3):425-40. doi: 10.1016/j.mimet.2013.09.009. Epub 2013 Sep 16. J Microbiol Methods. 2013. PMID: 24050948 Review.
Cited by
-
Kubota Y, Takasawa A, Ono Y, Aoyama T, Takasawa K, Tada A, Magara K, Murakami T, Daimon F, Yamamoto S, Sato S, Hiratsuka Y, Kyuno D, Osanai M. Kubota Y, et al. Med Mol Morphol. 2023 Jun;56(2):144-151. doi: 10.1007/s00795-023-00349-w. Epub 2023 Feb 20. Med Mol Morphol. 2023. PMID: 36806624 Free PMC article.
-
Ceballos-Escalera A, Richards J, Arias MB, Inward DJG, Vogler AP. Ceballos-Escalera A, et al. MycoKeys. 2022 Mar 8;88:1-33. doi: 10.3897/mycokeys.88.77106. eCollection 2022. MycoKeys. 2022. PMID: 35585929 Free PMC article.
-
Unuk T, Martinović T, Finžgar D, Šibanc N, Grebenc T, Kraigher H. Unuk T, et al. Front Plant Sci. 2019 Mar 5;10:214. doi: 10.3389/fpls.2019.00214. eCollection 2019. Front Plant Sci. 2019. PMID: 30891052 Free PMC article.
-
Belowground Microbiota and the Health of Tree Crops.
Mercado-Blanco J, Abrantes I, Barra Caracciolo A, Bevivino A, Ciancio A, Grenni P, Hrynkiewicz K, Kredics L, Proença DN. Mercado-Blanco J, et al. Front Microbiol. 2018 Jun 5;9:1006. doi: 10.3389/fmicb.2018.01006. eCollection 2018. Front Microbiol. 2018. PMID: 29922245 Free PMC article. Review.
-
Paloi S, Mhuantong W, Luangsa-Ard JJ, Kobmoo N. Paloi S, et al. J Fungi (Basel). 2021 Sep 16;7(9):767. doi: 10.3390/jof7090767. J Fungi (Basel). 2021. PMID: 34575804 Free PMC article.
References
-
- Tedersoo L, Nilsson RH, Abarenkov K, Jairus T, Sadam A, Saar I, Bahram M, Bechem E, Chuyong G, Kõljalg U. 2010. 454 Pyrosequencing and Sanger sequencing of tropical mycorrhizal fungi provide similar results but reveal substantial methodological biases. New Phytol. 188:291–301. 10.1111/j.1469-8137.2010.03373.x - DOI - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical