Assessing Public Interest Based on Wikipedia's Most Visited Medical Articles During the SARS-CoV-2 Outbreak: Search Trends Analysis - PubMed
- ️Fri Jan 01 2021
Assessing Public Interest Based on Wikipedia's Most Visited Medical Articles During the SARS-CoV-2 Outbreak: Search Trends Analysis
Jędrzej Chrzanowski et al. J Med Internet Res. 2021.
Erratum in
-
Chrzanowski J, Sołek J, Fendler W, Jemielniak D. Chrzanowski J, et al. J Med Internet Res. 2021 Apr 15;23(4):e29598. doi: 10.2196/29598. J Med Internet Res. 2021. PMID: 33857009 Free PMC article.
Abstract
Background: In the current era of widespread access to the internet, we can monitor public interest in a topic via information-targeted web browsing. We sought to provide direct proof of the global population's altered use of Wikipedia medical knowledge resulting from the new COVID-19 pandemic and related global restrictions.
Objective: We aimed to identify temporal search trends and quantify changes in access to Wikipedia Medicine Project articles that were related to the COVID-19 pandemic.
Methods: We performed a retrospective analysis of medical articles across nine language versions of Wikipedia and country-specific statistics for registered COVID-19 deaths. The observed patterns were compared to a forecast model of Wikipedia use, which was trained on data from 2015 to 2019. The model comprehensively analyzed specific articles and similarities between access count data from before (ie, several years prior) and during the COVID-19 pandemic. Wikipedia articles that were linked to those directly associated with the pandemic were evaluated in terms of degrees of separation and analyzed to identify similarities in access counts. We assessed the correlation between article access counts and the number of diagnosed COVID-19 cases and deaths to identify factors that drove interest in these articles and shifts in public interest during the subsequent phases of the pandemic.
Results: We observed a significant (P<.001) increase in the number of entries on Wikipedia medical articles during the pandemic period. The increased interest in COVID-19-related articles temporally correlated with the number of global COVID-19 deaths and consistently correlated with the number of region-specific COVID-19 deaths. Articles with low degrees of separation were significantly similar (P<.001) in terms of access patterns that were indicative of information-seeking patterns.
Conclusions: The analysis of Wikipedia medical article popularity could be a viable method for epidemiologic surveillance, as it provides important information about the reasons behind public attention and factors that sustain public interest in the long term. Moreover, Wikipedia users can potentially be directed to credible and valuable information sources that are linked with the most prominent articles.
Keywords: COVID-19; Wikipedia; infodemiology; information seeking; infoveillance; interest; internet; media; online health information; pandemic; retrospective; surveillance.
©Jędrzej Chrzanowski, Julia Sołek, Dariusz Jemielniak, Dariusz Jemielniak. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 12.04.2021.
Conflict of interest statement
Conflicts of Interest: Author DJ is a non-paid, volunteer member of the Board of Trustees of Wikimedia Foundation, a non-profit publisher of Wikipedia. All the other authors have no conflicts to declare.
Figures

The disruption in annual Wikipedia visit patterns. (A) The pattern of general English Wikipedia access from 2015 to 2020 (Multimedia Appendix 3). Data for other language versions of Wikipedia are provided in Multimedia Appendix 4, Supplementary Figures S1a-S9a. (B) The FBProphet prediction model was created based on data from the 2015-2019 period. The model compared expected behaviors in 2020 (ie, the blue line) to observed access (ie, the orange line). Data for the other language versions of Wikipedia are provided in Multimedia Appendix 4, Supplementary Figures S1b-S9b. (C) A summary of monthly general access to Wikipedia across all Wikipedia language versions from September 2019 to September 2020. Solid lines represent Chi-square goodness-of-fit values and the dashed line represents the cutoff value. (D) The stability of the two reference Wikipedia articles across all languages in the 2015-2019 and 2020 periods (ie, the moving SD divided by mean percent access across a 30-day window). DE: German; EN: English; ES: Spanish; FR: French; IT: Italian; NL: Dutch; PL: Polish; RU: Russian; SV: Swedish; VI: Vietnamese.

Access to COVID-19–related Wikipedia articles in 2020 and the total number of deaths resulting from SARS-CoV-2 infection. (A) The percentage of daily article access (ie, the "COVID-19 pandemic," "Spanish flu," "Leonardo da Vinci," and "Sexual intercourse" articles) to English Wikipedia and total number of global deaths resulting from SARS-CoV-2 infection (ie, per 1 million people). Data for other language versions of Wikipedia are provided in Multimedia Appendix 4, Supplementary Figures S18-S25. (B) Heatmap of Spearman absolute regression coefficients for COVID-19–related articles and the total number of global deaths resulting from SARS-CoV-2 infection across languages and months. (C) Access to COVID-19–related articles (ie, the kNN-determined relevance values across selected Wikipedia language versions) versus the total number of region-specific deaths resulting from SARS-CoV-2 infection. The graph shows correlations between region-specific deaths and COVID-19–related articles. (D) Access to COVID-19–related articles (ie, the kNN-determined relevance values across selected Wikipedia language versions) versus the total number of global deaths resulting from SARS-CoV-2 infection. DE: German; EN: English; ES: Spanish; FR: French; IT: Italian; kNN: k-nearest neighbor; NL: Dutch; PL: Polish; RU: Russian; SV: Swedish; VI: Vietnamese.
Similar articles
-
Farič N, Potts HW, Heilman JM. Farič N, et al. J Med Internet Res. 2024 Sep 12;26:e47562. doi: 10.2196/47562. J Med Internet Res. 2024. PMID: 39264697 Free PMC article.
-
Wikipedia and medicine: quantifying readership, editors, and the significance of natural language.
Heilman JM, West AG. Heilman JM, et al. J Med Internet Res. 2015 Mar 4;17(3):e62. doi: 10.2196/jmir.4069. J Med Internet Res. 2015. PMID: 25739399 Free PMC article.
-
Badell-Grau RA, Cuff JP, Kelly BP, Waller-Evans H, Lloyd-Evans E. Badell-Grau RA, et al. J Med Internet Res. 2020 Oct 27;22(10):e19791. doi: 10.2196/19791. J Med Internet Res. 2020. PMID: 32915763 Free PMC article.
-
Maggio LA, Willinsky JM, Costello JA, Skinner NA, Martin PC, Dawson JE. Maggio LA, et al. Perspect Med Educ. 2020 Dec;9(6):333-342. doi: 10.1007/s40037-020-00620-1. Epub 2020 Oct 8. Perspect Med Educ. 2020. PMID: 33030643 Free PMC article. Review.
-
People behavioral during health information searching in COVID-19 era: a review.
Alzghaibi H. Alzghaibi H. Front Public Health. 2023 Aug 10;11:1166639. doi: 10.3389/fpubh.2023.1166639. eCollection 2023. Front Public Health. 2023. PMID: 37637820 Free PMC article. Review.
Cited by
-
Gordejeva J, Zowalla R, Pobiruchin M, Wiesner M. Gordejeva J, et al. J Med Internet Res. 2022 May 16;24(5):e36835. doi: 10.2196/36835. J Med Internet Res. 2022. PMID: 35576562 Free PMC article.
-
Citation needed? Wikipedia bibliometrics during the first wave of the COVID-19 pandemic.
Benjakob O, Aviram R, Sobel JA. Benjakob O, et al. Gigascience. 2022 Jan 12;11(1):giab095. doi: 10.1093/gigascience/giab095. Gigascience. 2022. PMID: 35022700 Free PMC article.
-
Volunteer contributions to Wikipedia increased during COVID-19 mobility restrictions.
Ruprechter T, Horta Ribeiro M, Santos T, Lemmerich F, Strohmaier M, West R, Helic D. Ruprechter T, et al. Sci Rep. 2021 Nov 2;11(1):21505. doi: 10.1038/s41598-021-00789-3. Sci Rep. 2021. PMID: 34728670 Free PMC article.
-
Undergraduate Students' Online Health Information-Seeking Behavior during the COVID-19 Pandemic.
Hsu WC. Hsu WC. Int J Environ Res Public Health. 2021 Dec 16;18(24):13250. doi: 10.3390/ijerph182413250. Int J Environ Res Public Health. 2021. PMID: 34948859 Free PMC article.
-
Okunoye B, Ning S, Jemielniak D. Okunoye B, et al. JMIR Form Res. 2022 Mar 11;6(3):e29819. doi: 10.2196/29819. JMIR Form Res. 2022. PMID: 35275080 Free PMC article.
References
-
- Ashrafi-Rizi H, Kazempour Z. Information typology in coronavirus (COVID-19) crisis; a commentary. Arch Acad Emerg Med. 2020 Mar 12;8(1):e19. http://europepmc.org/abstract/MED/32185370 - PMC - PubMed
-
- Ashrafi-Rizi H, Kazempour Z. Information diet in Covid-19 crisis; a commentary. Arch Acad Emerg Med. 2020 Mar 28;8(1):e30. http://europepmc.org/abstract/MED/32232215 - PMC - PubMed
-
- Bragazzi NL, Dai H, Damiani G, Behzadifar M, Martini M, Wu J. How big data and artificial intelligence can help better manage the COVID-19 pandemic. Int J Environ Res Public Health. 2020 May 02;17(9):3176. doi: 10.3390/ijerph17093176. https://www.mdpi.com/resolver?pii=ijerph17093176 - DOI - PMC - PubMed
-
- Bento AI, Nguyen T, Wing C, Lozano-Rojas F, Ahn YY, Simon K. Evidence from internet search data shows information-seeking responses to news of local COVID-19 cases. Proc Natl Acad Sci U S A. 2020 May 26;117(21):11220–11222. doi: 10.1073/pnas.2005335117. http://www.pnas.org/cgi/pmidlookup?view=long&pmid=32366658 - DOI - PMC - PubMed
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical
Miscellaneous