Correlations of online search engine trends with coronavirus disease (COVID-19) incidence: Infodemiology study

Thomas S. Higgins, Arthur W. Wu, Dhruv Sharma, Elisa A. Illing, Kolin Rubel, Jonathan Y. Ting, Snot Force Alliance

Research output: Contribution to journalArticlepeer-review

88 Scopus citations

Abstract

Background: The coronavirus disease (COVID-19) is the latest pandemic of the digital age. With the internet harvesting large amounts of data from the general population in real time, public databases such as Google Trends (GT) and the Baidu Index (BI) can be an expedient tool to assist public health efforts. Objective: The aim of this study is to apply digital epidemiology to the current COVID-19 pandemic to determine the utility of providing adjunctive epidemiologic information on outbreaks of this disease and evaluate this methodology in the case of future pandemics. Methods: An epidemiologic time series analysis of online search trends relating to the COVID-19 pandemic was performed from January 9, 2020, to April 6, 2020. BI was used to obtain online search data for China, while GT was used for worldwide data, the countries of Italy and Spain, and the US states of New York and Washington. These data were compared to real-world confirmed cases and deaths of COVID-19. Chronologic patterns were assessed in relation to disease patterns, significant events, and media reports. Results: Worldwide search terms for shortness of breath, anosmia, dysgeusia and ageusia, headache, chest pain, and sneezing had strong correlations (r>0.60, P<.001) to both new daily confirmed cases and deaths from COVID-19. GT COVID-19 (search term) and GT coronavirus (virus) searches predated real-world confirmed cases by 12 days (r=0.85, SD 0.10 and r=0.76, SD 0.09, respectively, P<.001). Searches for symptoms of diarrhea, fever, shortness of breath, cough, nasal obstruction, and rhinorrhea all had a negative lag greater than 1 week compared to new daily cases, while searches for anosmia and dysgeusia peaked worldwide and in China with positive lags of 5 days and 6 weeks, respectively, corresponding with widespread media coverage of these symptoms in COVID-19. Conclusions: This study demonstrates the utility of digital epidemiology in providing helpful surveillance data of disease outbreaks like COVID-19. Although certain online search trends for this disease were influenced by media coverage, many search terms reflected clinical manifestations of the disease and showed strong correlations with real-world cases and deaths.

Original languageEnglish (US)
Article numbere19702
JournalJMIR Public Health and Surveillance
Volume6
Issue number2
DOIs
StatePublished - Apr 2020
Externally publishedYes

Bibliographical note

Publisher Copyright:
© Thomas S Higgins.

Keywords

  • Baidu
  • Big data
  • China
  • Coronavirus
  • COVID-19
  • Digital health
  • Epidemiology
  • Google Trends
  • Infodemiology
  • Infoveillance
  • Italy
  • New York
  • SARS-CoV-2
  • Spain
  • Washington

Fingerprint

Dive into the research topics of 'Correlations of online search engine trends with coronavirus disease (COVID-19) incidence: Infodemiology study'. Together they form a unique fingerprint.

Cite this