Is Google Trends a reliable tool for digital epidemiology? Insights from different clinical settings

J Epidemiol Glob Health. 2017 Sep;7(3):185-189. doi: 10.1016/j.jegh.2017.06.001. Epub 2017 Jun 9.

Abstract

Internet-derived information has been recently recognized as a valuable tool for epidemiological investigation. Google Trends, a Google Inc. portal, generates data on geographical and temporal patterns according to specified keywords. The aim of this study was to compare the reliability of Google Trends in different clinical settings, for both common diseases with lower media coverage, and for less common diseases attracting major media coverage. We carried out a search in Google Trends using the keywords "renal colic", "epistaxis", and "mushroom poisoning", selected on the basis of available and reliable epidemiological data. Besides this search, we carried out a second search for three clinical conditions (i.e., "meningitis", "Legionella Pneumophila pneumonia", and "Ebola fever"), which recently received major focus by the Italian media. In our analysis, no correlation was found between data captured from Google Trends and epidemiology of renal colics, epistaxis and mushroom poisoning. Only when searching for the term "mushroom" alone the Google Trends search generated a seasonal pattern which almost overlaps with the epidemiological profile, but this was probably mostly due to searches for harvesting and cooking rather than to for poisoning. The Google Trends data also failed to reflect the geographical and temporary patterns of disease for meningitis, Legionella Pneumophila pneumonia and Ebola fever. The results of our study confirm that Google Trends has modest reliability for defining the epidemiology of relatively common diseases with minor media coverage, or relatively rare diseases with higher audience. Overall, Google Trends seems to be more influenced by the media clamor than by true epidemiological burden.

Keywords: Digital epidemiology; Epistaxis; Google Trends; Mushroom poisoning; Renal colic.

MeSH terms

  • Data Accuracy
  • Data Collection* / methods
  • Data Collection* / trends
  • Epidemiologic Methods
  • Epidemiologic Studies*
  • Humans
  • Internet* / standards
  • Internet* / statistics & numerical data
  • Internet* / trends
  • Medical Informatics Computing / trends*
  • Reproducibility of Results
  • Resource Allocation* / methods
  • Resource Allocation* / trends