Using electronic health records and Internet search information for accurate influenza forecasting

66Citations
Citations of this article
116Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Background: Accurate influenza activity forecasting helps public health officials prepare and allocate resources for unusual influenza activity. Traditional flu surveillance systems, such as the Centers for Disease Control and Prevention's (CDC) influenza-like illnesses reports, lag behind real-time by one to 2 weeks, whereas information contained in cloud-based electronic health records (EHR) and in Internet users' search activity is typically available in near real-time. We present a method that combines the information from these two data sources with historical flu activity to produce national flu forecasts for the United States up to 4 weeks ahead of the publication of CDC's flu reports. Methods: We extend a method originally designed to track flu using Google searches, named ARGO, to combine information from EHR and Internet searches with historical flu activities. Our regularized multivariate regression model dynamically selects the most appropriate variables for flu prediction every week. The model is assessed for the flu seasons within the time period 2013-2016 using multiple metrics including root mean squared error (RMSE). Results: Our method reduces the RMSE of the publicly available alternative (Healthmap flutrends) method by 33, 20, 17 and 21%, for the four time horizons: real-time, one, two, and 3 weeks ahead, respectively. Such accuracy improvements are statistically significant at the 5% level. Our real-time estimates correctly identified the peak timing and magnitude of the studied flu seasons. Conclusions: Our method significantly reduces the prediction error when compared to historical publicly available Internet-based prediction systems, demonstrating that: (1) the method to combine data sources is as important as data quality; (2) effectively extracting information from a cloud-based EHR and Internet search activity leads to accurate forecast of flu.

References Powered by Scopus

Regression Shrinkage and Selection Via the Lasso

35483Citations
N/AReaders
Get full text

Detecting influenza epidemics using search engine query data

3223Citations
N/AReaders
Get full text

The parable of google flu: Traps in big data analysis

1796Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Social media- and internet-based disease surveillance for public health

196Citations
N/AReaders
Get full text

Big data's role in precision public health

141Citations
N/AReaders
Get full text

Accurate influenza monitoring and forecasting using novel internet data streams:a case study in the boston metropolis

74Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Yang, S., Santillana, M., Brownstein, J. S., Gray, J., Richardson, S., & Kou, S. C. (2017). Using electronic health records and Internet search information for accurate influenza forecasting. BMC Infectious Diseases, 17(1). https://doi.org/10.1186/s12879-017-2424-7

Readers over time

‘17‘18‘19‘20‘21‘22‘23‘24‘2509182736

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 49

65%

Researcher 20

27%

Professor / Associate Prof. 4

5%

Lecturer / Post doc 2

3%

Readers' Discipline

Tooltip

Computer Science 22

39%

Medicine and Dentistry 15

26%

Social Sciences 10

18%

Mathematics 10

18%

Save time finding and organizing research with Mendeley

Sign up for free
0