Flexible Modeling of Epidemics with an Empirical Bayes Framework. 2015

Logan C Brooks, and David C Farrow, and Sangwon Hyun, and Ryan J Tibshirani, and Roni Rosenfeld
School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America.

Seasonal influenza epidemics cause consistent, considerable, widespread loss annually in terms of economic burden, morbidity, and mortality. With access to accurate and reliable forecasts of a current or upcoming influenza epidemic's behavior, policy makers can design and implement more effective countermeasures. This past year, the Centers for Disease Control and Prevention hosted the "Predict the Influenza Season Challenge", with the task of predicting key epidemiological measures for the 2013-2014 U.S. influenza season with the help of digital surveillance data. We developed a framework for in-season forecasts of epidemics using a semiparametric Empirical Bayes framework, and applied it to predict the weekly percentage of outpatient doctors visits for influenza-like illness, and the season onset, duration, peak time, and peak height, with and without using Google Flu Trends data. Previous work on epidemic modeling has focused on developing mechanistic models of disease behavior and applying time series tools to explain historical data. However, tailoring these models to certain types of surveillance data can be challenging, and overly complex models with many parameters can compromise forecasting ability. Our approach instead produces possibilities for the epidemic curve of the season of interest using modified versions of data from previous seasons, allowing for reasonable variations in the timing, pace, and intensity of the seasonal epidemics, as well as noise in observations. Since the framework does not make strict domain-specific assumptions, it can easily be applied to some other diseases with seasonal epidemics. This method produces a complete posterior distribution over epidemic curves, rather than, for example, solely point predictions of forecasting targets. We report prospective influenza-like-illness forecasts made for the 2013-2014 U.S. influenza season, and compare the framework's cross-validated prediction error on historical data to that of a variety of simpler baseline predictors.

UI MeSH Term Description Entries
D007251 Influenza, Human An acute viral infection in humans involving the respiratory tract. It is marked by inflammation of the NASAL MUCOSA; the PHARYNX; and conjunctiva, and by headache and severe, often generalized, myalgia. Grippe,Human Flu,Human Influenza,Influenza in Humans,Influenza,Flu, Human,Human Influenzas,Influenza in Human,Influenzas,Influenzas, Human
D008954 Models, Biological Theoretical representations that simulate the behavior or activity of biological processes or diseases. For disease models in living animals, DISEASE MODELS, ANIMAL is available. Biological models include the use of mathematical equations, computers, and other electronic equipment. Biological Model,Biological Models,Model, Biological,Models, Biologic,Biologic Model,Biologic Models,Model, Biologic
D002487 Centers for Disease Control and Prevention, U.S. An agency of the UNITED STATES PUBLIC HEALTH SERVICE that conducts and supports programs for the prevention and control of disease and provides consultation and assistance to health departments and other countries. United States Centers for Disease Control and Prevention,CDC,CDCP,Center for Disease Control,Center for Disease Control and Prevention,Centers for Disease Control,Centers for Disease Control (U.S.),Centers for Disease Control and Prevention,Centers for Disease Control and Prevention (U.S.),Centers for Disease Control, U.S.,United States Centers for Disease Control
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D001499 Bayes Theorem A theorem in probability theory named for Thomas Bayes (1702-1761). In epidemiology, it is used to obtain the probability of disease in a group of people with some characteristic on the basis of the overall rate of that disease and of the likelihood of that characteristic in healthy and diseased individuals. The most familiar application is in clinical decision analysis where it is used for estimating the probability of a particular diagnosis given the appearance of some symptoms or test result. Bayesian Analysis,Bayesian Estimation,Bayesian Forecast,Bayesian Method,Bayesian Prediction,Analysis, Bayesian,Bayesian Approach,Approach, Bayesian,Approachs, Bayesian,Bayesian Approachs,Estimation, Bayesian,Forecast, Bayesian,Method, Bayesian,Prediction, Bayesian,Theorem, Bayes
D014481 United States A country in NORTH AMERICA between CANADA and MEXICO.
D015203 Reproducibility of Results The statistical reproducibility of measurements (often in a clinical context), including the testing of instrumentation or techniques to obtain reproducible results. The concept includes reproducibility of physiological measurements, which may be used to develop rules to assess probability or prognosis, or response to a stimulus; reproducibility of occurrence of a condition; and reproducibility of experimental results. Reliability and Validity,Reliability of Result,Reproducibility Of Result,Reproducibility of Finding,Validity of Result,Validity of Results,Face Validity,Reliability (Epidemiology),Reliability of Results,Reproducibility of Findings,Test-Retest Reliability,Validity (Epidemiology),Finding Reproducibilities,Finding Reproducibility,Of Result, Reproducibility,Of Results, Reproducibility,Reliabilities, Test-Retest,Reliability, Test-Retest,Result Reliabilities,Result Reliability,Result Validities,Result Validity,Result, Reproducibility Of,Results, Reproducibility Of,Test Retest Reliability,Validity and Reliability,Validity, Face
D015233 Models, Statistical Statistical formulations or analyses which, when applied to data and found to fit the data, are then used to verify the assumptions and parameters used in the analysis. Examples of statistical models are the linear model, binomial model, polynomial model, two-parameter model, etc. Probabilistic Models,Statistical Models,Two-Parameter Models,Model, Statistical,Models, Binomial,Models, Polynomial,Statistical Model,Binomial Model,Binomial Models,Model, Binomial,Model, Polynomial,Model, Probabilistic,Model, Two-Parameter,Models, Probabilistic,Models, Two-Parameter,Polynomial Model,Polynomial Models,Probabilistic Model,Two Parameter Models,Two-Parameter Model
D058872 Epidemics Sudden outbreaks of a disease in a country or region not previously recognized in that area, or a rapid increase in the number of new cases of a previous existing endemic disease. Epidemics can also refer to outbreaks of disease in animal or plant populations.
D019295 Computational Biology A field of biology concerned with the development of techniques for the collection and manipulation of biological data, and the use of such data to make biological discoveries or predictions. This field encompasses all computational methods and theories for solving biological problems including manipulation of models and datasets. Bioinformatics,Molecular Biology, Computational,Bio-Informatics,Biology, Computational,Computational Molecular Biology,Bio Informatics,Bio-Informatic,Bioinformatic,Biologies, Computational Molecular,Biology, Computational Molecular,Computational Molecular Biologies,Molecular Biologies, Computational

Related Publications

Logan C Brooks, and David C Farrow, and Sangwon Hyun, and Ryan J Tibshirani, and Roni Rosenfeld
January 2021, Journal of machine learning research : JMLR,
Logan C Brooks, and David C Farrow, and Sangwon Hyun, and Ryan J Tibshirani, and Roni Rosenfeld
February 2007, Bioinformatics (Oxford, England),
Logan C Brooks, and David C Farrow, and Sangwon Hyun, and Ryan J Tibshirani, and Roni Rosenfeld
May 2014, Statistical science : a review journal of the Institute of Mathematical Statistics,
Logan C Brooks, and David C Farrow, and Sangwon Hyun, and Ryan J Tibshirani, and Roni Rosenfeld
June 2012, Computational statistics & data analysis,
Logan C Brooks, and David C Farrow, and Sangwon Hyun, and Ryan J Tibshirani, and Roni Rosenfeld
November 2016, Bioinformatics (Oxford, England),
Logan C Brooks, and David C Farrow, and Sangwon Hyun, and Ryan J Tibshirani, and Roni Rosenfeld
December 1980, Proceedings of the National Academy of Sciences of the United States of America,
Logan C Brooks, and David C Farrow, and Sangwon Hyun, and Ryan J Tibshirani, and Roni Rosenfeld
July 2018, Scientific reports,
Logan C Brooks, and David C Farrow, and Sangwon Hyun, and Ryan J Tibshirani, and Roni Rosenfeld
September 1986, Risk analysis : an official publication of the Society for Risk Analysis,
Logan C Brooks, and David C Farrow, and Sangwon Hyun, and Ryan J Tibshirani, and Roni Rosenfeld
July 2012, Biostatistics (Oxford, England),
Logan C Brooks, and David C Farrow, and Sangwon Hyun, and Ryan J Tibshirani, and Roni Rosenfeld
April 2013, Computational biology and chemistry,
Copied contents to your clipboard!