
6,355 research outputs found

Combining Search, Social Media, and Traditional Data Sources to Improve Influenza Surveillance

Author: André T Nguyen
Elaine O Nsoesie
John S Brownstein
Mark Dredze
Mauricio Santillana
Michael J Paul
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 27/08/2015

We present a machine learning-based methodology capable of providing real-time ("nowcast") and forecast estimates of influenza activity in the US by leveraging data from multiple data sources including: Google searches, Twitter microblogs, nearly real-time hospital visit records, and data from a participatory surveillance system. Our main contribution consists of combining multiple influenza-like illnesses (ILI) activity estimates, generated independently with each data source, into a single prediction of ILI utilizing machine learning ensemble approaches. Our methodology exploits the information in each data source and produces accurate weekly ILI predictions for up to four weeks ahead of the release of CDC's ILI reports. We evaluate the predictive ability of our ensemble approach during the 2013-2014 (retrospective) and 2014-2015 (live) flu seasons for each of the four weekly time horizons. Our ensemble approach demonstrates several advantages: (1) our ensemble method's predictions outperform every prediction using each data source independently, (2) our methodology can produce predictions one week ahead of GFT's real-time estimates with comparable accuracy, and (3) our two and three week forecast estimates have comparable accuracy to real-time predictions using an autoregressive model. Moreover, our results show that considerable insight is gained from incorporating disparate data streams, in the form of social media and crowd sourced data, into influenza predictions in all time horizons.
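The idea of merging independently generated ILI estimates into a single prediction can be illustrated with a very small sketch. This is not the authors' ensemble method (the abstract does not specify the learners used); it swaps in plain inverse-error weighting, where each source is weighted by the inverse of its mean squared error on past weeks. The function names, source labels, and all numbers are hypothetical and chosen only for illustration.

```python
from statistics import fmean


def inverse_mse_weights(history, truth):
    """Weight each source by the inverse of its mean squared error on past weeks.

    history: dict mapping a source name to its past weekly estimates.
    truth:   the reference values for the same weeks.
    """
    mse = {
        name: fmean((e - t) ** 2 for e, t in zip(estimates, truth))
        for name, estimates in history.items()
    }
    # Guard against division by zero for a source that was exactly right.
    inverse = {name: 1.0 / max(m, 1e-12) for name, m in mse.items()}
    total = sum(inverse.values())
    return {name: value / total for name, value in inverse.items()}


def combine(weights, current):
    """Weighted average of the current per-source estimates."""
    return sum(weights[name] * current[name] for name in weights)


# Made-up illustrative values, not data from the paper.
truth = [1.0, 2.0, 3.0, 2.0]
history = {
    "search": [1.1, 2.1, 3.1, 2.1],   # consistently off by 0.1
    "twitter": [1.2, 1.8, 3.2, 1.8],  # consistently off by 0.2
}
weights = inverse_mse_weights(history, truth)
nowcast = combine(weights, {"search": 2.5, "twitter": 3.0})
print(weights, nowcast)
```

The source with the smaller historical error receives the larger weight, so the combined estimate leans toward it. A learned ensemble, as described in the abstract, would replace this fixed weighting rule with a trained model.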

arXiv.org e-Print Archive

Harvard University - DASH

Directory of Open Access Journals

PubMed Central

Global disease monitoring and forecasting with Wikipedia

Author: Del Valle Sara Y.
Deshpande Alina
Fairchild Geoffrey
Generous Nicholas
Priedhorsky Reid
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 15/07/2014

Infectious disease is a leading threat to public health, economic stability, and other key social structures. Efforts to mitigate these impacts depend on accurate and timely monitoring to measure the risk and progress of disease. Traditional, biologically-focused monitoring techniques are accurate but costly and slow; in response, new techniques based on social internet data such as social media and search queries are emerging. These efforts are promising, but important challenges in the areas of scientific peer review, breadth of diseases and countries, and forecasting hamper their operational usefulness. We examine a freely available, open data source for this use: access logs from the online encyclopedia Wikipedia. Using linear models, language as a proxy for location, and a systematic yet simple article selection procedure, we tested 14 location-disease combinations and demonstrate that these data feasibly support an approach that overcomes these challenges. Specifically, our proof-of-concept yields models with r^2 up to 0.92, forecasting value up to the 28 days tested, and several pairs of models similar enough to suggest that transferring models from one location to another without re-training is feasible. Based on these preliminary results, we close with a research agenda designed to overcome these challenges and produce a disease monitoring and forecasting system that is significantly more effective, robust, and globally comprehensive than the current state of the art.

Comment: 27 pages; 4 figures; 4 tables. Version 2: Cite McIver & Brownstein and adjust novelty claims accordingly; revise title; various revisions for clarity
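As a rough illustration of what fitting a linear model to article traffic and scoring it with r^2 involves, here is a minimal single-predictor least-squares sketch. It is an assumption-laden toy: the paper's actual models, article selection, and data are not reproduced, and the `views` and `cases` values below are invented.

```python
from statistics import fmean


def fit_line(x, y):
    """Ordinary least squares for y = slope * x + intercept."""
    mean_x, mean_y = fmean(x), fmean(y)
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def r_squared(x, y, slope, intercept):
    """Coefficient of determination: 1 - residual SS / total SS."""
    mean_y = fmean(y)
    ss_res = sum((yi - (slope * xi + intercept)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return 1.0 - ss_res / ss_tot


# Invented weekly values: article page views and reported case counts.
views = [100, 200, 300, 400, 500]
cases = [12, 19, 33, 41, 50]

slope, intercept = fit_line(views, cases)
r2 = r_squared(views, cases, slope, intercept)
print(slope, intercept, r2)
```

Shifting `cases` forward by a fixed number of weeks before fitting would turn the same sketch into a simple forecasting test, which is one way to read the "forecasting value up to the 28 days tested" claim.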

arXiv.org e-Print Archive

CiteSeerX

Directory of Open Access Journals

PubMed Central

FigShare

Results from the Centers for Disease Control and Prevention's Predict the 2013-2014 Influenza Season Challenge

Author: Allen Christopher
Alper David
Aman Susan
Anil Kumar V. S.
Aslam Anoshã
Bakach Iurii
Barrett Chris
Basagni Stefano
Biggerstaff Matthew
Bisset Keith
Broniatowski David
Brooks Logan
Brownstein John S.
Butler Patrick
Chakraborty Prithwish
Chandra Priyadarshini
Chen Jiangzhuo
Del Valle Sara Y.
Deshpande Alina
Dredze Mark
Eggo Rosalind
Eubank Stephen
Fairchild Geoffrey
Farrow David
Finelli Lyn
Fox Spencer
Fung Isaac Chun Hai
Gambhir Manoj
Generous Nicholas
Gesualdo Francesco
Goldstein Ed
Hao Yi
Henderson Jette
Hickman Kyle S.
Hickmann Kyle S.
Hyman James M.
Hyun Sangwon
Karspeck Alicia
Kaup Hemchandra
Khadivi Pejman
Krishnan Ramesh
Laskowski Kathy
Lewis Bryan
Lipsitch Marc
Lum Kristian
Madhavan Satish
Marathe Madhav
Markar Ashirwad
Mekaru Sumiko R.
Meyers Lauren Ancel
Nagel Anna
Nsoesie Elaine O.
Pashley Bryanne
Paul Michael
Perra Nicola
Priedhorsky Reid
Ramakrishnan Anurekha
Ramakrishnan Naren
Rosenfeld Roni
Scarpino Sam
Schaible Braydon J.
Scott James
Sexton Jessica K.
Shaman Jeffrey
Singh Bismark
Srinivasan Ravi
Stilo Giovanni
Tibshirani Ryan J.
Tozzi Alberto E.
Tse Zion Tsz Ho
Tsou Ming Hsiang
Velardi Paola
Vespignani Alessandro
Yang Wan
Ying Yuchen
Zhang Qian
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016

Background: Early insights into the timing of the start, peak, and intensity of the influenza season could be useful in planning influenza prevention and control activities. To encourage development and innovation in influenza forecasting, the Centers for Disease Control and Prevention (CDC) organized a challenge to predict the 2013-14 United States influenza season. Methods: Challenge contestants were asked to forecast the start, peak, and intensity of the 2013-2014 influenza season at the national level and at any or all Health and Human Services (HHS) region level(s). The challenge ran from December 1, 2013 to March 27, 2014; contestants were required to submit 9 biweekly forecasts at the national level to be eligible. The selection of the winner was based on expert evaluation of the methodology used to make the prediction and the accuracy of the prediction as judged against the U.S. Outpatient Influenza-like Illness Surveillance Network (ILINet). Results: Nine teams submitted 13 forecasts for all required milestones. The first forecast was due on December 2, 2013; 3/13 forecasts received correctly predicted the start of the influenza season within one week, 1/13 predicted the peak within 1 week, 3/13 predicted the peak ILINet percentage within 1%, and 4/13 predicted the season duration within 1 week. For the prediction due on December 19, 2013, the number of forecasts that correctly forecasted the peak week increased to 2/13, the peak percentage to 6/13, and the duration of the season to 6/13. As the season progressed, the forecasts became more stable and were closer to the season milestones. Conclusion: Forecasting has become technically feasible, but further efforts are needed to improve forecast accuracy so that policy makers can reliably use these predictions. CDC and challenge contestants plan to build upon the methods developed during this contest to improve the accuracy of influenza forecasts. © 2016 The Author(s)

Archivio della ricerca - Università di Roma La Sapienza

Marshfield Clinic: Health Information Technology Paves the Way for Population Health Management

Author: Douglas McCarthy
Kimberly Mueller
Publication venue: 'The Commonwealth Fund (CMWF)'
Publication date: 08/08/2009

Highlights Fund-defined attributes of an ideal care delivery system and best practices, including an internal electronic health record, primary care teams, physician quality metrics and mentors, and standardized care processes for chronic care management

IssueLab

Disease Surveillance Networks Initiative Asia: Final Evaluation

Author: Caridad A. Ancheta
Elaine S. Perez
Kerry Richter
Ma. Sandra B. Tempongko
Malee Sunpuwan
Ophelia M. Mendoza
Oranut Pachuen
Pratap Singhasivanon
Publication venue: SEAMEO TROPMED Network
Publication date: 12/12/2010

The DSN Initiative was launched in 2007 under the new strategy of the Rockefeller Foundation. The initiative intends: [1] To improve human resources for disease surveillance in developing countries, thus bolstering national capacity to monitor, report, and respond to outbreaks; [2] To support regional networks to promote collaboration in disease surveillance and response across countries; and [3] To build bridges between regional and global monitoring efforts. The purpose of the DSN evaluation in the Mekong region was twofold: [1] To inform the work and strategy of the Foundation, its grantees, and the broader field of disease surveillance, based on the experience of DSN investments in the Mekong region. More specifically, the evaluation will inform future directions and strategies for current areas of DSN Initiative work, particularly in Asia, and will highlight potential new areas of work and strategy; and [2] To provide accountability to the Rockefeller Foundation's board, staff, and stakeholders for the DSN funds spent in the Mekong region.

IssueLab