54 research outputs found

    Data-driven approach for creating synthetic electronic medical records

    Background: New algorithms for disease outbreak detection are being developed to take advantage of full electronic medical records (EMRs) that contain a wealth of patient information. However, due to privacy concerns, even anonymized EMRs cannot be shared among researchers, resulting in great difficulty in comparing the effectiveness of these algorithms. To bridge the gap between novel bio-surveillance algorithms operating on full EMRs and the lack of non-identifiable EMR data, a method for generating complete and synthetic EMRs was developed.
    Methods: This paper describes a novel methodology for generating complete synthetic EMRs both for an outbreak illness of interest (tularemia) and for background records. The method developed has three major steps: 1) synthetic patient identity and basic information generation; 2) identification of care patterns that the synthetic patients would receive based on the information present in real EMR data for similar health problems; 3) adaptation of these care patterns to the synthetic patient population.
    Results: We generated EMRs, including visit records, clinical activity, laboratory orders/results and radiology orders/results for 203 synthetic tularemia outbreak patients. Validation of the records by a medical expert revealed problems in 19% of the records; these were subsequently corrected. We also generated background EMRs for over 3000 patients in the 4-11 yr age group. Validation of those records by a medical expert revealed problems in fewer than 3% of these background patient EMRs and the errors were subsequently rectified.
    Conclusions: A data-driven method was developed for generating fully synthetic EMRs. The method is general and can be applied to any data set that has similar data elements (such as laboratory and radiology orders and results, clinical activity, prescription orders). The pilot synthetic outbreak records were for tularemia but our approach may be adapted to other infectious diseases. The pilot synthetic background records were in the 4-11 year old age group. The adaptations that must be made to the algorithms to produce synthetic background EMRs for other age groups are indicated.

    The AFHSC-Division of GEIS Operations Predictive Surveillance Program: a multidisciplinary approach for the early detection and response to disease outbreaks

    The Armed Forces Health Surveillance Center, Division of Global Emerging Infections Surveillance and Response System Operations (AFHSC-GEIS) initiated a coordinated, multidisciplinary program to link data sets and information derived from eco-climatic remote sensing activities, ecologic niche modeling, arthropod vector, animal disease-host/reservoir, and human disease surveillance for febrile illnesses, into a predictive surveillance program that generates advisories and alerts on emerging infectious disease outbreaks. The program’s ultimate goal is pro-active public health practice through pre-event preparedness, prevention and control, and response decision-making and prioritization. This multidisciplinary program is rooted in over 10 years of experience in predictive surveillance for Rift Valley fever outbreaks in Eastern Africa. The AFHSC-GEIS Rift Valley fever project is based on the identification and use of disease-emergence critical detection points as reliable signals for increased outbreak risk. The AFHSC-GEIS predictive surveillance program has formalized the Rift Valley fever project into a structured template for extending predictive surveillance capability to other Department of Defense (DoD)-priority vector- and water-borne, and zoonotic diseases and geographic areas. These include leishmaniasis, malaria, and Crimea-Congo and other viral hemorrhagic fevers in Central Asia and Africa, dengue fever in Asia and the Americas, Japanese encephalitis (JE) and chikungunya fever in Asia, and rickettsial and other tick-borne infections in the U.S., Africa and Asia.

    Correction for Johansson et al., An open challenge to advance probabilistic forecasting for dengue epidemics.

    Correction for “An open challenge to advance probabilistic forecasting for dengue epidemics,” by Michael A. Johansson, Karyn M. Apfeldorf, Scott Dobson, Jason Devita, Anna L. Buczak, Benjamin Baugher, Linda J. Moniz, Thomas Bagley, Steven M. Babin, Erhan Guven, Teresa K. Yamana, Jeffrey Shaman, Terry Moschou, Nick Lothian, Aaron Lane, Grant Osborne, Gao Jiang, Logan C. Brooks, David C. Farrow, Sangwon Hyun, Ryan J. Tibshirani, Roni Rosenfeld, Justin Lessler, Nicholas G. Reich, Derek A. T. Cummings, Stephen A. Lauer, Sean M. Moore, Hannah E. Clapham, Rachel Lowe, Trevor C. Bailey, Markel García-Díez, Marilia Sá Carvalho, Xavier Rodó, Tridip Sardar, Richard Paul, Evan L. Ray, Krzysztof Sakrejda, Alexandria C. Brown, Xi Meng, Osonde Osoba, Raffaele Vardavas, David Manheim, Melinda Moore, Dhananjai M. Rao, Travis C. Porco, Sarah Ackley, Fengchen Liu, Lee Worden, Matteo Convertino, Yang Liu, Abraham Reddy, Eloy Ortiz, Jorge Rivero, Humberto Brito, Alicia Juarrero, Leah R. Johnson, Robert B. Gramacy, Jeremy M. Cohen, Erin A. Mordecai, Courtney C. Murdock, Jason R. Rohr, Sadie J. Ryan, Anna M. Stewart-Ibarra, Daniel P. Weikel, Antarpreet Jutla, Rakibul Khan, Marissa Poultney, Rita R. Colwell, Brenda Rivera-García, Christopher M. Barker, Jesse E. Bell, Matthew Biggerstaff, David Swerdlow, Luis Mier-y-Teran-Romero, Brett M. Forshey, Juli Trtanj, Jason Asher, Matt Clay, Harold S. Margolis, Andrew M. Hebbeler, Dylan George, and Jean-Paul Chretien, which was first published November 11, 2019; 10.1073/pnas.1909865116. The authors note that the affiliation for Xavier Rodó should instead appear as Catalan Institution for Research and Advanced Studies (ICREA) and Climate and Health Program, Barcelona Institute for Global Health (ISGlobal). The corrected author and affiliation lines appear below. The online version has been corrected

    An open challenge to advance probabilistic forecasting for dengue epidemics.

    A wide range of research has promised new tools for forecasting infectious disease dynamics, but little of that research is currently being applied in practice, because tools do not address key public health needs, do not produce probabilistic forecasts, have not been evaluated on external data, or do not provide sufficient forecast skill to be useful. We developed an open collaborative forecasting challenge to assess probabilistic forecasts for seasonal epidemics of dengue, a major global public health problem. Sixteen teams used a variety of methods and data to generate forecasts for 3 epidemiological targets (peak incidence, the week of the peak, and total incidence) over 8 dengue seasons in Iquitos, Peru and San Juan, Puerto Rico. Forecast skill was highly variable across teams and targets. While numerous forecasts showed high skill for midseason situational awareness, early season skill was low, and skill was generally lowest for high incidence seasons, those for which forecasts would be most valuable. A comparison of modeling approaches revealed that average forecast skill was lower for models including biologically meaningful data and mechanisms and that both multimodel and multiteam ensemble forecasts consistently outperformed individual model forecasts. Leveraging these insights, data, and the forecasting framework will be critical to improve forecast skill and the application of forecasts in real time for epidemic preparedness and response. Moreover, key components of this project (integration with public health needs, a common forecasting framework, shared and standardized data, and open participation) can help advance infectious disease forecasting beyond dengue.
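    To make the ensemble idea in this abstract concrete, here is a minimal sketch of an equal-weight ensemble of binned probability forecasts. The abstract does not say how forecasts were binned, weighted, or scored, so the bins, the two example models, the equal weighting, and the use of the logarithmic score are all assumptions chosen for illustration.

```python
import math


def ensemble_mean(forecasts):
    """Equal-weight average of several binned probability forecasts."""
    n_bins = len(forecasts[0])
    if any(len(f) != n_bins for f in forecasts):
        raise ValueError("all forecasts must use the same bins")
    return [sum(f[i] for f in forecasts) / len(forecasts) for i in range(n_bins)]


def log_score(forecast, observed_bin, floor=1e-10):
    """Log of the probability assigned to the observed bin (higher is better).

    The floor avoids log(0); its value is an arbitrary choice for this sketch.
    """
    return math.log(max(forecast[observed_bin], floor))


# Hypothetical forecasts for one target (e.g. the week of the peak) over 4 bins.
model_a = [0.7, 0.1, 0.1, 0.1]
model_b = [0.1, 0.1, 0.1, 0.7]
ensemble = ensemble_mean([model_a, model_b])

observed_bin = 0  # hypothetical outcome
member_scores = [log_score(m, observed_bin) for m in (model_a, model_b)]
ensemble_score = log_score(ensemble, observed_bin)
mean_member_score = sum(member_scores) / len(member_scores)
```

    Because the logarithm is concave, the log score of an averaged forecast is never below the average log score of its members. That is a property of this particular scoring rule, not a reproduction of the challenge's results.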

    A Survey of Data Mining and Machine Learning Methods for Cyber Security Intrusion Detection


    Dynamic agent composition from semantic web services

    The shift from Web pages to Web services enables programmatic access to the near limitless information on the World Wide Web. Autonomous agents should generate concise answers to complex questions by invoking the right services with the right data. However, traditional methods of programming automated query processing capabilities are inadequate for two reasons: as Web services become more abundant, it becomes difficult to manually formulate the query process; and, services may be temporarily unavailable – typically just when they are needed. We have created a tool called Meta-Planning for Agent Composition (MPAC) that dynamically builds agents to solve a user-defined goal using a select, currently available set of services. MPAC relies on a planning algorithm and semantic descriptions of services in the Web Ontology Language/Resource Description Framework (OWL/RDF) and the Web Ontology Language-Services (OWL-S) frameworks. Our novel approach for building these agents is domain independent. It assumes that semantic descriptions of services and a registry of currently available services will be available, as envisioned by the Semantic Web community. Once an information goal is expressed through the ontology of the Web service descriptions, MPAC determines the right sequence of service invocations. To illustrate our approach, we describe a proof-of-concept application in a maritime navigation domain.
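    The idea of choosing a sequence of service invocations from those currently available can be sketched with a simple forward-chaining search. This is not MPAC's planning algorithm, which the abstract does not detail. The service names, their inputs and outputs, and the `compose` function are hypothetical, and plain Python sets stand in for the OWL-S semantic descriptions.

```python
def compose(services, available, known, goal):
    """Forward-chain over available services until the goal concept is produced.

    services:  dict mapping service name -> (set of inputs, set of outputs)
    available: names of services currently reachable (the "registry")
    known:     concepts the agent already has
    goal:      concept the user wants
    Returns an ordered list of service names, or None if no plan is found.
    The plan is not guaranteed to be minimal.
    """
    known = set(known)
    plan = []
    progress = True
    while goal not in known and progress:
        progress = False
        for name in sorted(available):
            inputs, outputs = services[name]
            # Invoke a service only if it is usable and adds something new.
            if name not in plan and inputs <= known and not outputs <= known:
                plan.append(name)
                known |= outputs
                progress = True
                break
    return plan if goal in known else None


# Hypothetical maritime navigation services (illustrative only).
services = {
    "vessel_position": ({"ship_name"}, {"position"}),
    "weather_forecast": ({"position"}, {"forecast"}),
    "route_planner": ({"position", "forecast", "destination"}, {"route_plan"}),
}
start = {"ship_name", "destination"}

plan_all = compose(services, set(services), start, "route_plan")
plan_degraded = compose(
    services, {"vessel_position", "route_planner"}, start, "route_plan"
)
```

    With all three services registered, the sketch chains position lookup, weather forecast, and route planning in that order. When the weather service is left out of the available set, no plan reaches the goal and the function returns None.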