113 research outputs found

    Real-time classifiers from free-text for continuous surveillance of small animal disease

    Get PDF
    A wealth of information of epidemiological importance is held within unstructured narrative clinical records. Text mining provides computational techniques for extracting usable information from the language used to communicate between humans, including the spoken and written word. The aim of this work was to develop text-mining methodologies capable of rendering the large volume of information within veterinary clinical narratives accessible for research and surveillance purposes. The free-text records collated within the dataset of the Small Animal Veterinary Surveillance Network formed the development material and target of this work. The efficacy of pre-existent clinician-assigned coding applied to the dataset was evaluated and the nature of notation and vocabulary used in documenting consultations was explored and described. Consultation records were pre-processed to improve human and software readability, and software was developed to redact incidental identifiers present within the free-text. An automated system able to classify for the presence of clinical signs, utilising only information present within the free-text record, was developed with the aim that it would facilitate timely detection of spatio-temporal trends in clinical signs. Clinician-assigned main reason for visit coding provided a poor summary of the large quantity of information exchanged during a veterinary consultation and the nature of the coding and questionnaire triggering further obfuscated information. Delineation of the previously undocumented veterinary clinical sublanguage identified common themes and their manner of documentation, this was key to the development of programmatic methods. A rule-based classifier using logically-chosen dictionaries, sequential processing and data-masking redacted identifiers while maintaining research usability of records. Highly sensitive and specific free-text classification was achieved by applying classifiers for individual clinical signs within a context-sensitive scaffold, this permitted or prohibited matching dependent on the clinical context in which a clinical sign was documented. The mean sensitivity achieved within an unseen test dataset was 98.17 (74.47, 99.9)% and mean specificity 99.94 (77.1, 100.0)%. When used in combination to identify animals with any of a combination of gastrointestinal clinical signs, the sensitivity achieved was 99.44% (95% CI: 98.57, 99.78)% and specificity 99.74 (95% CI: 99.62, 99.83). This work illustrates the importance, utility and promise of free-text classification of clinical records and provides a framework within which this is possible whilst respecting the confidentiality of client and clinician

    Biomedical Information Extraction Pipelines for Public Health in the Age of Deep Learning

    Get PDF
    abstract: Unstructured texts containing biomedical information from sources such as electronic health records, scientific literature, discussion forums, and social media offer an opportunity to extract information for a wide range of applications in biomedical informatics. Building scalable and efficient pipelines for natural language processing and extraction of biomedical information plays an important role in the implementation and adoption of applications in areas such as public health. Advancements in machine learning and deep learning techniques have enabled rapid development of such pipelines. This dissertation presents entity extraction pipelines for two public health applications: virus phylogeography and pharmacovigilance. For virus phylogeography, geographical locations are extracted from biomedical scientific texts for metadata enrichment in the GenBank database containing 2.9 million virus nucleotide sequences. For pharmacovigilance, tools are developed to extract adverse drug reactions from social media posts to open avenues for post-market drug surveillance from non-traditional sources. Across these pipelines, high variance is observed in extraction performance among the entities of interest while using state-of-the-art neural network architectures. To explain the variation, linguistic measures are proposed to serve as indicators for entity extraction performance and to provide deeper insight into the domain complexity and the challenges associated with entity extraction. For both the phylogeography and pharmacovigilance pipelines presented in this work the annotated datasets and applications are open source and freely available to the public to foster further research in public health.Dissertation/ThesisDoctoral Dissertation Biomedical Informatics 201

    Social media mining under the COVID-19 context: Progress, challenges, and opportunities

    Full text link
    Social media platforms allow users worldwide to create and share information, forging vast sensing networks that allow information on certain topics to be collected, stored, mined, and analyzed in a rapid manner. During the COVID-19 pandemic, extensive social media mining efforts have been undertaken to tackle COVID-19 challenges from various perspectives. This review summarizes the progress of social media data mining studies in the COVID-19 contexts and categorizes them into six major domains, including early warning and detection, human mobility monitoring, communication and information conveying, public attitudes and emotions, infodemic and misinformation, and hatred and violence. We further document essential features of publicly available COVID-19 related social media data archives that will benefit research communities in conducting replicable and repro�ducible studies. In addition, we discuss seven challenges in social media analytics associated with their potential impacts on derived COVID-19 findings, followed by our visions for the possible paths forward in regard to social media-based COVID-19 investigations. This review serves as a valuable reference that recaps social media mining efforts in COVID-19 related studies and provides future directions along which the information harnessed from social media can be used to address public health emergencies

    Public health emergency preparedness and response capabilities : national standards for state, local, tribal, and territorial public health

    Get PDF
    In 2011, the Centers for Disease Control and Prevention (CDC) established the Public Health Preparedness Capabilities: National Standards for State and Local Planning, a set of 15 distinct, yet interrelated, capability standards designed to advance the emergency preparedness and response capacity of state and local public health systems. These standards pioneered a national capability-based framework that helped jurisdictional public health agencies structure emergency preparedness planning and further formalize their public health agency Emergency Support Function (ESF) #8 role(s) in partnership with emergency management agencies.CDC\u2019s 2018 Public Health Emergency Preparedness and Response Capabilities: National Standards for State, Local, Tribal, and Territorial Public Health include operational considerations that support the public health and medical components of the 32 core capabilities specified in the National Preparedness Goal. Jurisdictions should use these operational considerations to develop their public health agency response strategies in greater alignment with the jurisdictional public health agency ESF #8 role.Suggested Citation: Centers for Disease Control and Prevention (CDC). (2018). Public health emergency preparedness and response capabilities. Atlanta, GA: U.S. Department of Health and Human Services.CS290888-ACDC_PreparednesResponseCapabilities_October2018_Final_508.pdfIntroduction -- Using the Capability Standards for Strategic Planning -- At-A-Glance: Capability Definitions, Functions, and Summary of Changes -- Capability 1: Community Preparedness -- Capability 2: Community Recovery -- Capability 3: Emergency Operations Coordination -- Capability 4: Emergency Public Information and Warning -- Capability 5: Fatality Management -- Capability 6: Information Sharing -- Capability 7: Mass Care -- Capability 8: Medical Countermeasure Dispensing and Administration -- Capability 9: Medical Materiel Management and Distribution -- Capability 10: Medical Surge -- Capability 11: Nonpharmaceutical Interventions -- Capability 12: Public Health Laboratory Testing -- Capability 13: Public Health Surveillance and Epidemiological Investigation -- Capability 14: Responder Safety and Health -- Capability 15: Volunteer Management -- Glossary of Terms \u2013- Acknowledgements.201
    • …
    corecore