13 research outputs found

    Extraction of chemical-induced diseases using prior knowledge and textual information

    Get PDF
    We describe our approach to the chemical–disease relation (CDR) task in the BioCreative V challenge. The CDR task consists of two subtasks: automatic disease-named entity recognition and normalization (DNER), and extraction of chemical-induced diseases (CIDs) from Medline abstracts. For the DNER subtask, we used our concept recognition tool Peregrine, in combination with several optimization steps. For the CID subtask, our system, which we named RELigator, was trained on a rich feature set, comprising features derived from a graph database containing prior knowledge about chemicals and diseases, and linguistic and statistical features derived from the abstracts in the CDR training corpus. We describe the systems that were developed and present evaluation results for both subtasks on the CDR test set. For DNER, our Peregrine system reached an F-score of 0.757. For CID, the system achieved an F-score of 0.526, which ranked second among 18 participating teams. Several post-challenge modifications of the systems resulted in substantially improved F-scores (0.828 for DNER and 0.602 for CID). RELigator is available as a web service at http://biosemantics.org/index.php/software/religator

    ContextD: An algorithm to identify contextual properties of medical terms in a dutch clinical corpus

    Get PDF
    Background: In order to extract meaningful information from electronic medical records, such as signs and symptoms, diagnoses, and treatments, it is important to take into account the contextual properties of the identified information: negation, temporality, and experiencer. Most work on automatic identification of these contextual properties has been done on English clinical text. This study presents ContextD, an adaptation of the English ConText algorithm to the Dutch language, and a Dutch clinical corpus. Results: The ContextD algorithm utilized 41 unique triggers to identify the contextual properties in the clinical corpus. For the negation property, the algorithm obtained an F-score from 87% to 93% for the different document types. For the experiencer property, the F-score was 99% to 100%. For the historical and hypothetical values of the temporality property, F-scores ranged from 26% to 54% and from 13% to 44%, respectively. Conclusions: The ContextD showed good performance in identifying negation and experiencer property values across all Dutch clinical document types. Accurate identification of the temporality property proved to be difficult and requires further work. The anonymized and annotated Dutch clinical corpus can serve as a useful resource for further algorithm development

    Extraction of chemical-induced diseases using prior knowledge and textual information

    Get PDF
    We describe our approach to the chemical-disease relation (CDR) task in the BioCreative V challenge. The CDR task consists of two subtasks: Automatic disease-named entity recognition and normalization (DNER), and extraction of chemical-induced diseases (CIDs) from Medline abstracts. For the DNER subtask, we used our concept recognition tool Peregrine, in combination with several optimization steps. For the CID subtask, our system, which we named RELigator, was trained on a rich feature set, comprising features derived from a graph database containing prior knowledge about chemicals and diseases, and linguistic and statistical features derived from the abstracts in the CDR training corpus. We describe the systems that were developed and present evaluation results for both subtasks on the CDR test set. For DNER, our Peregrine system reached an F-score of 0.757. For CID, the system achieved an F-score of 0.526, which ranked second among 18 participating teams. Several post-challenge modifications of the systems resulted in substantially improved F-scores (0.828 for DNER and 0.602 for CID)

    Natural Language Processing in Radiology: A Systematic Review

    No full text
    Radiological reporting has generated large quantities of digital content within the electronic health record, which is potentially a valuable source of information for improving clinical care and supporting research. Although radiology reports are stored for communication and documentation of diagnostic imaging, harnessing their potential requires efficient and automated information extraction: they exist mainly as free-text clinical narrative, from which it is a major challenge to obtain structured data. Natural language processing (NLP) provides techniques that aid the conversion of text into a structured representation, and thus enables computers to derive meaning from human (ie, natural language) input. Used on radiology reports, NLP techniques enable automatic identification and extraction of information. By exploring the various purposes for their use, this review examines how radiology benefits from NLP. A systematic literature search identified 67 relevant publications describing NLP methods that support practical applications in radiology. This review takes a close look at the individual studies in terms of tasks (ie, the extracted information), the NLP methodology and tools used, and their application purpose and performance results. Additionally, limitations, future challenges, and requirements for advancing NLP in radiology will be discussed. (C) RSNA, 201

    ContextD: An algorithm to identify contextual properties of medical terms in a dutch clinical corpus

    No full text
    textabstractBackground: In order to extract meaningful information from electronic medical records, such as signs and symptoms, diagnoses, and treatments, it is important to take into account the contextual properties of the identified information: negation, temporality, and experiencer. Most work on automatic identification of these contextual properties has been done on English clinical text. This study presents ContextD, an adaptation of the English ConText algorithm to the Dutch language, and a Dutch clinical corpus. Results: The ContextD algorithm utilized 41 unique triggers to identify the contextual properties in the clinical corpus. For the negation property, the algorithm obtained an F-score from 87% to 93% for the different document types. For the experiencer property, the F-score was 99% to 100%. For the historical and hypothetical values of the temporality property, F-scores ranged from 26% to 54% and from 13% to 44%, respectively. Conclusions: The ContextD showed good performance in identifying negation and experiencer property values across all Dutch clinical document types. Accurate identification of the temporality property proved to be difficult and requires further work. The anonymized and annotated Dutch clinical corpus can serve as a useful resource for further algorithm development

    Patient-specific workup of adrenal incidentalomas

    Get PDF
    Purpose: To develop a clinical prediction model to predict a clinically relevant adrenal disorder for patients with adrenal incidentaloma. Materials and methods: This retrospective study is approved by the institutional review board, with waiver of informed consent. Natural language processing is used for filtering of adrenal incidentaloma cases in all thoracic and abdominal CT reports from 2010 till 2012. A total of 635 patients are identified. Stepwise logistic regression is used to construct the prediction model. The model predicts if a patient is at risk for malignancy or hormonal hyperfunction of the adrenal gland at the moment of initial presentation, thus generates a predicted probability for every individual patient. The prediction model is evaluated on its usefulness in clinical practice using decision curve analysis (DCA) based on different threshold probabilities. For patients whose predicted probability is lower than the predetermined threshold probability, further workup could be omitted. Results: A prediction model is successfully developed, with an area under the curve (AUC) of 0.78. Results of the DCA indicate that up to 11% of patients with an adrenal incidentaloma can be avoided from unnecessary workup, with a sensitivity of 100% and specificity of 11%. Conclusion: A prediction model can accurately predict if an adrenal incidentaloma patient is at risk for malignancy or hormonal hyperfunction of the adrenal gland based on initial imaging features and patient demographics. However, with most adrenal incidentalomas labeled as nonfunctional adrenocortical adenomas requiring no further treatment, it is likely that more patients could be omitting from unnecessary diagnostics

    Patient-specific workup of adrenal incidentalomas

    No full text
    \u3cp\u3ePurpose: To develop a clinical prediction model to predict a clinically relevant adrenal disorder for patients with adrenal incidentaloma. Materials and methods: This retrospective study is approved by the institutional review board, with waiver of informed consent. Natural language processing is used for filtering of adrenal incidentaloma cases in all thoracic and abdominal CT reports from 2010 till 2012. A total of 635 patients are identified. Stepwise logistic regression is used to construct the prediction model. The model predicts if a patient is at risk for malignancy or hormonal hyperfunction of the adrenal gland at the moment of initial presentation, thus generates a predicted probability for every individual patient. The prediction model is evaluated on its usefulness in clinical practice using decision curve analysis (DCA) based on different threshold probabilities. For patients whose predicted probability is lower than the predetermined threshold probability, further workup could be omitted. Results: A prediction model is successfully developed, with an area under the curve (AUC) of 0.78. Results of the DCA indicate that up to 11% of patients with an adrenal incidentaloma can be avoided from unnecessary workup, with a sensitivity of 100% and specificity of 11%. Conclusion: A prediction model can accurately predict if an adrenal incidentaloma patient is at risk for malignancy or hormonal hyperfunction of the adrenal gland based on initial imaging features and patient demographics. However, with most adrenal incidentalomas labeled as nonfunctional adrenocortical adenomas requiring no further treatment, it is likely that more patients could be omitting from unnecessary diagnostics.\u3c/p\u3
    corecore