Search CORE

13 research outputs found

Extraction of chemical-induced diseases using prior knowledge and textual information

Author: Afzal Muhammad
Ahmad Akhondi Saber
Becker Benedikt
Kors Jan
Pons Ewoud
van Mulligen Erik
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2016
Field of study

We describe our approach to the chemical–disease relation (CDR) task in the BioCreative V challenge. The CDR task consists of two subtasks: automatic disease-named entity recognition and normalization (DNER), and extraction of chemical-induced diseases (CIDs) from Medline abstracts. For the DNER subtask, we used our concept recognition tool Peregrine, in combination with several optimization steps. For the CID subtask, our system, which we named RELigator, was trained on a rich feature set, comprising features derived from a graph database containing prior knowledge about chemicals and diseases, and linguistic and statistical features derived from the abstracts in the CDR training corpus. We describe the systems that were developed and present evaluation results for both subtasks on the CDR test set. For DNER, our Peregrine system reached an F-score of 0.757. For CID, the system achieved an F-score of 0.526, which ranked second among 18 participating teams. Several post-challenge modifications of the systems resulted in substantially improved F-scores (0.828 for DNER and 0.602 for CID). RELigator is available as a web service at http://biosemantics.org/index.php/software/religator

Crossref

EUR Research Repository

PubMed Central

ContextD: An algorithm to identify contextual properties of medical terms in a dutch clinical corpus

Author: Afzal M.Z. (Zubair)
Kang N. (Ning)
Kors J.A. (Jan)
Pons E. (Ewoud)
Schuemie M.J. (Martijn)
Sturkenboom M.C.J.M. (Miriam)
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 29/11/2014
Field of study

Background: In order to extract meaningful information from electronic medical records, such as signs and symptoms, diagnoses, and treatments, it is important to take into account the contextual properties of the identified information: negation, temporality, and experiencer. Most work on automatic identification of these contextual properties has been done on English clinical text. This study presents ContextD, an adaptation of the English ConText algorithm to the Dutch language, and a Dutch clinical corpus. Results: The ContextD algorithm utilized 41 unique triggers to identify the contextual properties in the clinical corpus. For the negation property, the algorithm obtained an F-score from 87% to 93% for the different document types. For the experiencer property, the F-score was 99% to 100%. For the historical and hypothetical values of the temporality property, F-scores ranged from 26% to 54% and from 13% to 44%, respectively. Conclusions: The ContextD showed good performance in identifying negation and experiencer property values across all Dutch clinical document types. Accurate identification of the temporality property proved to be difficult and requires further work. The anonymized and annotated Dutch clinical corpus can serve as a useful resource for further algorithm development

Erasmus University Digital Repository

Extraction of chemical-induced diseases using prior knowledge and textual information

Author: Afzal M.Z. (Zubair)
Akhondi S.A. (Saber)
Becker B.F.H. (Benedikt)
Kors J.A. (Jan)
Pons E. (Ewoud)
Van Mulligen E.M. (Erik M.)
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2016
Field of study

We describe our approach to the chemical-disease relation (CDR) task in the BioCreative V challenge. The CDR task consists of two subtasks: Automatic disease-named entity recognition and normalization (DNER), and extraction of chemical-induced diseases (CIDs) from Medline abstracts. For the DNER subtask, we used our concept recognition tool Peregrine, in combination with several optimization steps. For the CID subtask, our system, which we named RELigator, was trained on a rich feature set, comprising features derived from a graph database containing prior knowledge about chemicals and diseases, and linguistic and statistical features derived from the abstracts in the CDR training corpus. We describe the systems that were developed and present evaluation results for both subtasks on the CDR test set. For DNER, our Peregrine system reached an F-score of 0.757. For CID, the system achieved an F-score of 0.526, which ranked second among 18 participating teams. Several post-challenge modifications of the systems resulted in substantially improved F-scores (0.828 for DNER and 0.602 for CID)

Erasmus University Digital Repository

ContextD: an algorithm to identify contextual properties of medical terms in a Dutch clinical corpus

Author: A Vlug
C Friedman
C Friedman
E Apostolova
E Velldal
Ewoud Pons
GK Savova
H Harkema
H Kilicoglu
H Xu
I Goldin
J Cohen
Jan A Kors
L Deléger
LM Christensen
M Light
M Skeppstedt
Martijn J Schuemie
Miriam CJM Sturkenboom
Ning Kang
NP Cruz Díaz
O Bodenreider
O Uzuner
PB Jensen
PG Mutalik
PL Elkin
QT Zeng
RM Reeves
S Agarwal
S Goryachev
U Hahn
V Vincze
W Sun
WW Chapman
WW Chapman
Y Huang
Zubair Afzal
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Impact of guidelines for the management of minor head injury on the utilization and diagnostic yield of CT over two decades, using natural language processing in a large dataset

Author: Dippel Diederik
Foks Kelly
Hunink Myriam
Pons Ewoud
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Natural Language Processing in Radiology: A Systematic Review

Author: Braun Loes
Hunink Myriam
Kors Jan
Pons Ewoud
Publication venue: 'Radiological Society of North America (RSNA)'
Publication date: 01/01/2016
Field of study

Radiological reporting has generated large quantities of digital content within the electronic health record, which is potentially a valuable source of information for improving clinical care and supporting research. Although radiology reports are stored for communication and documentation of diagnostic imaging, harnessing their potential requires efficient and automated information extraction: they exist mainly as free-text clinical narrative, from which it is a major challenge to obtain structured data. Natural language processing (NLP) provides techniques that aid the conversion of text into a structured representation, and thus enables computers to derive meaning from human (ie, natural language) input. Used on radiology reports, NLP techniques enable automatic identification and extraction of information. By exploring the various purposes for their use, this review examines how radiology benefits from NLP. A systematic literature search identified 67 relevant publications describing NLP methods that support practical applications in radiology. This review takes a close look at the individual studies in terms of tasks (ie, the extracted information), the NLP methodology and tools used, and their application purpose and performance results. Additionally, limitations, future challenges, and requirements for advancing NLP in radiology will be discussed. (C) RSNA, 201

EUR Research Repository

Adrenal Incidentaloma and Adherence to International Guidelines for Workup Based on a Retrospective Review of the Type of Language Used in the Radiology Report

Author: Haan Romy
Pons Ewoud
Schreuder Marloes
Visser Jan-Jaap
Publication venue: 'Elsevier BV'
Publication date: 01/01/2019
Field of study

EUR Research Repository

ContextD: An algorithm to identify contextual properties of medical terms in a dutch clinical corpus

Author: Afzal Zubair
Kang Ning
Kors Jan
Pons Ewoud
Schuemie Martijn
Sturkenboom Miriam
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 29/11/2014
Field of study

textabstractBackground: In order to extract meaningful information from electronic medical records, such as signs and symptoms, diagnoses, and treatments, it is important to take into account the contextual properties of the identified information: negation, temporality, and experiencer. Most work on automatic identification of these contextual properties has been done on English clinical text. This study presents ContextD, an adaptation of the English ConText algorithm to the Dutch language, and a Dutch clinical corpus. Results: The ContextD algorithm utilized 41 unique triggers to identify the contextual properties in the clinical corpus. For the negation property, the algorithm obtained an F-score from 87% to 93% for the different document types. For the experiencer property, the F-score was 99% to 100%. For the historical and hypothetical values of the temporality property, F-scores ranged from 26% to 54% and from 13% to 44%, respectively. Conclusions: The ContextD showed good performance in identifying negation and experiencer property values across all Dutch clinical document types. Accurate identification of the temporality property proved to be difficult and requires further work. The anonymized and annotated Dutch clinical corpus can serve as a useful resource for further algorithm development

Springer - Publisher Connector

Patient-specific workup of adrenal incidentalomas

Author: Feelders Richard
Haan R.R.D. (Romy R. de)
Hunink Myriam
Kaymak Uzay
Pons Ewoud
Visser J.B.R. (Johannes B.R.)
Visser Jacob Johannes
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017
Field of study

Purpose: To develop a clinical prediction model to predict a clinically relevant adrenal disorder for patients with adrenal incidentaloma. Materials and methods: This retrospective study is approved by the institutional review board, with waiver of informed consent. Natural language processing is used for filtering of adrenal incidentaloma cases in all thoracic and abdominal CT reports from 2010 till 2012. A total of 635 patients are identified. Stepwise logistic regression is used to construct the prediction model. The model predicts if a patient is at risk for malignancy or hormonal hyperfunction of the adrenal gland at the moment of initial presentation, thus generates a predicted probability for every individual patient. The prediction model is evaluated on its usefulness in clinical practice using decision curve analysis (DCA) based on different threshold probabilities. For patients whose predicted probability is lower than the predetermined threshold probability, further workup could be omitted. Results: A prediction model is successfully developed, with an area under the curve (AUC) of 0.78. Results of the DCA indicate that up to 11% of patients with an adrenal incidentaloma can be avoided from unnecessary workup, with a sensitivity of 100% and specificity of 11%. Conclusion: A prediction model can accurately predict if an adrenal incidentaloma patient is at risk for malignancy or hormonal hyperfunction of the adrenal gland based on initial imaging features and patient demographics. However, with most adrenal incidentalomas labeled as nonfunctional adrenocortical adenomas requiring no further treatment, it is likely that more patients could be omitting from unnecessary diagnostics

Repository TU/e

Crossref

Pure OAI Repository

Directory of Open Access Journals

EUR Research Repository

Erasmus University Digital Repository

Patient-specific workup of adrenal incidentalomas

Author: Feelders Richard A
Haan Romy Rde
Hunink M GMyriam
Kaymak U Uzay
Pons Ewoud
Visser Jacob J
Visser JBR Job
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017
Field of study

\u3cp\u3ePurpose: To develop a clinical prediction model to predict a clinically relevant adrenal disorder for patients with adrenal incidentaloma. Materials and methods: This retrospective study is approved by the institutional review board, with waiver of informed consent. Natural language processing is used for filtering of adrenal incidentaloma cases in all thoracic and abdominal CT reports from 2010 till 2012. A total of 635 patients are identified. Stepwise logistic regression is used to construct the prediction model. The model predicts if a patient is at risk for malignancy or hormonal hyperfunction of the adrenal gland at the moment of initial presentation, thus generates a predicted probability for every individual patient. The prediction model is evaluated on its usefulness in clinical practice using decision curve analysis (DCA) based on different threshold probabilities. For patients whose predicted probability is lower than the predetermined threshold probability, further workup could be omitted. Results: A prediction model is successfully developed, with an area under the curve (AUC) of 0.78. Results of the DCA indicate that up to 11% of patients with an adrenal incidentaloma can be avoided from unnecessary workup, with a sensitivity of 100% and specificity of 11%. Conclusion: A prediction model can accurately predict if an adrenal incidentaloma patient is at risk for malignancy or hormonal hyperfunction of the adrenal gland based on initial imaging features and patient demographics. However, with most adrenal incidentalomas labeled as nonfunctional adrenocortical adenomas requiring no further treatment, it is likely that more patients could be omitting from unnecessary diagnostics.\u3c/p\u3

Repository TU/e