4 research outputs found

    Medical Named Entity Recognition (MedNER): Deep learning model for recognizing medical entities (drug, disease) from scientific texts

    Get PDF
    Medical Named Entity Recognition (MedNER) is an indispensable task in biomedical text mining. NER aims to recognize and categorize named entities in scientific literature, such as genes, proteins, diseases, and medications. This work is difficult due to the complexity of scientific language and the abundance of available material in the biomedical sector. Using domain-specific embedding and Bi-LSTM, we propose a novel NER model that employs deep learning approaches to improve the performance of NER on scientific publications. Our model gets 98% F1-score on a curated data-set of Covid-related scientific publications published in multiple web of science and pubmed indexed journals, significantly outperforming previous approaches deployed on the same data-set. Our findings illustrate the efficacy of our approach in reliably recognizing and classifying named entities (drug and disease) in scientific literature, opening the way for future developments in biomedical text mining

    Named Entity Recognition in Electronic Health Records: A Methodological Review

    Get PDF
    Objectives A substantial portion of the data contained in Electronic Health Records (EHR) is unstructured, often appearing as free text. This format restricts its potential utility in clinical decision-making. Named entity recognition (NER) methods address the challenge of extracting pertinent information from unstructured text. The aim of this study was to outline the current NER methods and trace their evolution from 2011 to 2022. Methods We conducted a methodological literature review of NER methods, with a focus on distinguishing the classification models, the types of tagging systems, and the languages employed in various corpora. Results Several methods have been documented for automatically extracting relevant information from EHRs using natural language processing techniques such as NER and relation extraction (RE). These methods can automatically extract concepts, events, attributes, and other data, as well as the relationships between them. Most NER studies conducted thus far have utilized corpora in English or Chinese. Additionally, the bidirectional encoder representation from transformers using the BIO tagging system architecture is the most frequently reported classification scheme. We discovered a limited number of papers on the implementation of NER or RE tasks in EHRs within a specific clinical domain. Conclusions EHRs play a pivotal role in gathering clinical information and could serve as the primary source for automated clinical decision support systems. However, the creation of new corpora from EHRs in specific clinical domains is essential to facilitate the swift development of NER and RE models applied to EHRs for use in clinical practice
    corecore