Search CORE

83 research outputs found

Comparing Transformer-based NER approaches for analysing textual medical diagnoses

Author: Giovanni Semeraro
Marco de Gemmis
Marco Polignano
Publication venue
Publication date: 01/01/2021
Field of study

The automated analysis of medical documents has grown in research interest in recent years as a consequence of the social relevance of the thematic and the difficulties often encountered with short and very specific documents. In particular, this fervent area of research has stimulated the development of several techniques of automatic document classification, question answering, and name entity recognition (NER). Nevertheless, many open issues must be addressed to obtain results that are satisfactory for a field in which the effectiveness of predictions is a fundamental factor in order not to make mistakes that could compromise people’s lives. To this end, we focused on the name entity recognition task from medical documents and, in this work, we will discuss the results we obtained by our hybrid approach. In order to take advantage of the most relevant findings in the field of natural language processing, we decided to focus on deep neural network models. We compared several configurations of our model by varying the transformer architecture, such as BERT, RoBERTa and ELECTRA, until we obtained a configuration that we considered the best for our goals. The most promising model was used to participate in the SpRadIE task of the annual CLEF (Conference and Labs of the Evaluation Forum). The obtained results are encouraging and can be of reference for future studies on the topic

Archivio istituzionale della ricerca - Università di Bari

CLEF eHealth 2019 Evaluation Lab

Author: Assopardi Leif
Goeuriot Lorraine
Jimmy
Kanoulas Evangelos
Kelly Liadh
Li Dan
Neves Mariana
Palotti Joao
Spijker Rene
Suominen Hanna
Zuccon Guido
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Since 2012 CLEF eHealth has focused on evaluation resource building efforts around the easing and support of patients, their next-of-kins, clinical staff, and health scientists in understanding, accessing, and authoring eHealth information in a multilingual setting. This year’s lab offers three tasks: Task 1 on multilingual information extraction; Task 2 on technology assisted reviews in empirical medicine; and Task 3 on consumer health search in mono- and multilingual settings. Herein, we describe the CLEF eHealth evaluation series to-date and then present the 2019 tasks, evaluation methodology, and resources

MURAL - Maynooth University Research Archive Library

University of Strathclyde Institutional Repository

Hal - Université Grenoble Alpes

University of Canberra Research Repository

University of Surabaya Institutional Repository

International Migration, Integration and Social Cohesion online publications

UvA-DARE

University of Queensland eSpace

Crossref

NUI Maynooth Eprint Archive

Maynooth University ePrints and eTheses Archive

Extreme multi-label deep neural classification of Spanish health records according to the International Classification of Diseases

Author: Blanco Garcés Alberto
Publication venue
Publication date: 20/09/2022
Field of study

111 p.Este trabajo trata sobre la minería de textos clínicos, un campo del Procesamiento del Lenguaje Natural aplicado al dominio biomédico. El objetivo es automatizar la tarea de codificación médica. Los registros electrónicos de salud (EHR) son documentos que contienen información clínica sobre la salud de unpaciente. Los diagnósticos y procedimientos médicos plasmados en la Historia Clínica Electrónica están codificados con respecto a la Clasificación Internacional de Enfermedades (CIE). De hecho, la CIE es la base para identificar estadísticas de salud internacionales y el estándar para informar enfermedades y condiciones de salud. Desde la perspectiva del aprendizaje automático, el objetivo es resolver un problema extremo de clasificación de texto de múltiples etiquetas, ya que a cada registro de salud se le asignan múltiples códigos ICD de un conjunto de más de 70 000 términos de diagnóstico. Una cantidad importante de recursos se dedican a la codificación médica, una laboriosa tarea que actualmente se realiza de forma manual. Los EHR son narraciones extensas, y los codificadores médicos revisan los registros escritos por los médicos y asignan los códigos ICD correspondientes. Los textos son técnicos ya que los médicos emplean una jerga médica especializada, aunque rica en abreviaturas, acrónimos y errores ortográficos, ya que los médicos documentan los registros mientras realizan la práctica clínica real. Paraabordar la clasificación automática de registros de salud, investigamos y desarrollamos un conjunto de técnicas de clasificación de texto de aprendizaje profundo

Archivo Digital para la Docencia y la Investigación

Classification of Animal Experiments: A Reproducible Study. IMS Unipd at CLEF eHealth Task 1

Author: Di Nunzio Giorgio Maria
Publication venue: CEUR-WS.org
Publication date: 01/01/2019
Field of study

Archivio istituzionale della ricerca - Università di Padova

Overview of the CLEF eHealth Evaluation Lab 2019

Author: Azzopardi Leif
Goeuriot Lorraine
Kanoulas Evangelos
Kelly Liadh
Li Dan
Neves Mariana
Palotti Joao
Scells Harrisen
Spijker Rene
Suominen Hanna
Zuccon Guido
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2019
Field of study

In this paper, we provide an overview of the seventh annual edition of the CLEF eHealth evaluation lab. CLEF eHealth 2019 continues our evaluation resource building efforts around the easing and support of patients, their next-of-kins, clinical staff, and health scientists in understanding, accessing, and authoring electronic health information in a multilingual setting. This year’s lab advertised three tasks: Task 1 on indexing non-technical summaries of German animal experiments with International Classification of Diseases, Version 10 codes; Task 2 on technology assisted reviews in empirical medicine building on 2017 and 2018 tasks in English; and Task 3 on consumer health search in mono- and multilingual settings that builds on the 2013–18 Information Retrieval tasks. In total nine teams took part in these tasks (six in Task 1 and three in Task 2). Herein, we describe the resources created for these tasks and evaluation methodology adopted. We also provide a brief summary of participants of this year’s challenges and results obtained. As in previous years, the organizers have made data and tools associated with the lab tasks available for future research and development

MURAL - Maynooth University Research Archive Library

Cross-language Information Retrieval

Author: Galuščáková Petra
Nair Suraj
Oard Douglas W.
Publication venue
Publication date: 08/06/2022
Field of study

Two key assumptions shape the usual view of ranked retrieval: (1) that the searcher can choose words for their query that might appear in the documents that they wish to see, and (2) that ranking retrieved documents will suffice because the searcher will be able to recognize those which they wished to find. When the documents to be searched are in a language not known by the searcher, neither assumption is true. In such cases, Cross-Language Information Retrieval (CLIR) is needed. This chapter reviews the state of the art for CLIR and outlines some open research questions.Comment: 49 pages, 0 figure

arXiv.org e-Print Archive

A study of Machine Learning models for Clinical Coding of Medical Reports at CodiEsp 2020

Author: de Gemmis M.
Lops P.
Polignano M.
Semeraro G.
Suriano V.
Publication venue: CEUR-WS
Publication date: 01/01/2020
Field of study

The task of identifying one or more diseases associated with a patient’s clinical condition is often very complex, even for doctors and specialists. This process is usually time-consuming and has to take into account different aspects of what has occurred, including symptoms elicited and previous healthcare situations. The medical diagnosis is often provided to patients in the form of written paper without any correlation with a national or international standard. Even if the WHO (World Health Organization) released the ICD10 international glossary of diseases, almost no doctor has enough time to manually associate the patient’s clinical history with international codes. The CodiEsp task at CLEF 2020 addressed this issue by proposing the development of an automatic system to deal with this task. Our solution investigated different machine learning strategies in order to identify an approach to face that challenge. The main outcomes of the experiments showed that a strategy based on BERT for pre-filtering and one based on BiLSTMCNN-SelfAttention for classification provide valuable results. We carried out several experiments on a subset of the training set for tuning the final model submitted to the challenge. In particular, we analyzed the impact of the algorithm, the input encoding strategy, and the thresholds for multi-label classification. A set of experiments has been carried out also during a post hoc analysis. The experiments confirmed that the strategy submitted to the CodiEsp task is the best performing one among those evaluated, and it allowed us to obtain a final mean average error value on the test set equal to 0.202. To support future developments of the proposed approach and the replicability of the experiments we decided to make the source code publicly accessible

Archivio istituzionale della ricerca - Università di Bari

Supporting the Billing Process in Outpatient Medical Care: Automated Medical Coding Through Machine Learning

Author: Finze Nikola
Heinzl Armin
Hoffmann Philipp
Oberste Luis
Publication venue: AIS Electronic Library (AISeL)
Publication date: 01/01/2022
Field of study

Reimbursement in medical care implies significant administrative effort for medical staff. To bill the treatments or services provided, diagnosis and treatment codes must be assigned to patient records using standardized healthcare classification systems, which is a time-consuming and error-prone task. In contrast to ICD diagnosis codes used in most countries for inpatient care reimbursement, outpatient medical care often involves different reimbursement schemes. Following the Action Design Research methodology, we developed an NLP-based machine learning artifact in close collaboration with a general practitioner’s office in Germany, leveraging a dataset of over 5,600 patients with more than 63,000 billing codes. For the code prediction of most problematic treatments as well as a complete code prediction task, we achieved F1-scores of 93.60 % and 78.22 %, respectively. Throughout three iterations, we derived five meta requirements leading to three design principles for an automated coding system to support the reimbursement of outpatient medical care

MAnnheim DOCument Server

AIS Electronic Library (AISeL)

Search strategy formulation for systematic reviews: Issues, challenges and opportunities

Author: MacFarlane Andrew
Russell-Rose Tony
Shokraneh Farhad
Publication venue: 'Elsevier BV'
Publication date: 01/09/2022
Field of study

Systematic literature reviews play a vital role in identifying the best available evidence for health and social care research, policy, and practice. The resources required to produce systematic reviews can be significant, and a key to the success of any review is the search strategy used to identify relevant literature. However, the methods used to construct search strategies can be complex, time consuming, resource intensive and error prone. In this review, we examine the state of the art in resolving complex structured information needs, focusing primarily on the healthcare context. We analyse the literature to identify key challenges and issues and explore appropriate solutions and workarounds. From this analysis we propose a way forward to facilitate trust and to aid explainability and transparency, reproducibility and replicability through a set of key design principles for tools to support the development of search strategies in systematic literature reviews

Goldsmiths Research Online