26 research outputs found

MeSH indexing based on automatically generated summaries

Author: Alan R Aronson
Alberto Díaz
Antonio J Jimeno-Yepes
James G Mork
Laura Plaza
Publication venue: Springer Nature
Publication date: 01/01/2013

BACKGROUND: MEDLINE citations are manually indexed at the U.S. National Library of Medicine (NLM) using as reference the Medical Subject Headings (MeSH) controlled vocabulary. For this task, the human indexers read the full text of the article. Due to the growth of MEDLINE, the NLM Indexing Initiative explores indexing methodologies that can support the task of the indexers. Medical Text Indexer (MTI) is a tool developed by the NLM Indexing Initiative to provide MeSH indexing recommendations to indexers. Currently, the input to MTI is MEDLINE citations, title and abstract only. Previous work has shown that using full text as input to MTI increases recall, but decreases precision sharply. We propose using summaries generated automatically from the full text for the input to MTI to use in the task of suggesting MeSH headings to indexers. Summaries distill the most salient information from the full text, which might increase the coverage of automatic indexing approaches based on MEDLINE. We hypothesize that if the results were good enough, manual indexers could possibly use automatic summaries instead of the full texts, along with the recommendations of MTI, to speed up the process while maintaining high quality of indexing results. RESULTS: We have generated summaries of different lengths using two different summarizers, and evaluated the MTI indexing on the summaries using different algorithms: MTI, individual MTI components, and machine learning. The results are compared to those of full text articles and MEDLINE citations. Our results show that automatically generated summaries achieve similar recall but higher precision compared to full text articles. Compared to MEDLINE citations, summaries achieve higher recall but lower precision. CONCLUSIONS: Our results show that automatic summaries produce better indexing than full text articles. 
Summaries produce similar recall to full text but much better precision, which seems to indicate that automatic summaries can efficiently capture the most important contents within the original articles. The combination of MEDLINE citations and automatically generated summaries could improve the recommendations suggested by MTI. On the other hand, indexing performance might be dependent on the MeSH heading being indexed. Summarization techniques could thus be considered as a feature selection algorithm that might have to be tuned individually for each MeSH heading.
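The abstract above compares inputs by the precision and recall of the suggested MeSH headings against the manual indexing. A minimal sketch of that comparison follows, assuming set-based matching of heading strings; the function name and the toy headings are illustrative and are not taken from the paper or from MTI.

```python
def precision_recall(suggested, gold):
    """Compare suggested MeSH headings with the manually assigned ones.

    Precision: fraction of suggested headings that the indexers also chose.
    Recall: fraction of the indexers' headings that were suggested.
    """
    suggested, gold = set(suggested), set(gold)
    true_positives = len(suggested & gold)
    precision = true_positives / len(suggested) if suggested else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall


# Toy example (hypothetical headings for a single article).
gold = ["Humans", "MEDLINE", "Medical Subject Headings", "Abstracting and Indexing"]
suggested = ["Humans", "MEDLINE", "Algorithms"]
precision, recall = precision_recall(suggested, gold)
```

Under this reading, a longer input such as the full text tends to add more suggested headings, which can raise recall while lowering precision, as the abstract reports.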

Springer - Publisher Connector

PubMed Central

Knowledge-based biomedical word sense disambiguation: comparison of approaches

Author: A Aronson
A Jimeno-Yepes
Alan R Aronson
Antonio J Jimeno-Yepes
B McInnes
C Leacock
D Alexopoulou
D Demner-Fushman
D Rebholz-Schuhmann
E Agirre
F Vasilescu
G Leroy
J Mork
M Joshi
M Lesk
M Schuemie
M Stevenson
M Weeber
S Gaudan
S Humphrey
Publication venue: BioMed Central
Publication date: 01/01/2010

Abstract Background Word sense disambiguation (WSD) algorithms attempt to select the proper sense of ambiguous terms in text. Resources like the UMLS provide a reference thesaurus to be used to annotate the biomedical literature. Statistical learning approaches have produced good results, but the size of the UMLS makes the production of training data infeasible to cover all the domain. Methods We present research on existing WSD approaches based on knowledge bases, which complement the studies performed on statistical learning. We compare four approaches which rely on the UMLS Metathesaurus as the source of knowledge. The first approach compares the overlap of the context of the ambiguous word to the candidate senses based on a representation built out of the definitions, synonyms and related terms. The second approach collects training data for each of the candidate senses to perform WSD based on queries built using monosemous synonyms and related terms. These queries are used to retrieve MEDLINE citations. Then, a machine learning approach is trained on this corpus. The third approach is a graph-based method which exploits the structure of the Metathesaurus network of relations to perform unsupervised WSD. This approach ranks nodes in the graph according to their relative structural importance. The last approach uses the semantic types assigned to the concepts in the Metathesaurus to perform WSD. The context of the ambiguous word and semantic types of the candidate concepts are mapped to Journal Descriptors. These mappings are compared to decide among the candidate concepts. Results are provided estimating accuracy of the different methods on the WSD test collection available from the NLM. Conclusions We have found that the last approach achieves better results compared to the other methods. 
The graph-based approach, using the structure of the Metathesaurus network to estimate the relevance of the Metathesaurus concepts, does not perform well compared to the first two methods. In addition, the combination of methods improves the performance over the individual approaches. On the other hand, the performance is still below statistical learning trained on manually produced data and below the maximum frequency sense baseline. Finally, we propose several directions to improve the existing methods and to improve the Metathesaurus to be more effective in WSD.
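The first approach in the abstract above scores each candidate sense by how much its profile (definitions, synonyms and related terms) overlaps with the context of the ambiguous word. A minimal sketch of that idea follows, assuming plain token overlap; the sense names and profile texts are made up for illustration and are not drawn from the UMLS, and the paper's actual representation and weighting may differ.

```python
import re


def tokenize(text):
    """Lowercase a string and return its set of alphabetic tokens."""
    return set(re.findall(r"[a-z]+", text.lower()))


def disambiguate(context, sense_profiles):
    """Pick the sense whose profile shares the most tokens with the context."""
    context_tokens = tokenize(context)
    scores = {
        sense: len(context_tokens & tokenize(profile))
        for sense, profile in sense_profiles.items()
    }
    return max(scores, key=scores.get), scores


# Hypothetical profiles for the ambiguous term "cold".
profiles = {
    "common_cold": "viral infection of the upper respiratory tract with cough and sore throat",
    "cold_temperature": "low temperature exposure freezing weather hypothermia",
}
context = "The patient presented with cough and a sore throat after a viral infection."
best_sense, overlap_scores = disambiguate(context, profiles)
```

This sketch counts function words such as "the" and "with" in the overlap; a practical version would likely remove stop words or weight terms.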

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

University of Melbourne Institutional Repository

Exploiting MeSH indexing in MEDLINE to generate a data set for word sense disambiguation

Author: A Jimeno
A Jimeno-Yepes
A Schwartz
A Yeh
Alan R Aronson
Antonio J Jimeno-Yepes
B McInnes
Bridget T McInnes
C Leacock
C Manning
G Leroy
H Liu
J Fan
L Hirschman
M Stevenson
M Weeber
P Pezik
R Leaman
S Gaudan
S Humphrey
T Pedersen
WA Gale
Publication venue: BioMed Central
Publication date: 01/01/2011

Abstract Background Evaluation of Word Sense Disambiguation (WSD) methods in the biomedical domain is difficult because the available resources are either too small or too focused on specific types of entities (e.g. diseases or genes). We present a method that can be used to automatically develop a WSD test collection using the Unified Medical Language System (UMLS) Metathesaurus and the manual MeSH indexing of MEDLINE. We demonstrate the use of this method by developing such a data set, called MSH WSD. Methods In our method, the Metathesaurus is first screened to identify ambiguous terms whose possible senses consist of two or more MeSH headings. We then use each ambiguous term and its corresponding MeSH heading to extract MEDLINE citations where the term and only one of the MeSH headings co-occur. The term found in the MEDLINE citation is automatically assigned the UMLS CUI linked to the MeSH heading. Each instance has been assigned a UMLS Concept Unique Identifier (CUI). We compare the characteristics of the MSH WSD data set to the previously existing NLM WSD data set. Results The resulting MSH WSD data set consists of 106 ambiguous abbreviations, 88 ambiguous terms and 9 which are a combination of both, for a total of 203 ambiguous entities. For each ambiguous term/abbreviation, the data set contains a maximum of 100 instances per sense obtained from MEDLINE. We evaluated the reliability of the MSH WSD data set using existing knowledge-based methods and compared their performance to that of the results previously obtained by these algorithms on the pre-existing data set, NLM WSD. We show that the knowledge-based methods achieve different results but keep their relative performance except for the Journal Descriptor Indexing (JDI) method, whose performance is below the other methods. Conclusions The MSH WSD data set allows the evaluation of WSD algorithms in the biomedical domain. 
Compared to previously existing data sets, MSH WSD contains a larger number of biomedical terms/abbreviations and covers the largest set of UMLS Semantic Types. Furthermore, the MSH WSD data set has been generated automatically reusing already existing annotations and, therefore, can be regenerated from subsequent UMLS versions.
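The extraction step described in the abstract above (keep a citation only when the ambiguous term occurs and exactly one of its candidate MeSH headings was assigned, then label it with the linked CUI, up to 100 instances per sense) can be sketched as follows. The record layout, the function name and the placeholder identifiers "CUI_A" and "CUI_B" are assumptions made for illustration; the substring match is a simplification of whatever term matching the authors used.

```python
from collections import defaultdict


def build_instances(term, heading_to_cui, citations, max_per_sense=100):
    """Collect citation IDs per sense for one ambiguous term.

    heading_to_cui maps each candidate MeSH heading to its linked CUI.
    citations is a list of dicts with "pmid", "text" and "mesh" keys.
    """
    candidates = set(heading_to_cui)
    instances = defaultdict(list)
    for citation in citations:
        # The ambiguous term must occur in the title/abstract text.
        if term.lower() not in citation["text"].lower():
            continue
        # Exactly one candidate heading may be present in the indexing.
        matched = candidates & set(citation["mesh"])
        if len(matched) != 1:
            continue
        cui = heading_to_cui[next(iter(matched))]
        if len(instances[cui]) < max_per_sense:
            instances[cui].append(citation["pmid"])
    return dict(instances)


# Toy data; the CUIs are placeholders, not real UMLS identifiers.
heading_to_cui = {"Common Cold": "CUI_A", "Cold Temperature": "CUI_B"}
citations = [
    {"pmid": "1", "text": "Zinc for the common cold", "mesh": ["Common Cold", "Zinc"]},
    {"pmid": "2", "text": "Cold exposure and hypothermia", "mesh": ["Cold Temperature"]},
    {"pmid": "3", "text": "Cold weather and the common cold", "mesh": ["Common Cold", "Cold Temperature"]},
    {"pmid": "4", "text": "Hypothermia management", "mesh": ["Cold Temperature"]},
]
result = build_instances("cold", heading_to_cui, citations)
```

Citation 3 is dropped because both candidate headings were assigned, and citation 4 because the term does not occur in its text.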

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Reuse of terminological resources for efficient ontological engineering in Life Sciences

Author: A Anjum
A Jimeno-Yepes
A Kalyanpur
A Miles
A Tsymbal
Antonio Jimeno-Yepes
B Cuenca Grau
C Caracciolo
C Rosse
CM Duffy
Dietrich Rebholz-Schuhmann
E Beisswanger
E Jimenez-Ruiz
EM Ogilvie
Ernesto Jiménez-Ruiz
G Hirst
HS Pinto
I Horrocks
I Spasić
J Freund
K Frantzi
M Fernandez
O Bodenreider
P Bouquet
P Lambrix
P Shvaiko
R Berlanga
R Berlanga-Llavori
Rafael Berlanga-Llavori
S Schlobach
S Zillner
T Hauer
TR Gruber
V Nebot
V Viswanath
Publication venue: BioMed Central
Publication date: 01/01/2009

This paper is intended to explore how to use terminological resources for ontology engineering. Nowadays there are several biomedical ontologies describing overlapping domains, but there is not a clear correspondence between the concepts that are supposed to be equivalent or just similar. These resources are quite precious but their integration and further development are expensive. Terminologies may support the ontological development in several stages of the lifecycle of the ontology; e.g. ontology integration. In this paper we investigate the use of terminological resources during the ontology lifecycle. We claim that the proper creation and use of a shared thesaurus is a cornerstone for the successful application of the Semantic Web technology within life sciences. Moreover, we have applied our approach to a real scenario, the Health-e-Child (HeC) project, and we have evaluated the impact of filtering and re-organizing several resources. As a result, we have created a reference thesaurus for this project, named HeCTh.

Crossref

City Research Online

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Springer - Publisher Connector

PubMed Central

Repositori Institucional de la Universitat Jaume I

Oxford University Research Archive

University of Melbourne Institutional Repository

Studying the correlation between different word sense disambiguation methods and summarization effectiveness in biomedical texts

Author: A Jimeno-Yepes
Alan R Aronson
Alberto Díaz
Antonio J Jimeno-Yepes
AR Aronson
B McInnes
BT McInnes
C Leacock
CY Lin
E Agirre
F Martínez
F Vasilescu
G Erkan
I Mani
J Carrillo de Albornoz
J Gómez
J Kupiec
L Hunter
L Plaza
Laura Plaza
LH Reeve
M Apidianaki
M Fiszman
M Jaoua
M Joshi
M Lesk
M Schuemie
M Stevenson
M Weeber
R Barzilay
R Mihalcea
S Brin
S Teufel
SD Afantenos
SE Shooshan
SM Humphrey
TC Rindflesch
Z Shi
Publication venue: Springer Science and Business Media LLC

Crossref

Early mobilisation in critically ill COVID-19 patients: a subanalysis of the ESICM-initiated UNITE-COVID observational study

Author: Abad-Motos Ane
Abdalla Maged
Abusalama Abdurraouf
Achterberg Sefanja
Acosta Guillermo Pérez
Adrião Diana
Ahmed Amna
Aires Buenos
Akbas Türkay
Akkari Abdel Rauof
Al-Bayati Sama
Al-Sadawi Mohammed
Alasia Datonye
Aleef Mohamed
Alfishawy Mostafa
Aljuhani Ohoud
Alkhalaf Amina
Alkhatteb Mohamed
Allan John
Alsabbah Asad
Alwraidat Mohamed
Anselmo Mónica
Antonelli Massimo
Arabi Yaseen
Assal Hebatallah
Awad Ahmed K.
Azab Mohammed A.
Azoulay Elie
Azzam Ahmed
Baccanelli Federica
Badie Julio
Baiou Anas
Bakdach Dana
Barling Danny
Barrueco-Francioni Jesús Emilio
Beck Oliver
Beishuizen Albertus
Berdaguer Ferrari Fernando
Berezo-Garcia José Angel
Berkius Johan
Bethlehem Carina
Bihariesingh-Sanchit Rosita
Binnie Alexandra
Bintaher Awadh
Biston Patrick
Bonell-Goytisolo José M.
Bota Marc
Boulanger Carole
Broadhurst Phil
Bulpa Pierre
Bílír Yelíz
Cabrera Luciano Santana
Cagova Lenka
Campbell Zoë
Camões João
Canedo Nancy
Catalan-Monzon Ignacio
Cecconi Maurizio
Ceruti Samuele
Charron Mariane
Chavez Alejandro Esquivel
Chrisment Anne
Chrysanthopoulou Evangelia
Citerio Giuseppe
Cochrane Anthony
Collin Vincent
Concha Pablo
Cook Martin
Cornet Alexander D.
Cosar Ahmet
Costa Vasco
Cowton Amanda
Crapelli Giulia Beatrice
Cuadrado Marta Martin
Cubero Patricia Jimeno
Cudia Antonella
Cuenca-Rubio Cristina
Cunha Pedro
Davey Miriam
Davydova Liubov
De Backer Daniel
De Buyser Wim
de Cabo Carlos Munoz
De Corte Thomas
de Groot Marcel
de Jong Ben
de Jong Celestine
de Pablo Sánchez Raúl
De Pascale Gennaro
De Rosa Silvia
De Schryver Nicolas
De Waele Elisabeth
De Waele Jan
de Wijs Calvin
Dean James T.
del Carmen Conesa Maria Lorente
del Pilar García-Bonillo Mª
Delgado Maria Cruz Martin
Dementienko Mariia
den Boer Sylvia
Dendane Tarek
Dereli Necla
Donadello Katia
Donnelly Adrian
Duric Natalie
Duska Frantisek
Ekren Pervin Korkmaz
Eksarko Polikseni
Elay Gülseren
elbuzidi Abdurrahmaan Suei
Eldaly Abdullah
Elhadi Muhammed
Eller Philipp
Ellervee Anneli
Ellis Caroline
Elmandouh Omar
Elrabi Omar
Eroglu Ahmet
Espina Lorena Forcelledo
Eyüpoğlu Selin
Ferrando Carlos
Ferrigno Gerardo
Fichtner Falk
Ficial Barbara
Florio Gaetano
Foulon Pierre
Fraga Xiana Taboada
Fraile Virginia
Franco Daniel Molano
Franquesa-Gonzalez Enric
Frenzel Tim
Frutos-Vivar Fernando
Fuest Kristina
Galal Islam
Galarza Laura
García Raquel Rodrígez
Gareth Allen
Garrioch Sweyn
Gavrilova Elena
Geagea Anna
Gervin Kevin
Ghannam Madihah E.
Ghozy Sherief
Gira Alicia
Girbes Armand R. J.
Glotta Andrea
Golden David
Gonzalez Daniel Rodriguez
Gonzalez Francisco Muñoyerro
Gonçalves Celina
Gottin Leonardo
Gotz Vera Nina
Grady Bart
Graf Jerónimo
Graham Sam
Grauslyte Lina
Greco Massimiliano
Grecu Irina
Grip Jonathan
Groeneveld Melanie
Grunow Julius J.
Guzzardella Amedeo
Haentjens Lionel
Hagan Samantha
Halacli Burcin
Hall Chris
Hamid Tarikul
Hammouda Ahmed
Hanlon Katie
Haque Injamam Ull
Harding Daniel
Henning Jeremy
Hernandez Aaron Mark
Herrmann Johannes
Higenbottam Caroline V.
Ho Vui Kian
Humaid Felwa Bin
Husain Ahmed
Hussein Aliae Mohamed
Ilieva Viktoria
Ioan Ana-Maria
Isoni Paolo
Izdes Seval
Jain Susan
Jawa Randeep
Jayyab Mustafa Abu
Jesus Montelongo Felipe De
Jimenez Jorge
Jog Sameer
Jones Nicola
Jubb Alasdair
Kalvit Kushal
Kamble Shruthi
Kansal Amit
Katz David
Kaya Ebru
Kent Melanie
Kesecioglu Jozef
Khlafalla Safa
Kilsand Kristina
Kloss Philipp
Kranen Hetty
Krupa Ivan
Kuhail Ahmed
Kumar Ashok
Kviatkovske Orinta
Labarca Eduardo
Lago Gustavo
Lahmer Tobias
Larrañaga Leire
Leganes Nieves Cruza
Legaristi Noemi
Linde Francisca Arbol
Lindholz Maximilian
Lorenz Marco
Loveleena Gupta
Lucas Juan Higuera
Lumlertgul Nuttha
Mahmoodpoor Ata
Marczin Nandor
Markou Nikolaos
Martin Belén Civantos
Martinez Felipe
Maslamani Muna Al
McCarthy Aine
McMahon Sean
Mehagnoul-Schipper Jannet
Mellinghoff Johannes
Mengi Tuçe
Mensink Roos
Meshchaninova Svetlana
Mesland Jean-Baptiste
Meybohm Patrick
Milnik Annette
Mir Antonia Socias
Mirabella Lucia
Montrucchio Giorgia
Morais Rui
Mullhi Randeep
Mumelj Lana
Mårtensson Johan
Nainan Myatra Sheila
Neporada Elena
Ng Jensen
Nichol Alistair
Nielsen Nathan D.
Nizzero Marta
Noto Alberto
Nunes Sandra
Oddy Christopher
Oliveira Ana
Omrani Ali
Ortiz Aaron Blandino
Ostermann Marlies
Padilla-Serrano Antonio
Paleari Chiara
Papamichalis Panagiotis
Parra Juan Pablo Aviles
Parra-Tanoux Daniela
Patricio Patricia
Pellegrini Mariangela
Perez-Araos Rodrigo
Perez-Calvo Cesar
Petrisor Cristina
Piagnerelli Michael
Pinto André
Polati Enrico
Poliakov Igor
Popov Evgeniy
Popova Ksenia
Potter Elizabeth
Poulton Lottie
Poyat Chrystelle
Pravia Orville Victoriano Baez
Purvis Sarah
Pyregov Alexey
Pérez-Torres David
Póvoa Pedro
Qayyum Ahad
qudah Bara Mahmoud Al
Ramires Tiago
Rana Muhammad
Reidinga Auke C.
Reinhard Veronika
Ripolles-Melchor Javier
Robin Nicole
Robles Victor Hugo Madrigal
Rodriguez-Ruiz Emilio
Rodríguez-Solis Carmen
Romeu Juan Maria
Roriz Carolina
Rosenberger Dorothea
Saha Rajnish
Sahin Ayca Sultan
Saif Ibrahim Abdulsalam
Salaverria Iñigo
Salciute-Simene Erika
Sales Gabriele
Santanilla Jairo
Santos Arnoldo
Santos Maria Lurdes
Saraçoğlu Kemal Tolga
Savi Marzia
Schaller Stefan J.
Schellenberg Clara
Scholten Harm
Serrano Ainhoa
Shlyk Irina
Shuker Benjamin
Sierra Rosario Quispe
Silva Catarina
Singatullina Natalia
So Ralph K. L.
Sokolov Dmitry
Soultati Ioanna
Spadaro Savino
Spieth Peter
Spivey Michael
Spronk Peter E.
Sri-Chandana Chunda
Stubenrauch Peter
Sulemanji Demet
Suner Andrea Ortiz
Suner Kezban Ozmen
Szakmany Tamas
Taborda Lúcia
Teboul Jean-Louis
Teplykh Boris
Teruel Santiago Yus
Tharwat Aisa
Tharwat Samar
Thoral Patrick
Tirapegui Fernando
Tomak Yakup
Tonetti Tommaso
Torlinski Tomasz
Truman Nick
Turan Işıl Özkoçak
Vaccarini Barbara
Valero Clara Martínez
Valverde Virginia Hidalgo
van Bussel Bas
van den Bogaard Bas
van der Heiden Eveline
Van Hecke Jolien
Van Leemput Jan
Van Malderen Claire
Vanhove Philippe
Varela Ignacio Yago Martinez
Vargas Patricio
Vasileiadou Georgia
Venkatesh Harish
Vijayakumar Gopal
Vlachou Aikaterini
Vladislav Belskii
Vlasova Marina
Volta Carlo Alberto
von Seth Magnus
Weiss Björn
Wentowski Catherine
Wilting Rob
Wong Adrian
Yepes David
Yilmaz Mehmet
Zoerner Frank
Zverev Ivan
Österlind Jonas
Publication venue: SpringerOpen
Publication date: 14/11/2023

Background Early mobilisation (EM) is an intervention that may improve the outcome of critically ill patients. There is limited data on EM in COVID-19 patients and its use during the first pandemic wave. Methods This is a pre-planned subanalysis of the ESICM UNITE-COVID, an international multicenter observational study involving critically ill COVID-19 patients in the ICU between February 15th and May 15th, 2020. We analysed variables associated with the initiation of EM (within 72 h of ICU admission) and explored the impact of EM on mortality, ICU and hospital length of stay, as well as discharge location. Statistical analyses were done using (generalised) linear mixed-effect models and ANOVAs. Results Mobilisation data from 4190 patients from 280 ICUs in 45 countries were analysed. 1114 (26.6%) of these patients received mobilisation within 72 h after ICU admission; 3076 (73.4%) did not. In our analysis of factors associated with EM, mechanical ventilation at admission (OR 0.29; 95% CI 0.25, 0.35; p = 0.001), higher age (OR 0.99; 95% CI 0.98, 1.00; p ≤ 0.001), pre-existing asthma (OR 0.84; 95% CI 0.73, 0.98; p = 0.028), and pre-existing kidney disease (OR 0.84; 95% CI 0.71, 0.99; p = 0.036) were negatively associated with the initiation of EM. EM was associated with a higher chance of being discharged home (OR 1.31; 95% CI 1.08, 1.58; p = 0.007) but was not associated with length of stay in ICU (adj. difference 0.91 days; 95% CI − 0.47, 1.37, p = 0.34) and hospital (adj. difference 1.4 days; 95% CI − 0.62, 2.35, p = 0.24) or mortality (OR 0.88; 95% CI 0.7, 1.09, p = 0.24) when adjusted for covariates. Conclusions Our findings demonstrate that a quarter of COVID-19 patients received EM. There was no association found between EM in COVID-19 patients' ICU and hospital length of stay or mortality. However, EM in COVID-19 patients was associated with increased odds of being discharged home rather than to a care facility. 
Trial registration ClinicalTrials.gov: NCT04836065 (retrospectively registered April 8th 2021)
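The headline figures in the abstract above can be checked with simple arithmetic, and the reported 95% confidence intervals can be read with the usual rule of thumb: an interval for an odds ratio that excludes 1 indicates an association at the 5% level. The sketch below only re-reads the published numbers; it does not reproduce the authors' mixed-effect models, and the helper name is illustrative.

```python
# Patient counts as reported in the abstract.
received_em = 1114
no_em = 3076
total = 4190

# The two groups should add up to the analysed cohort.
counts_consistent = (received_em + no_em == total)

# Shares of the cohort, rounded to one decimal place as in the abstract.
share_em = round(100 * received_em / total, 1)
share_no_em = round(100 * no_em / total, 1)


def excludes_null(ci_low, ci_high, null_value):
    """Return True when the interval does not contain the null value."""
    return not (ci_low <= null_value <= ci_high)


# Odds ratios use 1 as the null value.
discharged_home = excludes_null(1.08, 1.58, 1.0)  # OR 1.31
mortality = excludes_null(0.70, 1.09, 1.0)        # OR 0.88
```

The interval for discharge home lies entirely above 1, while the interval for mortality contains 1, which matches the conclusions stated in the abstract.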

Online Research @ Cardiff

Knowledge-based biomedical word sense disambiguation: comparison of approaches

Author: Alan R Aronson
Antonio J Jimeno-Yepes
Publication date: 03/04/2020

Abstract Background: Word sense disambiguation (WSD) algorithms attempt to select the proper sense of ambiguous terms in text. Resources like the UMLS provide a reference thesaurus to be used to annotate the biomedical literature. Statistical learning approaches have produced good results, but the size of the UMLS makes the production of training data infeasible to cover all the domain

CiteSeerX

Studying the correlation between different word sense disambiguation methods and summarization effectiveness in biomedical texts

Author: Aronson Alan R
Díaz Alberto
Jimeno-Yepes Antonio J
Plaza Laura
Publication venue: Springer Science and Business Media LLC
Publication date: 01/08/2011

Abstract Background Word sense disambiguation (WSD) attempts to solve lexical ambiguities by identifying the correct meaning of a word based on its context. WSD has been demonstrated to be an important step in knowledge-based approaches to automatic summarization. However, the correlation between the accuracy of the WSD methods and the summarization performance has never been studied. Results We present three existing knowledge-based WSD approaches and a graph-based summarizer. Both the WSD approaches and the summarizer employ the Unified Medical Language System (UMLS) Metathesaurus as the knowledge source. We first evaluate WSD directly, by comparing the prediction of the WSD methods to two reference sets: the NLM WSD dataset and the MSH WSD collection. We next apply the different WSD methods as part of the summarizer, to map documents onto concepts in the UMLS Metathesaurus, and evaluate the summaries that are generated. The results obtained by the different methods in both evaluations are studied and compared. Conclusions It has been found that the use of WSD techniques has a positive impact on the results of our graph-based summarizer, and that, when both the WSD and summarization tasks are assessed over large and homogeneous evaluation collections, there exists a correlation between the overall results of the WSD and summarization tasks. Furthermore, the best WSD algorithm in the first task tends to be also the best one in the second. However, we also found that the improvement achieved by the summarizer is not directly correlated with the WSD performance. The most likely reason is that the errors in disambiguation are not equally important but depend on the relative salience of the different concepts in the document to be summarized.

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central