Search CORE

10 research outputs found

Predicting Multiple ICD-10 Codes from Brazilian-Portuguese Clinical Notes

Author: A Perotte
AEW Johnson
F Duarte
G Salton
J Huang
M Li
M Oleynik
M Subotin
P Bojanowski
PB Jensen
SVS Pakhomov
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 29/07/2020
Field of study

ICD coding from electronic clinical records is a manual, time-consuming and expensive process. Code assignment is, however, an important task for billing purposes and database organization. While many works have studied the problem of automated ICD coding from free text using machine learning techniques, most use records in the English language, especially from the MIMIC-III public dataset. This work presents results for a dataset with Brazilian Portuguese clinical notes. We develop and optimize a Logistic Regression model, a Convolutional Neural Network (CNN), a Gated Recurrent Unit Neural Network and a CNN with Attention (CNN-Att) for prediction of diagnosis ICD codes. We also report our results for the MIMIC-III dataset, which outperform previous work among models of the same families, as well as the state of the art. Compared to MIMIC-III, the Brazilian Portuguese dataset contains far fewer words per document, when only discharge summaries are used. We experiment concatenating additional documents available in this dataset, achieving a great boost in performance. The CNN-Att model achieves the best results on both datasets, with micro-averaged F1 score of 0.537 on MIMIC-III and 0.485 on our dataset with additional documents.Comment: Accepted at BRACIS 202

arXiv.org e-Print Archive

Crossref

Estimating the health‐related quality of life of kidney stone patients: initial results from the Wisconsin Stone Quality of Life Machine‐Learning Algorithm (WISQOL‐MLA)

Author: Garreta R
Kiranyaz S
Michie D
Pakhomov S
Pakhomov SVS
Publication venue: 'Wiley'
Publication date
Field of study

Crossref

Semantic relatedness and similarity of biomedical terms: examining the effects of recency, size, and section of biomedical publications on the performance of word2vec

Author: C Pesquita
D Sánchez
D Zhang
Erjia Yan
Fei Wang
J Pennington
JA Minarro-Giménez
MA Hadj Taieb
SVS Pakhomov
T Pedersen
Y Zhu
Yongjun Zhu
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Recent advances in Swedish and Spanish medical entity recognition in clinical texts using deep neural approaches

Author: A Casillas
A Pérez
Alicia Pérez
Arantza Casillas
E Grave
H Dalianis
H Dalianis
I Martinez Soriano
JPC Chiu
L Yao
M Gridach
M Oronoz
M Oronoz
Maite Oronoz
Maryam Habibi
O Uzuner
P Bojanowski
PB Jensen
R Collobert
R Roller
R Weegar
R Östling
Rebecka Weegar
S Almgren
S Hochreiter
SVS Pakhomov
T Mikolov
T Mikolov
V Yadav
X Dong
Y Wu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Background Text mining and natural language processing of clinical text, such as notes from electronic health records, requires specific consideration of the specialized characteristics of these texts. Deep learning methods could potentially mitigate domain specific challenges such as limited access to in-domain tools and data sets. Methods A bi-directional Long Short-Term Memory network is applied to clinical notes in Spanish and Swedish for the task of medical named entity recognition. Several types of embeddings, both generated from in-domain and out-of-domain text corpora, and a number of generation and combination strategies for embeddings have been evaluated in order to investigate different input representations and the influence of domain on the final results. Results For Spanish, a micro averaged F1-score of 75.25 was obtained and for Swedish, the corresponding score was 76.04. The best results for both languages were achieved using embeddings generated from in-domain corpora extracted from electronic health records, but embeddings generated from related domains were also found to be beneficial. Conclusions A recurrent neural network with in-domain embeddings improved the medical named entity recognition compared to shallow learning methods, showing this combination to be suitable for entity recognition in clinical text for both languages.The publication cost of this article was funded by Stockholm University Librar

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Crossref

Archivo Digital para la Docencia y la Investigación

Indirect association and ranking hypotheses for literature based discovery

Author: A Kastrin
AR Aronson
B Wilkowski
Bridget T. McInnes
BT McInnes
BT McInnes
D Cameron
D Hristovski
D Hristovski
D Hristovski
DR Swanson
DR Swanson
F Smadja
H Kilicoglu
H-T Yang
I Petrič
J Sybrandt
JD Wren
Judita Preiss
L Eronen
M Yetisgen-Yildiz
MD Gordon
MD Gordon
NC Baker
NR Smalheiser
O Bodenreider
P Bruza
R Kostoff
RA Fisher
S Henry
S Henry
Sam Henry
Sam Henry
SV Pakhomov
SVS Pakhomov
T Cohen
T Dunning
T Pedersen
T Saito
TC Rindflesch
TE Workman
Y Lin
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Recent advances in Swedish and Spanish medical entity recognition in clinical texts using deep neural approaches

Author: A Casillas
A Pérez
Alicia Pérez
Arantza Casillas
E Grave
H Dalianis
H Dalianis
I Martinez Soriano
JPC Chiu
L Yao
M Gridach
M Oronoz
M Oronoz
Maite Oronoz
Maryam Habibi
O Uzuner
P Bojanowski
PB Jensen
R Collobert
R Roller
R Weegar
R Östling
Rebecka Weegar
S Almgren
S Hochreiter
SVS Pakhomov
T Mikolov
T Mikolov
V Yadav
X Dong
Y Wu
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Semantic similarity in the biomedical domain: an evaluation across knowledge sources

Author: A Budanitsky
A Budanitsky
A Hliaoutakis
AR Aronson
BT McInnes
BT McInnes
C Leacock
Cynthia Brandt
D Lin
D Lin
D Rao
D Sánchez
DO Seaghdha
E Agirre
E Agirre
E Agirre
GK Savova
H Al-Mubaid
H Al-Mubaid
H Cunningham
JE Caviedes
JJ Jiang
M Batet
M Lesk
M Sahami
M Stevenson
N Seco
P Resnik
R Rada
S Aseervatham
S Banerjee
S Bloehdorn
S Bloehdorn
S Brin
S Pakhomov
S Patwardhan
S Patwardhan
ST Wu
SVS Pakhomov
T Hughes
T Pedersen
TH Haveliwala
Vijay N Garla
VN Garla
W-N Lee
Y Liu
Z Wu
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Longitudinal cognitive biomarkers predicting symptom onset in presymptomatic frontotemporal dementia

Author: A Rey
Army Individual Test Battery
AT Beck
B Dubois
BJ Hallam
D Wechsler
DH Geschwind
DI Kaufer
DR Royall
E Berg van den
E Kaplan
E Visch-Brink
EG Dopper
EG Dopper
Elise G. P. Dopper
Emma L. van der Ende
Esther van den Berg
F Happe
H Seelaar
HE Nelson
J Hassenstab
J Jolles
J Lindeboom
Janne M. Papma
JB Miller
JC Janssen
JD Rohrer
JD Rohrer
JD Rohrer
JD Rohrer
JD Warren
Jessica L. Panman
JL Whitwell
John C. van Swieten
JR Stroop
JS Snowden
JS Snowden
K Rascovsky
Laura Donker Kaat
Lauren van Asseldonk
LC Jiskoot
LH Meeter
Lieke H. H. Meeter
Lize C. Jiskoot
LLT Thurstone
M Adenzato
M Barandiaran
M Hornberger
M Laisney
MF Folstein
ML Gorno-Tempini
N Tolboom
P Ekman
R Smith
Reinier Timman
Rick van Minkelen
S Spina
Sanne Franzen
SJ Doesborgh
SVS Pakhomov
TA Rosness
TJ Ferman
WJ Youden
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

Introduction: We performed 4-year follow-up neuropsychological assessment to investigate cognitive decline and the prognostic abilities from presymptomatic to symptomatic familial frontotemporal dementia (FTD). Methods: Presymptomatic MAPT (n = 15) and GRN mutation carriers (n = 31), and healthy controls (n = 39) underwent neuropsychological assessment every 2 years. Eight mutation carriers (5 MAPT, 3 GRN) became symptomatic. We investigated cognitive decline with multilevel regression modeling; the prognostic performance was assessed with ROC analyses and stepwise logistic regression. Results: MAPT converters declined on language, attention, executive function, social cognition, and memory, and GRN converters declined on attention and executive function (p < 0.05). Cognitive decline in ScreeLing phonology (p = 0.046) and letter fluency (p = 0.046) were predictive for conversion to non-fluent variant PPA, and decline on categorical fluency (p = 0.025) for an underlying MAPT mutation. Discussion: Using longitudinal neuropsychological assessment, we detected a mutation-specific pattern of cognitive decline, potentially suggesting prognostic value of neuropsychological trajectories in conversion to symptomatic FTD

Crossref

EUR Research Repository

Leiden University Scholary Publications

Profiling Speech and Pausing in Amyotrophic Lateral Sclerosis (ALS) and Frontotemporal Dementia (FTD)

Crossref

Screening pregnant women for suicidal behavior in electronic medical records: diagnostic codes vs. clinical notes processed by natural language processing

Crossref