7 research outputs found

Clinical Text Prediction with Numerically Grounded Conditional Language Models

Author: Petersen SE
Riedel S
Spithourakis GP
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 20/10/2016

Assisted text input techniques can save time and effort and improve text quality. In this paper, we investigate how grounded and conditional extensions to standard neural language models can bring improvements in the tasks of word prediction and completion. These extensions incorporate a structured knowledge base and numerical values from the text into the context used to predict the next word. Our automated evaluation on a clinical dataset shows that the extended models significantly outperform standard models. Our best system uses both conditioning and grounding, because of their orthogonal benefits. For word prediction with a list of 5 suggestions, it improves recall from 25.03% to 71.28%, and for word completion it improves keystroke savings from 34.35% to 44.81%, where the theoretical bound for this dataset is 58.78%. We also perform a qualitative investigation of how models with lower perplexity occasionally fare better at the tasks. We found that at test time numbers have more influence on the document level than on individual word probabilities.

arXiv.org e-Print Archive

UCL Discovery

Clinical Text Prediction with Numerically Grounded Conditional Language Models

Author: Petersen SE
Riedel S
Spithourakis GP
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/11/2016

UCL Discovery

Numeracy for language models: Evaluating and improving their ability to predict numbers

Author: Riedel S
Spithourakis GP
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2018

Numeracy is the ability to understand and work with numbers. It is a necessary skill for composing and understanding documents in clinical, scientific, and other technical domains. In this paper, we explore different strategies for modelling numerals with language models, such as memorisation and digit-by-digit composition, and propose a novel neural architecture that uses a continuous probability density function to model numerals from an open vocabulary. Our evaluation on clinical and scientific datasets shows that using hierarchical models to distinguish numerals from words improves a perplexity metric on the subset of numerals by 2 and 4 orders of magnitude, respectively, over non-hierarchical models. A combination of strategies can further improve perplexity. Our continuous probability density function model reduces mean absolute percentage errors by 18% and 54% in comparison to the second-best strategy for each dataset, respectively.

UCL Discovery

Group support systems features and their contribution to technology strategy decision-making: A review and analysis

Author: A Salo
A Shirani
AC Hax
AK Choudhury
BA Reinig
C Durst
CE Bozdağ
D Ford
E Lichtenthaler
F Ackermann
F Antunes
F Zandi
GP Spithourakis
GR Mitchell
GT Preez du
J Keller
J Lim
JF Nunamaker
JP Shim
JW Satzinger
K Weigand
KM Chudoba
L Chidambaram
L Tung
M Adkins
M Limayem
M Torkkeli
ME Porter
MG Beruvides
N Lehmann-Willenbrock
OK Ngwenyama
Q Tian
R Phaal
RA Burgelman
RB Gallupe
S Davenport
S Yilmaz
SA Zahra
SB Craig
SG Rogelberg
SG Rogelberg
T Gordon
V Chiesa
V Chiesa
V Miemis
V Vathanophas
VA Bañuls
W Chang
W Huang
WA Green
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016

Collective decision-making processes require careful design considerations in organizations. On the one hand, the inclusion of a greater number of actors contributes to a wider knowledge base; on the other, it can become a diffuse process and be distorted from the principles initially established. This paper observes a specific collective decision-making process in organizations (technology strategy formulation) and, through a critical review of the literature, analyzes how advances in the features of group support systems (GSS) support improvements in different stages of this process. This paper also discusses the implications of GSS appropriation in group dynamics. This research was supported by Fundação para a Ciência e Tecnologia (SFRH/BD/33727/2009), within the framework of the MIT Portugal Program.

Universidade do Minho: RepositoriUM

Crossref

Towards automated clinical coding

Author: Catling F
Riedel S
Spithourakis GP
Publication date: 01/12/2018

BACKGROUND: Patients’ encounters with healthcare services must undergo clinical coding. These codes are typically derived from free-text notes. Manual clinical coding is expensive, time-consuming and prone to error. Automated clinical coding systems have great potential to save resources, and real-time availability of codes would improve oversight of patient care and accelerate research. Automated coding is made challenging by the idiosyncrasies of clinical text, the large number of disease codes and their unbalanced distribution.
METHODS: We explore methods for representing clinical text and the labels in hierarchical clinical coding ontologies. Text is represented as term frequency-inverse document frequency counts and then as word embeddings, which we use as input to recurrent neural networks. Labels are represented atomically, and then by learning representations of each node in a coding ontology and composing a representation for each label from its respective node path. We consider different strategies for initialisation of the node representations. We evaluate our methods using the publicly available Medical Information Mart for Intensive Care III dataset: we extract the history of presenting illness section from each discharge summary in the dataset, then predict the International Classification of Diseases, ninth revision, Clinical Modification codes associated with these.
RESULTS: Composing the label representations from the clinical-coding-ontology nodes increased weighted F1 for prediction of the 17,561 disease labels to 0.264–0.281 from 0.232–0.249 for atomic representations. Recurrent neural network text representation improved weighted F1 for prediction of the 19 disease-category labels to 0.682–0.701 from 0.662–0.682 using term frequency-inverse document frequency. However, term frequency-inverse document frequency outperformed recurrent neural networks for prediction of the 17,561 disease labels.
CONCLUSIONS: This study demonstrates that hierarchically structured medical knowledge can be incorporated into statistical models, and that this improves performance during automated clinical coding. This performance improvement results primarily from improved representation of rarer diseases. We also show that recurrent neural networks improve representation of medical text in some settings. Learning good representations of the very rare diseases in clinical coding ontologies from data alone remains challenging, and alternative means of representing these diseases will form a major focus of future work on automated clinical coding.

UCL Discovery

Forecast combinations for intermittent demand

Author: Fotios Petropoulos
Nikolaos Kourentzes
Spithourakis GP
Timmermann A
Trabelsi A
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Comparison of statistical and machine learning methods for daily SKU demand forecasting

Author: A Davydenko
A Karatzoglou
A Liaw
A Pooya
AA Ghobbar
AA Nasiri Pour
AA Syntetos
AA Syntetos
AA Syntetos
AHC Eaves
AJ Koning
Artemios-Anargyros Semenoglou
AV Rao
B Rostami-Tabar
B Schölkopf
B Seaman
C Bergmeir
CE Rasmussen
D Nguyen
D Salinas
DJC MacKay
E Spiliotis
EA Shale
ES Gardner
ES Gardner Jr
Evangelos Spiliotis
F Lolli
F Petropoulos
F Petropoulos
F Petropoulos
FR Johnston
G Zhang
GP Spithourakis
H Chen
I Svetunkov
J Barker
JD Croston
JE Boylan
JE Boylan
JH Friedman
JL Carmo
K Hornik
K Nikolopoulos
K Nikolopoulos
KI Nikolopoulos
L Breiman
LJ Tashman
M Abolghasemi
M Babai
M Hasni
M Mohammadipour
MF Møller
MW Seeger
N Kourentzes
N Kourentzes
N Kourentzes
N Kourentzes
NC Schwertman
P Boutselis
P Montero-Manso
PH Franses
R Fildes
R Teunter
RG Brown
RH Teunter
RH Teunter
RJ Hyndman
RJ Hyndman
RP Lippmann
RS Gutierrez
S Kolassa
S Makridakis
S Makridakis
S Makridakis
S Mukhopadhyay
S Smyl
Spyros Makridakis
T Januschowski
TR Willemain
Vassilios Assimakopoulos
Y Freund
ÖG Ali
Publication venue: 'Springer Science and Business Media LLC'

Crossref