Search CORE

21,774 research outputs found

Risk Assessment for Venous Thromboembolism in Chemotherapy-Treated Ambulatory Cancer Patients: A Machine Learning Approach

Author: Ferroni Patrizia
Guadagni Fiorella
Nanni Umberto
Riondino Silvia
Roselli Mario
Scarpato Noemi
Zanzotto Fabio Massimo
Publication venue: 'SAGE Publications'
Publication date: 01/01/2017
Field of study

OBJECTIVE: To design a precision medicine approach aimed at exploiting significant patterns in data, in order to produce venous thromboembolism (VTE) risk predictors for cancer outpatients that might be of advantage over the currently recommended model (Khorana score). DESIGN: Multiple kernel learning (MKL) based on support vector machines and random optimization (RO) models were used to produce VTE risk predictors (referred to as machine learning [ML]-RO) yielding the best classification performance over a training (3-fold cross-validation) and testing set. RESULTS: Attributes of the patient data set ( n = 1179) were clustered into 9 groups according to clinical significance. Our analysis produced 6 ML-RO models in the training set, which yielded better likelihood ratios (LRs) than baseline models. Of interest, the most significant LRs were observed in 2 ML-RO approaches not including the Khorana score (ML-RO-2: positive likelihood ratio [+LR] = 1.68, negative likelihood ratio [-LR] = 0.24; ML-RO-3: +LR = 1.64, -LR = 0.37). The enhanced performance of ML-RO approaches over the Khorana score was further confirmed by the analysis of the areas under the Precision-Recall curve (AUCPR), and the approaches were superior in the ML-RO approaches (best performances: ML-RO-2: AUCPR = 0.212; ML-RO-3-K: AUCPR = 0.146) compared with the Khorana score (AUCPR = 0.096). Of interest, the best-fitting model was ML-RO-2, in which blood lipids and body mass index/performance status retained the strongest weights, with a weaker association with tumor site/stage and drugs. CONCLUSIONS: Although the monocentric validation of the presented predictors might represent a limitation, these results demonstrate that a model based on MKL and RO may represent a novel methodological approach to derive VTE risk classifiers. Moreover, this study highlights the advantages of optimizing the relative importance of groups of clinical attributes in the selection of VTE risk predictors

Archivio della ricerca- Università di Roma La Sapienza

Who should be prioritized for renal transplantation?: Analysis of key stakeholder preferences using discrete choice experiments

Author: A Allepuz
AJ Lloyd
Ala Szczepura
Anil Gumber
C Byrne
C Byrne
CC Geddes
CJ Browning
Dennis Leech
Department of Health
DJ Street
Domenico Moro
FP Cappuccio
G Rubin
J Huber
J Neuberger
J Ratcliffe
J Ratcliffe
JM Wooldridge
K Ladkin
KC Norris
LF Ross
M Amaya-Amaya
M Bradley
MD Clark
Michael D Clark
NHS Blood and Transplant
NHS Blood and Transplant [formally UK Transplant]
Nick West
ON Louis
PL Tso
R Bennett
RA Koenne
RM Higgins
Robert Higgins
S Youngkong
SN Davison
VB Ashby
VS Raleigh
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

Background Policies for allocating deceased donor kidneys have recently shifted from allocation based on Human Leucocyte Antigen (HLA) tissue matching in the UK and USA. Newer allocation algorithms incorporate waiting time as a primary factor, and in the UK, young adults are also favoured. However, there is little contemporary UK research on the views of stakeholders in the transplant process to inform future allocation policy. This research project aimed to address this issue. Methods Discrete Choice Experiment (DCE) questionnaires were used to establish priorities for kidney transplantation among different stakeholder groups in the UK. Questionnaires were targeted at patients, carers, donors / relatives of deceased donors, and healthcare professionals. Attributes considered included: waiting time; donor-recipient HLA match; whether a recipient had dependents; diseases affecting life expectancy; and diseases affecting quality of life. Results Responses were obtained from 908 patients (including 98 ethnic minorities); 41 carers; 48 donors / relatives of deceased donors; and 113 healthcare professionals. The patient group demonstrated statistically different preferences for every attribute (i.e. significantly different from zero) so implying that changes in given attributes affected preferences, except when prioritizing those with no rather than moderate diseases affecting quality of life. The attributes valued highly related to waiting time, tissue match, prioritizing those with dependents, and prioritizing those with moderate rather than severe diseases affecting life expectancy. Some preferences differed between healthcare professionals and patients, and ethnic minority and non-ethnic minority patients. Only non-ethnic minority patients and healthcare professionals clearly prioritized those with better tissue matches. Conclusions Our econometric results are broadly supportive of the 2006 shift in UK transplant policy which emphasized prioritizing the young and long waiters. However, our findings suggest the need for a further review in the light of observed differences in preferences amongst ethnic minorities, and also because those with dependents may be a further priority.</p

Springer - Publisher Connector

Directory of Open Access Journals

Warwick Research Archives Portal Repository

Regression Models For Readmission Prediction Using Electronic Medical Records

Author: Niu Yudi
Publication venue: DigitalCommons@WayneState
Publication date: 01/01/2013
Field of study

Hospital readmissions are not only expensive but are also potentially harmful, and most importantly, they are often preventable. Providing special care for a targeted group of patients who are at a high risk of readmission can signiﬁcantly improve the chances of avoiding rehospitalization. Despite the signiﬁcance of this problem, not many researchers have thoroughly investigated it due to the inherent complexities involved in analyzing and estimating the inherent predictive power of such complex hospitalization records. In this thesis, we propose using support vector machines and survival analysis methods to analyze data collected from Electronic Medical Records (EMR). We define the notion of abnormal patients and understand how they affect the performance of classifiers. We use sparse methods with survival regression models to build clinical models which are suitable to apply on such complex clinical data. These models are compared with existing readmission models such as ADHERE, TABAK and logistic regression models. Finally, we provide inferences and conclusions on how to extend this work to build better regression models

Digital Commons@Wayne State University

Identification of disease-causing genes using microarray data mining and gene ontology

Author: A Mohammadi
A Zhang
AA Alizadeh
Azadeh Mohammadi
B Duval
BF Souza
C Ambroise
C Ding
C Tago
D Lin
D Singh
E Martinez
FM Couto
I Guyon
I Inza
J Jaeger
JJ Jiang
L Li
L Yu
L Ziaei
Mansoor Salehi
Mohammad H Saraee
N Cristianini
P Pavlidis
P Resnik
PA Mundra
PA Mundra
PJ Park
R Genuer
RF Weaver
S Li
S Li
TM Huang
TR Golub
TS Furey
U Alon
W Xu
Y Ding
Y Saeys
Y Wang
YL Chin
Z Xie
Z Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

Background: One of the best and most accurate methods for identifying disease-causing genes is monitoring gene expression values in different samples using microarray technology. One of the shortcomings of microarray data is that they provide a small quantity of samples with respect to the number of genes. This problem reduces the classification accuracy of the methods, so gene selection is essential to improve the predictive accuracy and to identify potential marker genes for a disease. Among numerous existing methods for gene selection, support vector machine-based recursive feature elimination (SVMRFE) has become one of the leading methods, but its performance can be reduced because of the small sample size, noisy data and the fact that the method does not remove redundant genes. Methods: We propose a novel framework for gene selection which uses the advantageous features of conventional methods and addresses their weaknesses. In fact, we have combined the Fisher method and SVMRFE to utilize the advantages of a filtering method as well as an embedded method. Furthermore, we have added a redundancy reduction stage to address the weakness of the Fisher method and SVMRFE. In addition to gene expression values, the proposed method uses Gene Ontology which is a reliable source of information on genes. The use of Gene Ontology can compensate, in part, for the limitations of microarrays, such as having a small number of samples and erroneous measurement results. Results: The proposed method has been applied to colon, Diffuse Large B-Cell Lymphoma (DLBCL) and prostate cancer datasets. The empirical results show that our method has improved classification performance in terms of accuracy, sensitivity and specificity. In addition, the study of the molecular function of selected genes strengthened the hypothesis that these genes are involved in the process of cancer growth. Conclusions: The proposed method addresses the weakness of conventional methods by adding a redundancy reduction stage and utilizing Gene Ontology information. It predicts marker genes for colon, DLBCL and prostate cancer with a high accuracy. The predictions made in this study can serve as a list of candidates for subsequent wet-lab verification and might help in the search for a cure for cancers

University of Salford Institutional Repository

Springer - Publisher Connector

Directory of Open Access Journals