21,774 research outputs found

    Risk Assessment for Venous Thromboembolism in Chemotherapy-Treated Ambulatory Cancer Patients: A Machine Learning Approach

    Get PDF
    OBJECTIVE: To design a precision medicine approach aimed at exploiting significant patterns in data, in order to produce venous thromboembolism (VTE) risk predictors for cancer outpatients that might be of advantage over the currently recommended model (Khorana score). DESIGN: Multiple kernel learning (MKL) based on support vector machines and random optimization (RO) models were used to produce VTE risk predictors (referred to as machine learning [ML]-RO) yielding the best classification performance over a training (3-fold cross-validation) and testing set. RESULTS: Attributes of the patient data set ( n = 1179) were clustered into 9 groups according to clinical significance. Our analysis produced 6 ML-RO models in the training set, which yielded better likelihood ratios (LRs) than baseline models. Of interest, the most significant LRs were observed in 2 ML-RO approaches not including the Khorana score (ML-RO-2: positive likelihood ratio [+LR] = 1.68, negative likelihood ratio [-LR] = 0.24; ML-RO-3: +LR = 1.64, -LR = 0.37). The enhanced performance of ML-RO approaches over the Khorana score was further confirmed by the analysis of the areas under the Precision-Recall curve (AUCPR), and the approaches were superior in the ML-RO approaches (best performances: ML-RO-2: AUCPR = 0.212; ML-RO-3-K: AUCPR = 0.146) compared with the Khorana score (AUCPR = 0.096). Of interest, the best-fitting model was ML-RO-2, in which blood lipids and body mass index/performance status retained the strongest weights, with a weaker association with tumor site/stage and drugs. CONCLUSIONS: Although the monocentric validation of the presented predictors might represent a limitation, these results demonstrate that a model based on MKL and RO may represent a novel methodological approach to derive VTE risk classifiers. Moreover, this study highlights the advantages of optimizing the relative importance of groups of clinical attributes in the selection of VTE risk predictors

    Who should be prioritized for renal transplantation?: Analysis of key stakeholder preferences using discrete choice experiments

    Get PDF
    Background Policies for allocating deceased donor kidneys have recently shifted from allocation based on Human Leucocyte Antigen (HLA) tissue matching in the UK and USA. Newer allocation algorithms incorporate waiting time as a primary factor, and in the UK, young adults are also favoured. However, there is little contemporary UK research on the views of stakeholders in the transplant process to inform future allocation policy. This research project aimed to address this issue. Methods Discrete Choice Experiment (DCE) questionnaires were used to establish priorities for kidney transplantation among different stakeholder groups in the UK. Questionnaires were targeted at patients, carers, donors / relatives of deceased donors, and healthcare professionals. Attributes considered included: waiting time; donor-recipient HLA match; whether a recipient had dependents; diseases affecting life expectancy; and diseases affecting quality of life. Results Responses were obtained from 908 patients (including 98 ethnic minorities); 41 carers; 48 donors / relatives of deceased donors; and 113 healthcare professionals. The patient group demonstrated statistically different preferences for every attribute (i.e. significantly different from zero) so implying that changes in given attributes affected preferences, except when prioritizing those with no rather than moderate diseases affecting quality of life. The attributes valued highly related to waiting time, tissue match, prioritizing those with dependents, and prioritizing those with moderate rather than severe diseases affecting life expectancy. Some preferences differed between healthcare professionals and patients, and ethnic minority and non-ethnic minority patients. Only non-ethnic minority patients and healthcare professionals clearly prioritized those with better tissue matches. Conclusions Our econometric results are broadly supportive of the 2006 shift in UK transplant policy which emphasized prioritizing the young and long waiters. However, our findings suggest the need for a further review in the light of observed differences in preferences amongst ethnic minorities, and also because those with dependents may be a further priority.</p

    Regression Models For Readmission Prediction Using Electronic Medical Records

    Get PDF
    Hospital readmissions are not only expensive but are also potentially harmful, and most importantly, they are often preventable. Providing special care for a targeted group of patients who are at a high risk of readmission can signiïŹcantly improve the chances of avoiding rehospitalization. Despite the signiïŹcance of this problem, not many researchers have thoroughly investigated it due to the inherent complexities involved in analyzing and estimating the inherent predictive power of such complex hospitalization records. In this thesis, we propose using support vector machines and survival analysis methods to analyze data collected from Electronic Medical Records (EMR). We define the notion of abnormal patients and understand how they affect the performance of classifiers. We use sparse methods with survival regression models to build clinical models which are suitable to apply on such complex clinical data. These models are compared with existing readmission models such as ADHERE, TABAK and logistic regression models. Finally, we provide inferences and conclusions on how to extend this work to build better regression models

    Identification of disease-causing genes using microarray data mining and gene ontology

    Get PDF
    Background: One of the best and most accurate methods for identifying disease-causing genes is monitoring gene expression values in different samples using microarray technology. One of the shortcomings of microarray data is that they provide a small quantity of samples with respect to the number of genes. This problem reduces the classification accuracy of the methods, so gene selection is essential to improve the predictive accuracy and to identify potential marker genes for a disease. Among numerous existing methods for gene selection, support vector machine-based recursive feature elimination (SVMRFE) has become one of the leading methods, but its performance can be reduced because of the small sample size, noisy data and the fact that the method does not remove redundant genes. Methods: We propose a novel framework for gene selection which uses the advantageous features of conventional methods and addresses their weaknesses. In fact, we have combined the Fisher method and SVMRFE to utilize the advantages of a filtering method as well as an embedded method. Furthermore, we have added a redundancy reduction stage to address the weakness of the Fisher method and SVMRFE. In addition to gene expression values, the proposed method uses Gene Ontology which is a reliable source of information on genes. The use of Gene Ontology can compensate, in part, for the limitations of microarrays, such as having a small number of samples and erroneous measurement results. Results: The proposed method has been applied to colon, Diffuse Large B-Cell Lymphoma (DLBCL) and prostate cancer datasets. The empirical results show that our method has improved classification performance in terms of accuracy, sensitivity and specificity. In addition, the study of the molecular function of selected genes strengthened the hypothesis that these genes are involved in the process of cancer growth. Conclusions: The proposed method addresses the weakness of conventional methods by adding a redundancy reduction stage and utilizing Gene Ontology information. It predicts marker genes for colon, DLBCL and prostate cancer with a high accuracy. The predictions made in this study can serve as a list of candidates for subsequent wet-lab verification and might help in the search for a cure for cancers
    • 

    corecore