10,508 research outputs found

    New Statistical Learning Methods for Personalized Medical Decision Making

    Get PDF
    This research focuses on developing new and computationally efficient statistical learning methods for multicategory classification and personalized medical decision making. Motivated by the challenge of multicategory classification problems, and the computational efficiency and theoretical properties of support vector machines (SVM), a novel learning algorithm is proposed. The method is then adapted to estimating multicategory individualized treatment by connecting with outcome weighted learning. At last, an application to Electronic Health Record data is explored. The proposed algorithm, forward-backward SVM (FB-SVM) is based on a sequential binary classification algorithm and relies on support vector machines for each binary classification and utilizes only feasible data in each step. The method guarantees convergence and entails light computational burden. We prove the theoretical property of Fisher consistency of the classification rule derived from the FB-SVM and obtain the risk bound for the predicted misclassification rate. We conduct extensive simulation and application studies, using popular benchmarking data and data from a newly completed real-world study, to demonstrate that the proposed method has superior performance, in terms of low misclassification rates and significantly improved computational speed when compared to existing methods. Furthermore, we generalize the proposed FB-SVM with outcome weighted learning to estimate optimal individualized treatment rule (ITR) with multiple options of treatment, namely sequential outcome-weighted learning (SOM). Theoretically, we show that the resulting ITR is Fisher consistent. We demonstrate the performance of proposed method with extensive simulations. An application to a three-arm randomized trial of treating major depressive disorder shows that an individualized treatment strategy tailored to individual characteristics such as patients' expectancy of treatment efficacy and baseline depression severity reduces depressive symptoms more than non-personalized treatment strategies. Finally, we discuss how the proposed SOM learning can be used to estimate optimal ITRs with safety concerns in high dimensional data with patients' adverse reaction records who have taken statin medicine. We adopt sampling techniques, inverse probability weighting, propensity score adjustment, and variable clustering along with SOM learning in our analysis. Considering patients' demographics and medical history, we are able to recommend the best statin drug which has the lowest risk to cause myopathy or rhabdomyolysis.Doctor of Philosoph

    Active Clinical Trials for Personalized Medicine

    Full text link
    Individualized treatment rules (ITRs) tailor treatments according to individual patient characteristics. They can significantly improve patient care and are thus becoming increasingly popular. The data collected during randomized clinical trials are often used to estimate the optimal ITRs. However, these trials are generally expensive to run, and, moreover, they are not designed to efficiently estimate ITRs. In this paper, we propose a cost-effective estimation method from an active learning perspective. In particular, our method recruits only the "most informative" patients (in terms of learning the optimal ITRs) from an ongoing clinical trial. Simulation studies and real-data examples show that our active clinical trial method significantly improves on competing methods. We derive risk bounds and show that they support these observed empirical advantages.Comment: 48 Page, 9 Figures. To Appear in JASA--T&

    Subgroup Identification Using the personalized Package

    Get PDF
    A plethora of disparate statistical methods have been proposed for subgroup identification to help tailor treatment decisions for patients. However a majority of them do not have corresponding R packages and the few that do pertain to particular statistical methods or provide little means of evaluating whether meaningful subgroups have been found. Recently, the work of Chen, Tian, Cai, and Yu (2017) unified many of these subgroup identification methods into one general, consistent framework. The goal of the personalized package is to provide a corresponding unified software framework for subgroup identification analyses that provides not only estimation of subgroups, but evaluation of treatment effects within estimated subgroups. The personalized package allows for a variety of subgroup identification methods for many types of outcomes commonly encountered in medical settings. The package is built to incorporate the entire subgroup identification analysis pipeline including propensity score diagnostics, subgroup estimation, analysis of the treatment effects within subgroups, and evaluation of identified subgroups. In this framework, different methods can be accessed with little change in the analysis code. Similarly, new methods can easily be incorporated into the package. Besides familiar statistical models, the package also allows flexible machine learning tools to be leveraged in subgroup identification. Further estimation improvements can be obtained via efficiency augmentation
    • …
    corecore