2 research outputs found
Causality Refined Diagnostic Prediction
Applying machine learning in the health care domain has shown promising
results in recent years. Interpretable outputs from learning algorithms are
desirable for decision making by health care personnel. In this work, we
explore the possibility of utilizing causal relationships to refine diagnostic
prediction. We focus on the task of diagnostic prediction using discomfort
drawings, and explore two ways to employ causal identification to improve the
diagnostic results. Firstly, we use causal identification to infer the causal
relationships among diagnostic labels which, by itself, provides interpretable
results to aid the decision making and training of health care personnel.
Secondly, we suggest a post-processing approach where the inferred causal
relationships are used to refine the prediction accuracy of a multi-view
probabilistic model. Experimental results show firstly that causal
identification is capable of detecting the causal relationships among
diagnostic labels correctly, and secondly that there is potential for improving
pain diagnostics prediction accuracy using the causal relationships.Comment: NIPS 2017 Workshop on Machine Learning for Health (ML4H
Causal Discovery in the Presence of Missing Data
Missing data are ubiquitous in many domains including healthcare. When these
data entries are not missing completely at random, the (conditional)
independence relations in the observed data may be different from those in the
complete data generated by the underlying causal process. Consequently, simply
applying existing causal discovery methods to the observed data may lead to
wrong conclusions. In this paper, we aim at developing a causal discovery
method to recover the underlying causal structure from observed data that
follow different missingness mechanisms, including missing completely at random
(MCAR), missing at random (MAR), and missing not at random (MNAR). With
missingness mechanisms represented by missingness graphs, we analyse conditions
under which additional correction is needed to derive conditional
independence/dependence relations in the complete data. Based on our analysis,
we propose the Missing Value PC (MVPC) algorithm for both continuous and binary
variables, which extends the PC algorithm to incorporate additional
corrections. Our proposed MVPC is shown in theory to give asymptotically
correct results even on data that are MAR or MNAR. Experimental results on
synthetic data show that the proposed algorithm is able to find correct causal
relations even in the general case of MNAR. Moreover, we create a neuropathic
pain diagnostic simulator for evaluating causal discovery methods. Evaluated on
such simulated neuropathic pain diagnosis records and the other two real world
applications, MVPC outperforms the other benchmark methods