30 research outputs found
Causal Confusion in Imitation Learning
Behavioral cloning reduces policy learning to supervised learning by training
a discriminative model to predict expert actions given observations. Such
discriminative models are non-causal: the training procedure is unaware of the
causal structure of the interaction between the expert and the environment. We
point out that ignoring causality is particularly damaging because of the
distributional shift in imitation learning. In particular, it leads to a
counter-intuitive "causal misidentification" phenomenon: access to more
information can yield worse performance. We investigate how this problem
arises, and propose a solution to combat it through targeted
interventions---either environment interaction or expert queries---to determine
the correct causal model. We show that causal misidentification occurs in
several benchmark control domains as well as realistic driving settings, and
validate our solution against DAgger and other baselines and ablations.Comment: Published at NeurIPS 2019 9 pages, plus references and appendice
Linkages among the Foreign Exchange, Stock, and Bond Markets in Japan and the United States
While economic theory explains the linkages among the financial markets of
different countries, empirical studies mainly verify the linkages through
Granger causality, without considering latent variables or instantaneous
effects. Their findings are inconsistent regarding the existence of causal
linkages among financial markets, which might be attributed to differences in
the focused markets, data periods, and methods applied. Our study adopts causal
discovery methods including VAR-LiNGAM and LPCMCI with domain knowledge to
explore the linkages among financial markets in Japan and the United States
(US) for the post Covid-19 pandemic period under divergent monetary policy
directions. The VAR-LiNGAM results reveal that the previous day's US market
influences the following day's Japanese market for both stocks and bonds, and
the bond markets of the previous day impact the following day's foreign
exchange (FX) market directly and the following day's Japanese stock market
indirectly. The LPCMCI results indicate the existence of potential latent
confounders. Our results demonstrate that VAR-LiNGAM uniquely identifies the
directed acyclic graph (DAG), and thus provides informative insight into the
causal relationship when the assumptions are considered valid. Our study
contributes to a better understanding of the linkages among financial markets
in the analyzed data period by supporting the existence of linkages between
Japan and the US for the same financial markets and among FX, stock, and bond
markets, thus highlighting the importance of leveraging causal discovery
methods in the financial domain.Comment: Causal Analysis Workshop Series (CAWS) 2023, 18 pages, 7 Figure
Robustness of Algorithms for Causal Structure Learning to Hyperparameter Choice
Hyperparameters play a critical role in machine learning. Hyperparameter
tuning can make the difference between state-of-the-art and poor prediction
performance for any algorithm, but it is particularly challenging for structure
learning due to its unsupervised nature. As a result, hyperparameter tuning is
often neglected in favour of using the default values provided by a particular
implementation of an algorithm. While there have been numerous studies on
performance evaluation of causal discovery algorithms, how hyperparameters
affect individual algorithms, as well as the choice of the best algorithm for a
specific problem, has not been studied in depth before. This work addresses
this gap by investigating the influence of hyperparameters on causal structure
learning tasks. Specifically, we perform an empirical evaluation of
hyperparameter selection for some seminal learning algorithms on datasets of
varying levels of complexity. We find that, while the choice of algorithm
remains crucial to obtaining state-of-the-art performance, hyperparameter
selection in ensemble settings strongly influences the choice of algorithm, in
that a poor choice of hyperparameters can lead to analysts using algorithms
which do not give state-of-the-art performance for their data.Comment: 26 pages, 16 figure
A systems approach towards remote health-monitoring in older adults: Introducing a zero-interaction digital exhaust.
Using connected sensing devices to remotely monitor health is a promising way to help transition healthcare from a rather reactive to a more precision medicine oriented proactive approach, which could be particularly relevant in the face of rapid population ageing and the challenges it poses to healthcare systems. Sensor derived digital measures of health, such as digital biomarkers or digital clinical outcome assessments, may be used to monitor health status or the risk of adverse events like falls. Current research around such digital measures has largely focused on exploring the use of few individual measures obtained through mobile devices. However, especially for long-term applications in older adults, this choice of technology may not be ideal and could further add to the digital divide. Moreover, large-scale systems biology approaches, like genomics, have already proven beneficial in precision medicine, making it plausible that the same could also hold for remote-health monitoring. In this context, we introduce and describe a zero-interaction digital exhaust: a set of 1268 digital measures that cover large parts of a person's activity, behavior and physiology. Making this approach more inclusive of older adults, we base this set entirely on contactless, zero-interaction sensing technologies. Applying the resulting digital exhaust to real-world data, we then demonstrate the possibility to create multiple ageing relevant digital clinical outcome assessments. Paired with modern machine learning, we find these assessments to be surprisingly powerful and often on-par with mobile approaches. Lastly, we highlight the possibility to discover novel digital biomarkers based on this large-scale approach