Search CORE

30 research outputs found

Causal Confusion in Imitation Learning

Author: de Haan Pim
Jayaraman Dinesh
Levine Sergey
Publication venue
Publication date: 04/11/2019
Field of study

Behavioral cloning reduces policy learning to supervised learning by training a discriminative model to predict expert actions given observations. Such discriminative models are non-causal: the training procedure is unaware of the causal structure of the interaction between the expert and the environment. We point out that ignoring causality is particularly damaging because of the distributional shift in imitation learning. In particular, it leads to a counter-intuitive "causal misidentification" phenomenon: access to more information can yield worse performance. We investigate how this problem arises, and propose a solution to combat it through targeted interventions---either environment interaction or expert queries---to determine the correct causal model. We show that causal misidentification occurs in several benchmark control domains as well as realistic driving settings, and validate our solution against DAgger and other baselines and ablations.Comment: Published at NeurIPS 2019 9 pages, plus references and appendice

arXiv.org e-Print Archive

International Migration, Integration and Social Cohesion online publications

UvA-DARE

Linkages among the Foreign Exchange, Stock, and Bond Markets in Japan and the United States

Author: Jiang Yi
Shimizu Shohei
Publication venue
Publication date: 04/10/2023
Field of study

While economic theory explains the linkages among the financial markets of different countries, empirical studies mainly verify the linkages through Granger causality, without considering latent variables or instantaneous effects. Their findings are inconsistent regarding the existence of causal linkages among financial markets, which might be attributed to differences in the focused markets, data periods, and methods applied. Our study adopts causal discovery methods including VAR-LiNGAM and LPCMCI with domain knowledge to explore the linkages among financial markets in Japan and the United States (US) for the post Covid-19 pandemic period under divergent monetary policy directions. The VAR-LiNGAM results reveal that the previous day's US market influences the following day's Japanese market for both stocks and bonds, and the bond markets of the previous day impact the following day's foreign exchange (FX) market directly and the following day's Japanese stock market indirectly. The LPCMCI results indicate the existence of potential latent confounders. Our results demonstrate that VAR-LiNGAM uniquely identifies the directed acyclic graph (DAG), and thus provides informative insight into the causal relationship when the assumptions are considered valid. Our study contributes to a better understanding of the linkages among financial markets in the analyzed data period by supporting the existence of linkages between Japan and the US for the same financial markets and among FX, stock, and bond markets, thus highlighting the importance of leveraging causal discovery methods in the financial domain.Comment: Causal Analysis Workshop Series (CAWS) 2023, 18 pages, 7 Figure

arXiv.org e-Print Archive

Causal Confusion in Imitation Learning

Author: de Haan P.
Jayaraman D.
Levine S.
Publication venue: Neural Information Processing Systems Foundation
Publication date: 01/01/2020
Field of study

International Migration, Integration and Social Cohesion online publications

Robustness of Algorithms for Causal Structure Learning to Hyperparameter Choice

Author: Clarke Paul
Machlanski Damian
Samothrakis Spyridon
Publication venue
Publication date: 27/10/2023
Field of study

Hyperparameters play a critical role in machine learning. Hyperparameter tuning can make the difference between state-of-the-art and poor prediction performance for any algorithm, but it is particularly challenging for structure learning due to its unsupervised nature. As a result, hyperparameter tuning is often neglected in favour of using the default values provided by a particular implementation of an algorithm. While there have been numerous studies on performance evaluation of causal discovery algorithms, how hyperparameters affect individual algorithms, as well as the choice of the best algorithm for a specific problem, has not been studied in depth before. This work addresses this gap by investigating the influence of hyperparameters on causal structure learning tasks. Specifically, we perform an empirical evaluation of hyperparameter selection for some seminal learning algorithms on datasets of varying levels of complexity. We find that, while the choice of algorithm remains crucial to obtaining state-of-the-art performance, hyperparameter selection in ensemble settings strongly influences the choice of algorithm, in that a poor choice of hyperparameters can lead to analysts using algorithms which do not give state-of-the-art performance for their data.Comment: 26 pages, 16 figure

arXiv.org e-Print Archive

A systems approach towards remote health-monitoring in older adults: Introducing a zero-interaction digital exhaust.

Author: Botros Angela
Buluschek Philipp
Gatica-Perez Daniel
Gerber Stephan M
Knobel Samuel E J
Mosimann Urs P
Müri René M
Nef Tobias
Pais Bruno
Saner Hugo
Santschi Valérie
Schütz Narayan
Single Michael
Urwyler Prabitha
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 16/08/2022
Field of study

Using connected sensing devices to remotely monitor health is a promising way to help transition healthcare from a rather reactive to a more precision medicine oriented proactive approach, which could be particularly relevant in the face of rapid population ageing and the challenges it poses to healthcare systems. Sensor derived digital measures of health, such as digital biomarkers or digital clinical outcome assessments, may be used to monitor health status or the risk of adverse events like falls. Current research around such digital measures has largely focused on exploring the use of few individual measures obtained through mobile devices. However, especially for long-term applications in older adults, this choice of technology may not be ideal and could further add to the digital divide. Moreover, large-scale systems biology approaches, like genomics, have already proven beneficial in precision medicine, making it plausible that the same could also hold for remote-health monitoring. In this context, we introduce and describe a zero-interaction digital exhaust: a set of 1268 digital measures that cover large parts of a person's activity, behavior and physiology. Making this approach more inclusive of older adults, we base this set entirely on contactless, zero-interaction sensing technologies. Applying the resulting digital exhaust to real-world data, we then demonstrate the possibility to create multiple ageing relevant digital clinical outcome assessments. Paired with modern machine learning, we find these assessments to be surprisingly powerful and often on-par with mobile approaches. Lastly, we highlight the possibility to discover novel digital biomarkers based on this large-scale approach

Hes-so: ArODES Open Archive (University of Applied Sciences and Arts Western Switzerland / Haute école spécialisée de Suisse occidentale / FH Westschweiz)

PubMed Central

Bern Open Repository and Information System (BORIS)