6 research outputs found

    Joint Goal and Strategy Inference across Heterogeneous Demonstrators via Reward Network Distillation

    Full text link
    Reinforcement learning (RL) has achieved tremendous success as a general framework for learning how to make decisions. However, this success relies on the interactive hand-tuning of a reward function by RL experts. On the other hand, inverse reinforcement learning (IRL) seeks to learn a reward function from readily-obtained human demonstrations. Yet, IRL suffers from two major limitations: 1) reward ambiguity - there are an infinite number of possible reward functions that could explain an expert's demonstration and 2) heterogeneity - human experts adopt varying strategies and preferences, which makes learning from multiple demonstrators difficult due to the common assumption that demonstrators seeks to maximize the same reward. In this work, we propose a method to jointly infer a task goal and humans' strategic preferences via network distillation. This approach enables us to distill a robust task reward (addressing reward ambiguity) and to model each strategy's objective (handling heterogeneity). We demonstrate our algorithm can better recover task reward and strategy rewards and imitate the strategies in two simulated tasks and a real-world table tennis task.Comment: In Proceedings of the 2020 ACM/IEEE In-ternational Conference on Human-Robot Interaction (HRI '20), March 23 to 26, 2020, Cambridge, United Kingdom.ACM, New York, NY, USA, 10 page

    Stress and worry in the 2020 coronavirus pandemic: Relationships to trust and compliance with preventive measures across 48 countries in the COVIDiSTRESS global survey

    Get PDF
    The COVIDiSTRESS global survey collects data on early human responses to the 2020 COVID-19 pandemic from 173 429 respondents in 48 countries. The open science study was co-designed by an international consortium of researchers to investigate how psychological responses differ across countries and cultures, and how this has impacted behaviour, coping and trust in government efforts to slow the spread of the virus. Starting in March 2020, COVIDiSTRESS leveraged the convenience of unpaid online recruitment to generate public data. The objective of the present analysis is to understand relationships between psychological responses in the early months of global coronavirus restrictions and help understand how different government measures succeed or fail in changing public behaviour. There were variations between and within countries. Although Western Europeans registered as more concerned over COVID-19, more stressed, and having slightly more trust in the governments' efforts, there was no clear geographical pattern in compliance with behavioural measures. Detailed plots illustrating between-countries differences are provided. Using both traditional and Bayesian analyses, we found that individuals who worried about getting sick worked harder to protect themselves and others. However, concern about the coronavirus itself did not account for all of the variances in experienced stress during the early months of COVID-19 restrictions. More alarmingly, such stress was associated with less compliance. Further, those most concerned over the coronavirus trusted in government measures primarily where policies were strict. While concern over a disease is a source of mental distress, other factors including strictness of protective measures, social support and personal lockdown conditions must also be taken into consideration to fully appreciate the psychological impact of COVID-19 and to understand why some people fail to follow behavioural guidelines intended to protect themselves and others from infection. The Stage 1 manuscript associated with this submission received in-principle acceptance (IPA) on 18 May 2020. Following IPA, the accepted Stage 1 version of the manuscript was preregistered on the Open Science Framework at https://osf.io/g2t3b. This preregistration was performed prior to data analysis

    Stress and worry in the 2020 coronavirus pandemic : relationships to trust and compliance with preventive measures across 48 countries in the COVIDiSTRESS global survey

    Get PDF
    The COVIDiSTRESS global survey collects data on early human responses to the 2020 COVID-19 pandemic from 173 429 respondents in 48 countries. The open science study was co-designed by an international consortium of researchers to investigate how psychological responses differ across countries and cultures, and how this has impacted behaviour, coping and trust in government efforts to slow the spread of the virus. Starting in March 2020, COVIDiSTRESS leveraged the convenience of unpaid online recruitment to generate public data. The objective of the present analysis is to understand relationships between psychological responses in the early months of global coronavirus restrictions and help understand how different government measures succeed or fail in changing public behaviour. There were variations between and within countries. Although Western Europeans registered as more concerned over COVID-19, more stressed, and having slightly more trust in the governments' efforts, there was no clear geographical pattern in compliance with behavioural measures. Detailed plots illustrating between-countries differences are provided. Using both traditional and Bayesian analyses, we found that individuals who worried about getting sick worked harder to protect themselves and others. However, concern about the coronavirus itself did not account for all of the variances in experienced stress during the early months of COVID-19 restrictions. More alarmingly, such stress was associated with less compliance. Further, those most concerned over the coronavirus trusted in government measures primarily where policies were strict. While concern over a disease is a source of mental distress, other factors including strictness of protective measures, social support and personal lockdown conditions must also be taken into consideration to fully appreciate the psychological impact of COVID-19 and to understand why some people fail to follow behavioural guidelines intended to protect themselves and others from infection. The Stage 1 manuscript associated with this submission received in-principle acceptance (IPA) on 18 May 2020. Following IPA, the accepted Stage 1 version of the manuscript was preregistered on the Open Science Framework at https://osf.io/g2t3b. This preregistration was performed prior to data analysis.Peer reviewe
    corecore