41 research outputs found
Uncertainty-driven Affordance Discovery for Efficient Robotics Manipulation
Robotics affordances, providing information about what actions can be taken
in a given situation, can aid robotics manipulation. However, learning about
affordances requires expensive large annotated datasets of interactions or
demonstrations. In this work, we show active learning can mitigate this problem
and propose the use of uncertainty to drive an interactive affordance discovery
process. We show that our method enables the efficient discovery of visual
affordances for several action primitives, such as grasping, stacking objects,
or opening drawers, strongly improving data efficiency and allowing us to learn
grasping affordances on a real-world setup with an xArm 6 robot arm in a small
number of trials.Comment: Presented at the GMPL workshop @ RSS 202
FOCUS: Object-Centric World Models for Robotics Manipulation
Understanding the world in terms of objects and the possible interplays with
them is an important cognition ability, especially in robotics manipulation,
where many tasks require robot-object interactions. However, learning such a
structured world model, which specifically captures entities and relationships,
remains a challenging and underexplored problem. To address this, we propose
FOCUS, a model-based agent that learns an object-centric world model. Thanks to
a novel exploration bonus that stems from the object-centric representation,
FOCUS can be deployed on robotics manipulation tasks to explore object
interactions more easily. Evaluating our approach on manipulation tasks across
different settings, we show that object-centric world models allow the agent to
solve tasks more efficiently and enable consistent exploration of robot-object
interactions. Using a Franka Emika robot arm, we also showcase how FOCUS could
be adopted in real-world settings
Choreographer: Learning and Adapting Skills in Imagination
Unsupervised skill learning aims to learn a rich repertoire of behaviors
without external supervision, providing artificial agents with the ability to
control and influence the environment. However, without appropriate knowledge
and exploration, skills may provide control only over a restricted area of the
environment, limiting their applicability. Furthermore, it is unclear how to
leverage the learned skill behaviors for adapting to downstream tasks in a
data-efficient manner. We present Choreographer, a model-based agent that
exploits its world model to learn and adapt skills in imagination. Our method
decouples the exploration and skill learning processes, being able to discover
skills in the latent state space of the model. During adaptation, the agent
uses a meta-controller to evaluate and adapt the learned skills efficiently by
deploying them in parallel in imagination. Choreographer is able to learn
skills both from offline data, and by collecting data simultaneously with an
exploration policy. The skills can be used to effectively adapt to downstream
tasks, as we show in the URL benchmark, where we outperform previous approaches
from both pixels and states inputs. The learned skills also explore the
environment thoroughly, finding sparse rewards more frequently, as shown in
goal-reaching tasks from the DMC Suite and Meta-World. Project website:
https://skillchoreographer.github.io
Learning to Navigate from Scratch using World Models and Curiosity: the Good, the Bad, and the Ugly
Learning to navigate unknown environments from scratch is a challenging
problem. This work presents a system that integrates world models with
curiosity-driven exploration for autonomous navigation in new environments. We
evaluate performance through simulations and real-world experiments of varying
scales and complexities. In simulated environments, the approach rapidly and
comprehensively explores the surroundings. Real-world scenarios introduce
additional challenges. Despite demonstrating promise in a small controlled
environment, we acknowledge that larger and dynamic environments can pose
challenges for the current system. Our analysis emphasizes the significance of
developing adaptable and robust world models that can handle environmental
changes to prevent repetitive exploration of the same areas.Comment: IROS 2023 workshop World Models and Predictive Coding in Cognitive
Robotics and IROS 2023 workshop Learning Robot Super Autonom
Disentangling Shape and Pose for Object-Centric Deep Active Inference Models
Active inference is a first principles approach for understanding the brain
in particular, and sentient agents in general, with the single imperative of
minimizing free energy. As such, it provides a computational account for
modelling artificial intelligent agents, by defining the agent's generative
model and inferring the model parameters, actions and hidden state beliefs.
However, the exact specification of the generative model and the hidden state
space structure is left to the experimenter, whose design choices influence the
resulting behaviour of the agent. Recently, deep learning methods have been
proposed to learn a hidden state space structure purely from data, alleviating
the experimenter from this tedious design task, but resulting in an entangled,
non-interpreteable state space. In this paper, we hypothesize that such a
learnt, entangled state space does not necessarily yield the best model in
terms of free energy, and that enforcing different factors in the state space
can yield a lower model complexity. In particular, we consider the problem of
3D object representation, and focus on different instances of the ShapeNet
dataset. We propose a model that factorizes object shape, pose and category,
while still learning a representation for each factor using a deep neural
network. We show that models, with best disentanglement properties, perform
best when adopted by an active agent in reaching preferred observations
A learning gap between neuroscience and reinforcement learning
Historically, artificial intelligence has drawn much inspiration from
neuroscience to fuel advances in the field. However, current progress in
reinforcement learning is largely focused on benchmark problems that fail to
capture many of the aspects that are of interest in neuroscience today. We
illustrate this point by extending a T-maze task from neuroscience for use with
reinforcement learning algorithms, and show that state-of-the-art algorithms
are not capable of solving this problem. Finally, we point out where insights
from neuroscience could help explain some of the issues encountered
Antidepressants Drug Use during COVID-19 Waves in the Tuscan General Population: An Interrupted Time-Series Analysis
In Italy, during the COVID-19 waves two lockdowns were implemented to prevent virus diffusion in the general population. Data on antidepressant (AD) use in these periods are still scarce. This study aimed at exploring the impact of COVID-19 lockdowns on prevalence and incidence of antidepressant drug use in the general population. A population-based study using the healthcare administrative database of Tuscany was performed. We selected a dynamic cohort of subjects with at least one ADs dispensing from 1 January 2018 to 27 December 2020. The weekly prevalence and incidence of drug use were estimated across different segments: pre-lockdown (1 January 2018-8 March 2020), first lockdown (9 March 2020-15 June 2020), post-first lockdown (16 June 2020-15 November 2020) and second lockdown (16 November 2020-27 December 2020). An interrupted time-series analysis was used to assess the effect of lockdowns on the observed outcomes. Compared to the pre-lockdown we observed an abrupt reduction of ADs incidence (Incidence-Ratio: 0.82; 95% Confidence-Intervals: 0.74-0.91) and a slight weekly decrease of prevalence (Prevalence-Ratio: 0.997; 0.996-0.999). During the post-first lockdown AD use increased, with higher incidence- and similar prevalence values compared with those expected in the absence of the outbreak. This pandemic has impacted AD drug use in the general population with potential rebound effects during the period between waves. This calls for future studies aimed at exploring the mid-long term effects of this phenomenon
Chronic disease prevalence from Italian administrative databases in the VALORE project: a validation through comparison of population estimates with general practice databases and national survey
BACKGROUND:
Administrative databases are widely available and have been extensively used to provide estimates of chronic disease prevalence for the purpose of surveillance of both geographical and temporal trends. There are, however, other sources of data available, such as medical records from primary care and national surveys. In this paper we compare disease prevalence estimates obtained from these three different data sources.
METHODS:
Data from general practitioners (GP) and administrative transactions for health services were collected from five Italian regions (Veneto, Emilia Romagna, Tuscany, Marche and Sicily) belonging to all the three macroareas of the country (North, Center, South). Crude prevalence estimates were calculated by data source and region for diabetes, ischaemic heart disease, heart failure and chronic obstructive pulmonary disease (COPD). For diabetes and COPD, prevalence estimates were also obtained from a national health survey. When necessary, estimates were adjusted for completeness of data ascertainment.
RESULTS:
Crude prevalence estimates of diabetes in administrative databases (range: from 4.8% to 7.1%) were lower than corresponding GP (6.2%-8.5%) and survey-based estimates (5.1%-7.5%). Geographical trends were similar in the three sources and estimates based on treatment were the same, while estimates adjusted for completeness of ascertainment (6.1%-8.8%) were slightly higher. For ischaemic heart disease administrative and GP data sources were fairly consistent, with prevalence ranging from 3.7% to 4.7% and from 3.3% to 4.9%, respectively. In the case of heart failure administrative estimates were consistently higher than GPs' estimates in all five regions, the highest difference being 1.4% vs 1.1%. For COPD the estimates from administrative data, ranging from 3.1% to 5.2%, fell into the confidence interval of the Survey estimates in four regions, but failed to detect the higher prevalence in the most Southern region (4.0% in administrative data vs 6.8% in survey data). The prevalence estimates for COPD from GP data were consistently higher than the corresponding estimates from the other two sources.
CONCLUSION:
This study supports the use of data from Italian administrative databases to estimate geographic differences in population prevalence of ischaemic heart disease, treated diabetes, diabetes mellitus and heart failure. The algorithm for COPD used in this study requires further refinement