4 research outputs found
Context-Aware Composition of Agent Policies by Markov Decision Process Entity Embeddings and Agent Ensembles
Computational agents support humans in many areas of life and are therefore
found in heterogeneous contexts. This means they operate in rapidly changing
environments and can be confronted with huge state and action spaces. In order
to perform services and carry out activities in a goal-oriented manner, agents
require prior knowledge and therefore have to develop and pursue
context-dependent policies. However, prescribing policies in advance is limited
and inflexible, especially in dynamically changing environments. Moreover, the
context of an agent determines its choice of actions. Since the environments
can be stochastic and complex in terms of the number of states and feasible
actions, activities are usually modelled in a simplified way by Markov decision
processes so that, e.g., agents with reinforcement learning are able to learn
policies, that help to capture the context and act accordingly to optimally
perform activities. However, training policies for all possible contexts using
reinforcement learning is time-consuming. A requirement and challenge for
agents is to learn strategies quickly and respond immediately in cross-context
environments and applications, e.g., the Internet, service robotics,
cyber-physical systems. In this work, we propose a novel simulation-based
approach that enables a) the representation of heterogeneous contexts through
knowledge graphs and entity embeddings and b) the context-aware composition of
policies on demand by ensembles of agents running in parallel. The evaluation
we conducted with the "Virtual Home" dataset indicates that agents with a need
to switch seamlessly between different contexts, can request on-demand composed
policies that lead to the successful completion of context-appropriate
activities without having to learn these policies in lengthy training steps and
episodes, in contrast to agents that use reinforcement learning.Comment: 30 pages, 11 figures, 9 tables, 3 listings, Re-submitted to Semantic
Web Journal, Currently, under revie
Uncertainty and social considerations for mobile assistive robot navigation
An increased interest in mobile robots has been seen over the past years.
The wide range of possible applications, from vacuum cleaners to assistant robots, makes such robots an interesting solution to many everyday problems.
A key requirement for the mass deployment of such robots is to ensure they can safely navigate around our daily living environments.
A robot colliding with or bumping into a person may be, in some contexts, unacceptable.
For example, if a robot working around elderly people collides with one of them, it may cause serious injuries.
This thesis explores four major components required for effective robot navigation: sensing the static environment, detection and tracking of moving people, obstacle and people avoidance with uncertainty measurement, and basic social navigation considerations.
First, to guarantee adherence to basic safety constraints, sensors and algorithms required to measure the complex structure of our daily living environments are explored.
Not only do the static components of the environment have to be measured, but so do any people present.
A people detection and tracking algorithm, aimed for a crowded environment is proposed, thus enhancing the robot's perception capabilities.
Our daily living environments present many inherent sources of uncertainty for robots, one of them arising due to the robot's inability to know people's intentions as they move.
To solve this problem, a motion model that assumes unknown long-term intentions is proposed.
This is used in conjunction with a novel uncertainty aware local planner to create feasible trajectories.
In social situations, the presence of groups of people cannot be neglected when navigating.
To avoid the robot interrupting groups of people, it first needs to be able to detect such groups.
A group detector is proposed which relies on a set of gaze- and geometric-based features.
Avoiding group disruption is finally incorporated into the navigation algorithm by means of taking into account the probability of disrupting a group's activities.
The effectiveness of the four different components is evaluated using real world and simulated data, demonstrating the benefits for mobile robot navigation.Open Acces