342 research outputs found
Incremental Bootstrapping of Parameterized Motor Skills
Queißer J, Reinhart F, Steil JJ. Incremental Bootstrapping of Parameterized Motor Skills. In: Proc. IEEE Humanoids. IEEE; 2016.Many motor skills have an intrinsic, low-dimensional parameterization,
e.g. reaching through a grid to different targets. Repeated policy search
for new parameterizations of such a skill is inefficient, because the structure
of the skill variability is not exploited.
This issue has been previously addressed by learning mappings from task
parameters to policy parameters. In this work, we introduce a bootstrapping
technique that establishes such parameterized skills incrementally.
The approach combines iterative learning with state-of-the-art
black-box policy optimization. We investigate the benefits of
incrementally learning parameterized skills for efficient policy
retrieval and show that the number of required rollouts can be
significantly reduced when optimizing policies for novel tasks.
The approach is demonstrated for several parameterized motor
tasks including upper-body reaching motion generation for the
humanoid robot COMAN
Intrinsically Motivated Goal Exploration Processes with Automatic Curriculum Learning
Intrinsically motivated spontaneous exploration is a key enabler of
autonomous lifelong learning in human children. It enables the discovery and
acquisition of large repertoires of skills through self-generation,
self-selection, self-ordering and self-experimentation of learning goals. We
present an algorithmic approach called Intrinsically Motivated Goal Exploration
Processes (IMGEP) to enable similar properties of autonomous or self-supervised
learning in machines. The IMGEP algorithmic architecture relies on several
principles: 1) self-generation of goals, generalized as fitness functions; 2)
selection of goals based on intrinsic rewards; 3) exploration with incremental
goal-parameterized policy search and exploitation of the gathered data with a
batch learning algorithm; 4) systematic reuse of information acquired when
targeting a goal for improving towards other goals. We present a particularly
efficient form of IMGEP, called Modular Population-Based IMGEP, that uses a
population-based policy and an object-centered modularity in goals and
mutations. We provide several implementations of this architecture and
demonstrate their ability to automatically generate a learning curriculum
within several experimental setups including a real humanoid robot that can
explore multiple spaces of goals with several hundred continuous dimensions.
While no particular target goal is provided to the system, this curriculum
allows the discovery of skills that act as stepping stone for learning more
complex skills, e.g. nested tool use. We show that learning diverse spaces of
goals with intrinsic motivations is more efficient for learning complex skills
than only trying to directly learn these complex skills
Ongoing Emergence: A Core Concept in Epigenetic Robotics
We propose ongoing emergence as a core concept in
epigenetic robotics. Ongoing emergence refers to the
continuous development and integration of new skills
and is exhibited when six criteria are satisfied: (1)
continuous skill acquisition, (2) incorporation of new
skills with existing skills, (3) autonomous development
of values and goals, (4) bootstrapping of initial skills, (5)
stability of skills, and (6) reproducibility. In this paper
we: (a) provide a conceptual synthesis of ongoing
emergence based on previous theorizing, (b) review
current research in epigenetic robotics in light of ongoing
emergence, (c) provide prototypical examples of ongoing
emergence from infant development, and (d) outline
computational issues relevant to creating robots
exhibiting ongoing emergence
Bootstrapping of parameterized skills through hybrid optimization in task and policy spaces
Queißer J, Steil JJ. Bootstrapping of parameterized skills through hybrid optimization in task and policy spaces. Frontiers in Robotics and AI. 2018;5:49.Modern robotic applications create high demands on adaptation of actions with respect to
variance in a given task. Reinforcement learning is able to optimize for these changing conditions,
but relearning from scratch is hardly feasible due to the high number of required rollouts. We
propose a parameterized skill that generalizes to new actions for changing task parameters,
which is encoded as a meta-learner that provides parameters for task-specific dynamic motion
primitives. Our work shows that utilizing parameterized skills for initialization of the optimization
process leads to a more effective incremental task learning. In addition, we introduce a hybrid
optimization method that combines a fast coarse optimization on a manifold of policy parameters
with a fine grained parameter search in the unrestricted space of actions. The proposed algorithm
reduces the number of required rollouts for adaptation to new task conditions. Application in
illustrative toy scenarios, for a 10-DOF planar arm, and a humanoid robot point reaching task
validate the approach
Multi-modal Skill Memories for Online Learning of Interactive Robot Movement Generation
Queißer J. Multi-modal Skill Memories for Online Learning of Interactive Robot Movement Generation. Bielefeld: Universität Bielefeld; 2018.Modern robotic applications pose complex requirements with respect to the adaptation of
actions regarding the variability in a given task. Reinforcement learning can optimize for
changing conditions, but relearning from scratch is hardly feasible due to the high number of
required rollouts. This work proposes a parameterized skill that generalizes to new actions
for changing task parameters. The actions are encoded by a meta-learner that provides
parameters for task-specific dynamic motion primitives. Experimental evaluation shows that
the utilization of parameterized skills for initialization of the optimization process leads to a
more effective incremental task learning. A proposed hybrid optimization method combines
a fast coarse optimization on a manifold of policy parameters with a fine-grained parameter
search in the unrestricted space of actions. It is shown that the developed algorithm reduces
the number of required rollouts for adaptation to new task conditions. Further, this work
presents a transfer learning approach for adaptation of learned skills to new situations.
Application in illustrative toy scenarios, for a 10-DOF planar arm, a humanoid robot point
reaching task and parameterized drumming on a pneumatic robot validate the approach.
But parameterized skills that are applied on complex robotic systems pose further
challenges: the dynamics of the robot and the interaction with the environment introduce
model inaccuracies. In particular, high-level skill acquisition on highly compliant robotic
systems such as pneumatically driven or soft actuators is hardly feasible. Since learning of
the complete dynamics model is not feasible due to the high complexity, this thesis examines
two alternative approaches: First, an improvement of the low-level control based on an
equilibrium model of the robot. Utilization of an equilibrium model reduces the learning
complexity and this thesis evaluates its applicability for control of pneumatic and industrial
light-weight robots. Second, an extension of parameterized skills to generalize for forward
signals of action primitives that result in an enhanced control quality of complex robotic
systems. This thesis argues for a shift in the complexity of learning the full dynamics of the
robot to a lower dimensional task-related learning problem. Due to the generalization in
relation to the task variability, online learning for complex robots as well as complex scenarios
becomes feasible. An experimental evaluation investigates the generalization capabilities of
the proposed online learning system for robot motion generation. Evaluation is performed
through simulation of a compliant 2-DOF arm and scalability to a complex robotic system
is demonstrated for a pneumatically driven humanoid robot with 8-DOF
Emerging Linguistic Functions in Early Infancy
This paper presents results from experimental
studies on early language acquisition in infants and
attempts to interpret the experimental results within
the framework of the Ecological Theory of
Language Acquisition (ETLA) recently proposed
by (Lacerda et al., 2004a). From this perspective,
the infant’s first steps in the acquisition of the
ambient language are seen as a consequence of the
infant’s general capacity to represent sensory input
and the infant’s interaction with other actors in its
immediate ecological environment. On the basis of
available experimental evidence, it will be argued
that ETLA offers a productive alternative to
traditional descriptive views of the language
acquisition process by presenting an operative
model of how early linguistic function may emerge
through interaction
Skill Memories for Parameterized Dynamic Action Primitives on the Pneumatically Driven Humanoid Robot Child Affetto
Queißer J, Ishihara H, Hammer B, Steil JJ, Asada M. Skill Memories for Parameterized Dynamic Action Primitives on the Pneumatically Driven Humanoid Robot Child Affetto. Presented at the International Conference on Development and Learning and on Epigenetic Robotics 2018 (ICDL-EPIROB2018), Tokyo
A psychology based approach for longitudinal development in cognitive robotics.
A major challenge in robotics is the ability to learn, from novel experiences, new behavior that is useful for achieving new goals and skills. Autonomous systems must be able to learn solely through the environment, thus ruling out a priori task knowledge, tuning, extensive training, or other forms of pre-programming. Learning must also be cumulative and incremental, as complex skills are built on top of primitive skills. Additionally, it must be driven by intrinsic motivation because formative experience is gained through autonomous activity, even in the absence of extrinsic goals or tasks. This paper presents an approach to these issues through robotic implementations inspired by the learning behavior of human infants. We describe an approach to developmental learning and present results from a demonstration of longitudinal development on an iCub humanoid robot. The results cover the rapid emergence of staged behavior, the role of constraints in development, the effect of bootstrapping between stages, and the use of a schema memory of experiential fragments in learning new skills. The context is a longitudinal experiment in which the robot advanced from uncontrolled motor babbling to skilled hand/eye integrated reaching and basic manipulation of objects. This approach offers promise for further fast and effective sensory-motor learning techniques for robotic learning
Developmental Bootstrapping of AIs
Although some current AIs surpass human abilities in closed artificial worlds
such as board games, their abilities in the real world are limited. They make
strange mistakes and do not notice them. They cannot be instructed easily, fail
to use common sense, and lack curiosity. They do not make good collaborators.
Mainstream approaches for creating AIs are the traditional manually-constructed
symbolic AI approach and generative and deep learning AI approaches including
large language models (LLMs). These systems are not well suited for creating
robust and trustworthy AIs. Although it is outside of the mainstream, the
developmental bootstrapping approach has more potential. In developmental
bootstrapping, AIs develop competences like human children do. They start with
innate competences. They interact with the environment and learn from their
interactions. They incrementally extend their innate competences with
self-developed competences. They interact and learn from people and establish
perceptual, cognitive, and common grounding. They acquire the competences they
need through bootstrapping. However, developmental robotics has not yet
produced AIs with robust adult-level competences. Projects have typically
stopped at the Toddler Barrier corresponding to human infant development at
about two years of age, before their speech is fluent. They also do not bridge
the Reading Barrier, to skillfully and skeptically draw on the socially
developed information resources that power current LLMs. The next competences
in human cognitive development involve intrinsic motivation, imitation
learning, imagination, coordination, and communication. This position paper
lays out the logic, prospects, gaps, and challenges for extending the practice
of developmental bootstrapping to acquire further competences and create
robust, resilient, and human-compatible AIs.Comment: 102 pages, 29 figure
- …