3,130 research outputs found

    Online Learning of a Memory for Learning Rates

    Full text link
    The promise of learning to learn for robotics rests on the hope that by extracting some information about the learning process itself we can speed up subsequent similar learning tasks. Here, we introduce a computationally efficient online meta-learning algorithm that builds and optimizes a memory model of the optimal learning rate landscape from previously observed gradient behaviors. While performing task specific optimization, this memory of learning rates predicts how to scale currently observed gradients. After applying the gradient scaling our meta-learner updates its internal memory based on the observed effect its prediction had. Our meta-learner can be combined with any gradient-based optimizer, learns on the fly and can be transferred to new optimization tasks. In our evaluations we show that our meta-learning algorithm speeds up learning of MNIST classification and a variety of learning control tasks, either in batch or online learning settings.Comment: accepted to ICRA 2018, code available: https://github.com/fmeier/online-meta-learning ; video pitch available: https://youtu.be/9PzQ25FPPO

    Pedagogical content knowledge in students majoring in physical education vs. sport science: the same but different?

    Get PDF
    Pedagogical content knowledge (PCK) is a special feature providing the teacher with knowledge to transform the content in ways that make it understandable to learners. This is of special importance in physical education (PE), since it is significantly different to other school subjects in many ways i.e., it is the only subject whereby physical activity (PA) is a primary means of accomplishing educational objectives. Given the importance of PCK, it is of special interest to explore the specificity of PCK in in the field of sport science. As research on PCK in German speaking countries is still at the beginning, a cross-sectional study was conducted among 622 students to explore potential differences in relation to education programmes (PE Teacher Education n = 431, sport science n = 191). Measurement invariance (MI) between the groups was carried out using multigroup confirmatory factor analysis models to ensure latent mean scores can be compared meaningfully. The progressive evaluation of MI confirms that it is possible to measure the PCK (scalar) equivalently across PETE and sport science students, along with additional variables relevant to PCK. PETE students outperformed sport science students in terms of the “instruction” subdimension (also in different stages of study), whereas not in the “student” subdimension. Prior experience in the field of PA is an advantage for high scores only in the “instruction” subdimension. Finally, the study provides first insights into the specificity of PCK in the field of sport science

    An investigation of the pedagogical content knowledge across German preservice (physical education) teachers

    Get PDF
    Pedagogical content knowledge (PCK) is of critical importance to Physical Education (PE), since teaching PE is fundamentally distinct from teaching other subjects in many significant ways. Despite the importance of PCK, research on PCK in German speaking countries is still at the beginning. Against this backdrop, the current study explores the extent to which PCK is a specific professional feature across German students aiming for a teaching degree in PE or not. A cross-sectional study was conducted among 762 students to explore potential differences in relation to teacher education (TE) programs (PETE students n = 431, TE students n = 331). Measurement invariance (MI) between the groups was carried out using multigroup confirmatory factor analysis models to ensure latent mean scores can be compared meaningfully. The progressive evaluation of MI confirms that it is possible to measure the PCK (scalar) equivalently across PETE and TE students. PETE students outperformed TE students in both PCK subdimensions, also in different stages of the study. The study provides evidence for the “professional knowledge” and “qualification hypothesis” within PETE programs

    Learning Sensor Feedback Models from Demonstrations via Phase-Modulated Neural Networks

    Full text link
    In order to robustly execute a task under environmental uncertainty, a robot needs to be able to reactively adapt to changes arising in its environment. The environment changes are usually reflected in deviation from expected sensory traces. These deviations in sensory traces can be used to drive the motion adaptation, and for this purpose, a feedback model is required. The feedback model maps the deviations in sensory traces to the motion plan adaptation. In this paper, we develop a general data-driven framework for learning a feedback model from demonstrations. We utilize a variant of a radial basis function network structure --with movement phases as kernel centers-- which can generally be applied to represent any feedback models for movement primitives. To demonstrate the effectiveness of our framework, we test it on the task of scraping on a tilt board. In this task, we are learning a reactive policy in the form of orientation adaptation, based on deviations of tactile sensor traces. As a proof of concept of our method, we provide evaluations on an anthropomorphic robot. A video demonstrating our approach and its results can be seen in https://youtu.be/7Dx5imy1KcwComment: 8 pages, accepted to be published at the International Conference on Robotics and Automation (ICRA) 201

    A New Data Source for Inverse Dynamics Learning

    Full text link
    Modern robotics is gravitating toward increasingly collaborative human robot interaction. Tools such as acceleration policies can naturally support the realization of reactive, adaptive, and compliant robots. These tools require us to model the system dynamics accurately -- a difficult task. The fundamental problem remains that simulation and reality diverge--we do not know how to accurately change a robot's state. Thus, recent research on improving inverse dynamics models has been focused on making use of machine learning techniques. Traditional learning techniques train on the actual realized accelerations, instead of the policy's desired accelerations, which is an indirect data source. Here we show how an additional training signal -- measured at the desired accelerations -- can be derived from a feedback control signal. This effectively creates a second data source for learning inverse dynamics models. Furthermore, we show how both the traditional and this new data source, can be used to train task-specific models of the inverse dynamics, when used independently or combined. We analyze the use of both data sources in simulation and demonstrate its effectiveness on a real-world robotic platform. We show that our system incrementally improves the learned inverse dynamics model, and when using both data sources combined converges more consistently and faster.Comment: IROS 201

    Learning Feedback Terms for Reactive Planning and Control

    Full text link
    With the advancement of robotics, machine learning, and machine perception, increasingly more robots will enter human environments to assist with daily tasks. However, dynamically-changing human environments requires reactive motion plans. Reactivity can be accomplished through replanning, e.g. model-predictive control, or through a reactive feedback policy that modifies on-going behavior in response to sensory events. In this paper, we investigate how to use machine learning to add reactivity to a previously learned nominal skilled behavior. We approach this by learning a reactive modification term for movement plans represented by nonlinear differential equations. In particular, we use dynamic movement primitives (DMPs) to represent a skill and a neural network to learn a reactive policy from human demonstrations. We use the well explored domain of obstacle avoidance for robot manipulation as a test bed. Our approach demonstrates how a neural network can be combined with physical insights to ensure robust behavior across different obstacle settings and movement durations. Evaluations on an anthropomorphic robotic system demonstrate the effectiveness of our work.Comment: 8 pages, accepted to be published at ICRA 2017 conferenc
    • …
    corecore