1,588 research outputs found

    Structure Learning in Motor Control:A Deep Reinforcement Learning Model

    Full text link
    Motor adaptation displays a structure-learning effect: adaptation to a new perturbation occurs more quickly when the subject has prior exposure to perturbations with related structure. Although this `learning-to-learn' effect is well documented, its underlying computational mechanisms are poorly understood. We present a new model of motor structure learning, approaching it from the point of view of deep reinforcement learning. Previous work outside of motor control has shown how recurrent neural networks can account for learning-to-learn effects. We leverage this insight to address motor learning, by importing it into the setting of model-based reinforcement learning. We apply the resulting processing architecture to empirical findings from a landmark study of structure learning in target-directed reaching (Braun et al., 2009), and discuss its implications for a wider range of learning-to-learn phenomena.Comment: 39th Annual Meeting of the Cognitive Science Society, to appea

    The relationship of myocardial contraction and electrical excitation—the correlation between scintigraphic phase image analysis and electrophysiologic mapping

    Get PDF
    Phase imaging derived from equilibrium radionuclide angiography presents the ventricular contraction sequence. It has been widely but only indirectly correlated with the sequence of electrical myocardial activation. We sought to determine the specific relationship between the sequence of phase progression and the sequence of myocardial activation, contraction and conduction, in order to document a noninvasive method that could monitor both. In 7 normal and 9 infarcted dogs, the sequence of phase angle was correlated with the epicardial activation map in 126 episodes of sinus rhythm and pacing from three ventricular sites. In each episode, the site of earliest phase angle was identical to the focus of initial epicardial activation. Similarly, the serial contraction pattern by phase image analysis matched the electrical epicardial activation sequence completely or demonstrated good agreement in approximately 85% of pacing episodes, without differences between normal or infarct groups. A noninvasive method to accurately determine the sequence of contraction may serve as a surrogate for the associated electrical activation sequence or be applied to identify their differences

    A Unified Theory of Dual-Process Control

    Full text link
    Dual-process theories play a central role in both psychology and neuroscience, figuring prominently in fields ranging from executive control to reward-based learning to judgment and decision making. In each of these domains, two mechanisms appear to operate concurrently, one relatively high in computational complexity, the other relatively simple. Why is neural information processing organized in this way? We propose an answer to this question based on the notion of compression. The key insight is that dual-process structure can enhance adaptive behavior by allowing an agent to minimize the description length of its own behavior. We apply a single model based on this observation to findings from research on executive control, reward-based learning, and judgment and decision making, showing that seemingly diverse dual-process phenomena can be understood as domain-specific consequences of a single underlying set of computational principles

    A deep active inference model of the rubber-hand illusion

    Full text link
    Understanding how perception and action deal with sensorimotor conflicts, such as the rubber-hand illusion (RHI), is essential to understand how the body adapts to uncertain situations. Recent results in humans have shown that the RHI not only produces a change in the perceived arm location, but also causes involuntary forces. Here, we describe a deep active inference agent in a virtual environment, which we subjected to the RHI, that is able to account for these results. We show that our model, which deals with visual high-dimensional inputs, produces similar perceptual and force patterns to those found in humans.Comment: 8 pages, 3 figures, Accepted in 1st International Workshop on Active Inference, in Conjunction with European Conference of Machine Learning 2020. The final authenticated publication is available online at https://doi.org/10.1007/978-3-030-64919-7_1

    Minimum Description Length Control

    Full text link
    We propose a novel framework for multitask reinforcement learning based on the minimum description length (MDL) principle. In this approach, which we term MDL-control (MDL-C), the agent learns the common structure among the tasks with which it is faced and then distills it into a simpler representation which facilitates faster convergence and generalization to new tasks. In doing so, MDL-C naturally balances adaptation to each task with epistemic uncertainty about the task distribution. We motivate MDL-C via formal connections between the MDL principle and Bayesian inference, derive theoretical performance guarantees, and demonstrate MDL-C's empirical effectiveness on both discrete and high-dimensional continuous control tasks

    Subgoal- and goal-related reward prediction errors in medial prefrontal cortex

    Get PDF
    A longstanding view of the organization of human and animal behavior holds that behavior is hierarchically organizedin other words, directed toward achieving superordinate goals through the achievement of subordinate goals or subgoals. However, most research in neuroscience has focused on tasks without hierarchical structure. In past work, we have shown that negative reward prediction error (RPE) signals in medial prefrontal cortex (mPFC) can be linked not only to superordinate goals but also to subgoals. This suggests that mPFC tracks impediments in the progression toward subgoals. Using fMRI of human participants engaged in a hierarchical navigation task, here we found that mPFC also processes positive prediction errors at the level of subgoals, indicating that this brain region is sensitive to advances in subgoal completion. However, when subgoal RPEs were elicited alongside with goal-related RPEs, mPFC responses reflected only the goal-related RPEs. These findings suggest that information from different levels of hierarchy is processed selectively, depending on the task context
    • …
    corecore