4,415 research outputs found

    Latent space policy search for robotics

    Get PDF
    Learning motor skills for robots is a hard task. In particular, a high number of degrees-of-freedom in the robot can pose serious challenges to existing reinforcement learning methods, since it leads to a highdimensional search space. However, complex robots are often intrinsically redundant systems and, therefore, can be controlled using a latent manifold of much smaller dimensionality. In this paper, we present a novel policy search method that performs efficient reinforcement learning by uncovering the low-dimensional latent space of actuator redundancies. In contrast to previous attempts at combining reinforcement learning and dimensionality reduction, our approach does not perform dimensionality reduction as a preprocessing step but naturally combines it with policy search. Our evaluations show that the new approach outperforms existing algorithms for learning motor skills with high-dimensional robots

    If cooperation is likely punish mildly: Insights from economic experiments based on the snowdrift game

    Get PDF
    Punishment may deter antisocial behavior. Yet to punish is costly, and the costs often do not offset the gains that are due to elevated levels of cooperation. However, the effectiveness of punishment depends not only on how costly it is, but also on the circumstances defining the social dilemma. Using the snowdrift game as the basis, we have conducted a series of economic experiments to determine whether severe punishment is more effective than mild punishment. We have observed that severe punishment is not necessarily more effective, even if the cost of punishment is identical in both cases. The benefits of severe punishment become evident only under extremely adverse conditions, when to cooperate is highly improbable in the absence of sanctions. If cooperation is likely, mild punishment is not less effective and leads to higher average payoffs, and is thus the much preferred alternative. Presented results suggest that the positive effects of punishment stem not only from imposed fines, but may also have a psychological background. Small fines can do wonders in motivating us to chose cooperation over defection, but without the paralyzing effect that may be brought about by large fines. The later should be utilized only when absolutely necessary.Comment: 15 pages, 6 figures; accepted for publication in PLoS ON

    Causal hierarchy within the thalamo-cortical network in spike and wave discharges

    Get PDF
    Background: Generalised spike wave (GSW) discharges are the electroencephalographic (EEG) hallmark of absence seizures, clinically characterised by a transitory interruption of ongoing activities and impaired consciousness, occurring during states of reduced awareness. Several theories have been proposed to explain the pathophysiology of GSW discharges and the role of thalamus and cortex as generators. In this work we extend the existing theories by hypothesizing a role for the precuneus, a brain region neglected in previous works on GSW generation but already known to be linked to consciousness and awareness. We analysed fMRI data using dynamic causal modelling (DCM) to investigate the effective connectivity between precuneus, thalamus and prefrontal cortex in patients with GSW discharges. Methodology and Principal Findings: We analysed fMRI data from seven patients affected by Idiopathic Generalized Epilepsy (IGE) with frequent GSW discharges and significant GSW-correlated haemodynamic signal changes in the thalamus, the prefrontal cortex and the precuneus. Using DCM we assessed their effective connectivity, i.e. which region drives another region. Three dynamic causal models were constructed: GSW was modelled as autonomous input to the thalamus (model A), ventromedial prefrontal cortex (model B), and precuneus (model C). Bayesian model comparison revealed Model C (GSW as autonomous input to precuneus), to be the best in 5 patients while model A prevailed in two cases. At the group level model C dominated and at the population-level the p value of model C was ∼1. Conclusion: Our results provide strong evidence that activity in the precuneus gates GSW discharges in the thalamo-(fronto) cortical network. This study is the first demonstration of a causal link between haemodynamic changes in the precuneus - an index of awareness - and the occurrence of pathological discharges in epilepsy. © 2009 Vaudano et al

    Mechanical properties of freely suspended atomically thin dielectric layers of mica

    Full text link
    We have studied the elastic deformation of freely suspended atomically thin sheets of muscovite mica, a widely used electrical insulator in its bulk form. Using an atomic force microscope, we carried out bending test experiments to determine the Young's modulus and the initial pre-tension of mica nanosheets with thicknesses ranging from 14 layers down to just one bilayer. We found that their Young's modulus is high (190 GPa), in agreement with the bulk value, which indicates that the exfoliation procedure employed to fabricate these nanolayers does not introduce a noticeable amount of defects. Additionally, ultrathin mica shows low pre-strain and can withstand reversible deformations up to tens of nanometers without breaking. The low pre-tension and high Young's modulus and breaking force found in these ultrathin mica layers demonstrates their prospective use as a complement for graphene in applications requiring flexible insulating materials or as reinforcement in nanocomposites.Comment: 9 pages, 5 figures, selected as cover of Nano Research, Volume 5, Number 8 (2012

    Search for flavour-changing neutral currents in processes with one top quark and a photon using 81 fb−1 of pp collisions at s=13TeV with the ATLAS experiment

    Get PDF
    A search for flavour-changing neutral current (FCNC) events via the coupling of a top quark, a photon, and an up or charm quark is presented using 81 fb−1 of proton–proton collision data taken at a centre-of-mass energy of 13 TeV with the ATLAS detector at the LHC. Events with a photon, an electron or muon, a b-tagged jet, and missing transverse momentum are selected. A neural network based on kinematic variables differentiates between events from signal and background processes. The data are consistent with the background-only hypothesis, and limits are set on the strength of the tqγ coupling in an effective field theory. These are also interpreted as 95% CL upper limits on the cross section for FCNC tγ production via a left-handed (right-handed) tuγ coupling of 36 fb (78 fb) and on the branching ratio for t→γu of 2.8×10−5 (6.1×10−5). In addition, they are interpreted as 95% CL upper limits on the cross section for FCNC tγ production via a left-handed (right-handed) tcγ coupling of 40 fb (33 fb) and on the branching ratio for t→γc of 22×10−5 (18×10−5)
    corecore