69 research outputs found

    Graph-based Reinforcement Learning meets Mixed Integer Programs: An application to 3D robot assembly discovery

    Robot assembly discovery (RAD) is a challenging problem that lives at the intersection of resource allocation and motion planning. The goal is to combine a predefined set of objects to form something new while considering task execution with the robot-in-the-loop. In this work, we tackle the problem of building arbitrary, predefined target structures entirely from scratch using a set of Tetris-like building blocks and a robotic manipulator. Our novel hierarchical approach aims at efficiently decomposing the overall task into three feasible levels that benefit mutually from each other. On the high level, we run a classical mixed-integer program for global optimization of block-type selection and the blocks' final poses to recreate the desired shape. Its output is then exploited to efficiently guide the exploration of an underlying reinforcement learning (RL) policy. This RL policy draws its generalization properties from a flexible graph-based representation that is learned through Q-learning and can be refined with search. Moreover, it accounts for the necessary conditions of structural stability and robotic feasibility that cannot be effectively reflected in the previous layer. Lastly, a grasp and motion planner transforms the desired assembly commands into robot joint movements. We demonstrate our proposed method's performance on a set of competitive simulated RAD environments, showcase real-world transfer, and report performance and robustness gains compared to an unstructured end-to-end approach. Videos are available at https://sites.google.com/view/rl-meets-milp

    Robust Reinforcement Learning: A Review of Foundations and Recent Advances

    Reinforcement learning (RL) has become a highly successful framework for learning in Markov decision processes (MDPs). Due to the adoption of RL in realistic and complex environments, solution robustness becomes an increasingly important aspect of RL deployment. Nevertheless, current RL algorithms struggle with robustness to uncertainty, disturbances, or structural changes in the environment. We survey the literature on robust approaches to reinforcement learning and categorize these methods in four different ways: (i) Transition robust designs account for uncertainties in the system dynamics by manipulating the transition probabilities between states; (ii) Disturbance robust designs leverage external forces to model uncertainty in the system behavior; (iii) Action robust designs redirect transitions of the system by corrupting an agent’s output; (iv) Observation robust designs exploit or distort the perceived system state of the policy. Each of these robust designs alters a different aspect of the MDP. Additionally, we address the connection of robustness to the risk-based and entropy-regularized RL formulations. The resulting survey covers all fundamental concepts underlying the approaches to robust reinforcement learning and their recent advances.

    Local Online Motor Babbling: Learning Motor Abundance of a Musculoskeletal Robot Arm

    Motor babbling and goal babbling have been used for sensorimotor learning of highly redundant systems in soft robotics. Recent works in goal babbling have demonstrated successful learning of inverse kinematics (IK) on such systems, and suggest that babbling in the goal space better resolves motor redundancy by learning as few yet efficient sensorimotor mappings as possible. However, for musculoskeletal robot systems, motor redundancy can provide useful information to explain muscle activation patterns, hence the term motor abundance. In this work, we introduce some simple heuristics to empirically define the unknown goal space, and learn the IK of a 10-DoF musculoskeletal robot arm using directed goal babbling. We then further propose local online motor babbling guided by Covariance Matrix Adaptation Evolution Strategy (CMA-ES), which bootstraps on the goal babbling samples for initialization, such that motor abundance can be queried online for any static goal. Our approach leverages the resolving of redundancies and the efficient guided exploration of motor abundance in two stages of learning, allowing both kinematic accuracy and motor variability at the queried goal. The results show that local online motor babbling guided by CMA-ES can efficiently explore motor abundance at queried goal positions on a musculoskeletal robot system and gives useful insights into muscle stiffness and synergy. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2019), November 4-8, 2019, Macau, China

    Approaches to monitor and evaluate OER policies in higher education : tracing developments in Germany, Austria, and Switzerland

    The 2019 UNESCO recommendation on Open Educational Resources (OER) encourages member states to monitor policies and mechanisms in OER across the world. In higher education, there are many initiatives and policies around OER. This contribution gives insights into the current situation concerning OER policy documents that are of national or institutional relevance for public higher education institutions in Germany, Switzerland, and Austria. For each country, a different approach for identifying OER policy documents was chosen, depending on the availability of documents and the dominant forms of documentation. Whereas digital documents available on the web proved to be helpful sources for Germany, and performance agreements between the national ministry and individual universities were used for analysis in Austria, a survey amongst all universities was the chosen research approach in Switzerland to give an overview of potentially OER-related policy documents. All these documents are now made available via the OER World Map. With this contribution, the authors also highlight the possibility of using the OER World Map as a powerful tool to collect and evaluate OER policy documents.

    Tacaribe Virus but Not Junin Virus Infection Induces Cytokine Release from Primary Human Monocytes and Macrophages

    The mechanisms underlying the development of disease during arenavirus infection are poorly understood. However, common to all hemorrhagic fever diseases is the involvement of macrophages as primary target cells, suggesting that the immune response in these cells may be of paramount importance during infection. Thus, in order to identify features of the immune response that contribute to arenavirus pathogenesis, we have examined the growth kinetics and cytokine profiles of two closely related New World arenaviruses, the apathogenic Tacaribe virus (TCRV) and the hemorrhagic fever-causing Junin virus (JUNV), in primary human monocytes and macrophages. Both viruses grew robustly in VeroE6 cells; however, TCRV titres were decreased by approximately 10-fold compared to JUNV in both monocytes and macrophages. Infection of both monocytes and macrophages with TCRV also resulted in the release of high levels of IL-6, IL-10 and TNF-α, while levels of IFN-α, IFN-β and IL-12 were not affected. However, we could show that the presence of these cytokines had no direct effect on growth of either TCRV or JUNV in macrophages. Further analysis also showed that while the production of IL-6 and IL-10 is dependent on viral replication, production of TNF-α also occurs after exposure to UV-inactivated TCRV particles and is thus independent of productive virus infection. Surprisingly, JUNV infection did not have an effect on any of the cytokines examined, indicating that, in contrast to other viral hemorrhagic fever viruses, macrophage-derived cytokine production is unlikely to play an active role in contributing to the cytokine dysregulation observed in JUNV-infected patients. Rather, these results suggest that an early, controlled immune response by infected macrophages may be critical for the successful control of infection of apathogenic viruses and prevention of subsequent disease, including systemic cytokine dysregulation.

    Salinity variations in the northern Coorong Lagoon, South Australia: Significant changes in the ecosystem following human alteration to the natural water regime

    European settlement and drought have significantly impacted the hydrology of the Coorong, a shallow coastal lagoon complex in South Australia, which is part of a terminal wetland at the mouth of the River Murray. An increased salinity associated with lower water levels and progressive isolation from ocean flushes contributed to a severe decline in ecological diversity over the past decades. Here we have conducted a molecular and stable isotopic study of a sedimentary core from the northern Coorong Lagoon spanning more than 5000 years to investigate the recent palaeoenvironmental history of the ecosystem. Major alterations were evident in many biogeochemical parameters in sediments deposited after the 1950s, coinciding with the beginning of intensified water regulations. The most prominent shift occurred in δ13C profiles of C21–C33 n-alkanes from average values of −23.5‰ to an average of −28.2‰. Further changes included decreases in carbon preference index (CPI) and average chain length (ACL) of the n-alkane series as well as significant increases in algal (e.g. C20 HBI, long chain alkenes and C29-alkadiene) and bacterial (e.g. 13C-depleted short chain n-alkanes and hopanoids, δ13C: −35.9‰ to −30.1‰) derived hydrocarbons. Long chain n-alkanes with a strong odd/even predominance as observed here are typically attributed to terrigenous plants. In the Coorong, however, terrigenous input to sedimentary OM is only minor. Therefore, changes in the aforementioned parameters were attributed to a source transition from a major contribution of macrophytes towards predominantly microalgae and bacteria. δD values of C21–C33 n-alkanes showed a general trend towards more enriched values in younger sediments, indicating an overall rising salinity. However, the most pronounced positive shift in these profiles again occurred after the 1950s. Altogether, this study demonstrates that the recent human-induced changes of the Coorong hydrology, compounded by a severe drought, led to an increase in salinity and alterations of primary production which have been much more significant than natural variations occurring throughout the Holocene over several thousands of years.