Search CORE

69 research outputs found

Graph-based Reinforcement Learning meets Mixed Integer Programs: An application to 3D robot assembly discovery

Author: Chalvatzaki Georgia
Funk Niklas
Menzenbach Svenja
Peters Jan
Publication venue
Publication date: 02/08/2022
Field of study

Robot assembly discovery is a challenging problem that lives at the intersection of resource allocation and motion planning. The goal is to combine a predefined set of objects to form something new while considering task execution with the robot-in-the-loop. In this work, we tackle the problem of building arbitrary, predefined target structures entirely from scratch using a set of Tetris-like building blocks and a robotic manipulator. Our novel hierarchical approach aims at efficiently decomposing the overall task into three feasible levels that benefit mutually from each other. On the high level, we run a classical mixed-integer program for global optimization of block-type selection and the blocks' final poses to recreate the desired shape. Its output is then exploited to efficiently guide the exploration of an underlying reinforcement learning (RL) policy. This RL policy draws its generalization properties from a flexible graph-based representation that is learned through Q-learning and can be refined with search. Moreover, it accounts for the necessary conditions of structural stability and robotic feasibility that cannot be effectively reflected in the previous layer. Lastly, a grasp and motion planner transforms the desired assembly commands into robot joint movements. We demonstrate our proposed method's performance on a set of competitive simulated RAD environments, showcase real-world transfer, and report performance and robustness gains compared to an unstructured end-to-end approach. Videos are available at https://sites.google.com/view/rl-meets-milp

arXiv.org e-Print Archive

Robust Reinforcement Learning: A Review of Foundations and Recent Advances

Author: Abdulsamad Hany
Clever Debora
Hansel Kay
Moos Janosch
Peters Jan
Stark Svenja
Publication venue: MDPI
Publication date: 01/01/2022
Field of study

Reinforcement learning (RL) has become a highly successful framework for learning in Markov decision processes (MDP). Due to the adoption of RL in realistic and complex environments, solution robustness becomes an increasingly important aspect of RL deployment. Nevertheless, current RL algorithms struggle with robustness to uncertainty, disturbances, or structural changes in the environment. We survey the literature on robust approaches to reinforcement learning and categorize these methods in four different ways: (i) Transition robust designs account for uncertainties in the system dynamics by manipulating the transition probabilities between states; (ii) Disturbance robust designs leverage external forces to model uncertainty in the system behavior; (iii) Action robust designs redirect transitions of the system by corrupting an agent’s output; (iv) Observation robust designs exploit or distort the perceived system state of the policy. Each of these robust designs alters a different aspect of the MDP. Additionally, we address the connection of robustness to the risk-based and entropy-regularized RL formulations. The resulting survey covers all fundamental concepts underlying the approaches to robust reinforcement learning and their recent advances

TUbiblio

tuprints

Local Online Motor Babbling: Learning Motor Abundance of a Musculoskeletal Robot Arm

Author: Hitzmann Arne
Hosoda Koh
Ikemoto Shuhei
Liu Zinan
Peters Jan
Stark Svenja
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 27/01/2020
Field of study

Motor babbling and goal babbling has been used for sensorimotor learning of highly redundant systems in soft robotics. Recent works in goal babbling have demonstrated successful learning of inverse kinematics (IK) on such systems, and suggest that babbling in the goal space better resolves motor redundancy by learning as few yet efficient sensorimotor mappings as possible. However, for musculoskeletal robot systems, motor redundancy can provide useful information to explain muscle activation patterns, thus the term motor abundance. In this work, we introduce some simple heuristics to empirically define the unknown goal space, and learn the IK of a 10 DoF musculoskeletal robot arm using directed goal babbling. We then further propose local online motor babbling guided by Covariance Matrix Adaptation Evolution Strategy (CMA-ES), which bootstraps on the goal babbling samples for initialization, such that motor abundance can be queried online for any static goal. Our approach leverages the resolving of redundancies and the efficient guided exploration of motor abundance in two stages of learning, allowing both kinematic accuracy and motor variability at the queried goal. The result shows that local online motor babbling guided by CMA-ES can efficiently explore motor abundance at queried goal positions on a musculoskeletal robot system and gives useful insights in terms of muscle stiffness and synergy.IEEE/RSJ International Conference on Intelligent Robots and Systems (iROS2019), November 4 - 8, 2019, Macau, Chin

Kyutacar : Kyushu Institute of Technology Academic Repository

Approaches to monitor and evaluate OER policies in higher education : tracing developments in Germany, Austria, and Switzerland

Author: Bedenlier Svenja
Ebner Martin
Edelsbrunner Sarah
Krüger Nicole
Lüthi-Esposito Gabriela
Marín Victoria I.
Neumann Jan
Orr Dominic
Peters Laura N.
Reimer Ricarda T. D.
Schön Sandra
Zawacki-Richter Olaf
Publication venue: Asian Society for Open and Distance Education
Publication date: 30/04/2022
Field of study

The 2019 UNESCO recommendation on Open Educational Resources (OER) encourages member states to monitor policies and mechanisms in OER across the world. In higher education, there are many initiatives and policies around OER. This contribution gives insights into the current situation concerning OER policy documents that are of national or institutional relevance for public higher education institutions in Germany, Switzerland, and Austria. For each country, a different approach for identifying OER policy documents was chosen, dependent on the availability of documents and different dominant forms of documentation. Whereas digital documents available on the web were found as helpful sources for Germany, and performance agreements between the national ministry and individual universities were used for analysis in Austria, a survey amongst all universities was the chosen research approach in Switzerland to give an overview about potentially OER related policy documents. All these documents are now made available via the OER World Map. With this contribution, the authors also highlight the possibility of using the OER World Map as a powerful tool to collect and evaluate OER policy documents

ZENODO

Repositori Obert UdL

ZHAW digitalcollection

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Approaches to monitor and evaluate OER policies in higher education : tracing developments in Germany, Austria, and Switzerland

Author: Bedenlier Svenja
Ebner Martin
Edelsbrunner Sarah
Krüger Nicole
Lüthi-Esposito Gabriela
Marín Victoria I.
Neumann Jan
Orr Dominic
Peters Laura N.
Reimer Ricarda T. D.
Schön Sandra
Zawacki-Richter Olaf
Publication venue: Asian Society for Open and Distance Education
Publication date: 30/04/2022
Field of study

ZHAW digitalcollection

Tacaribe Virus but Not Junin Virus Infection Induces Cytokine Release from Primary Human Monocytes and Macrophages

Author: A Groseth
AL Farone
Allison Groseth
Andreas Kaufmann
Astrid Herwig
C Peters
CB Dejean
CJ Peters
D Enria
D Garcin
D Pannetier
DA Enria
DA Enria
E Lecompte
EM Leroy
G Carballal
H Feldmann
HJ Schnittler
IS Lukashevich
JB Marq
JB McCormick
JL Blejer
L Fan
L Malmgaard
L Martinez-Sobrido
LH Elliott
M Ambrosio
M Bray
MC Weissenbacher
MF Carter
Michaela Weber
ML Flanagan
MS Salvato
MV Heller
PE Marik
PH Gonzalez
RF Marta
RN Charrel
S Baize
S Baize
S Delgado
S Kunz
SC Levis
SI Medeot
SR Paludan
Stephan Becker
Svenja Wolff
T Briese
Thomas Hoenen
Thomas William Geisbert
TW Geisbert
TW Geisbert
U Stroher
V Wahl-Jensen
WJ McBride
Publication venue: Public Library of Science
Publication date: 10/05/2011
Field of study

The mechanisms underlying the development of disease during arenavirus infection are poorly understood. However, common to all hemorrhagic fever diseases is the involvement of macrophages as primary target cells, suggesting that the immune response in these cells may be of paramount importance during infection. Thus, in order to identify features of the immune response that contribute to arenavirus pathogenesis, we have examined the growth kinetics and cytokine profiles of two closely related New World arenaviruses, the apathogenic Tacaribe virus (TCRV) and the hemorrhagic fever-causing Junin virus (JUNV), in primary human monocytes and macrophages. Both viruses grew robustly in VeroE6 cells; however, TCRV titres were decreased by approximately 10 fold compared to JUNV in both monocytes and macrophages. Infection of both monocytes and macrophages with TCRV also resulted in the release of high levels of IL-6, IL-10 and TNF-α, while levels of IFN-α, IFN-β and IL-12 were not affected. However, we could show that the presence of these cytokines had no direct effect on growth of either TCRV of JUNV in macrophages. Further analysis also showed that while the production of IL-6 and IL-10 are dependent on viral replication, production of TNF-α also occurs after exposure to UV-inactivated TCRV particles and is thus independent of productive virus infection. Surprisingly, JUNV infection did not have an effect on any of the cytokines examined indicating that, in contrast to other viral hemorrhagic fever viruses, macrophage-derived cytokine production is unlikely to play an active role in contributing to the cytokine dysregulation observed in JUNV infected patients. Rather, these results suggest that an early, controlled immune response by infected macrophages may be critical for the successful control of infection of apathogenic viruses and prevention of subsequent disease, including systemic cytokine dysregulation

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Salinity variations in the northern Coorong Lagoon, South Australia: Significant changes in the ecosystem following human alteration to the natural water regime

Author: Ahmad
Allard
Andersen
Andrew T. Revill
Atahan
Attaway
Bieger
Boon
Botello
Cane
Chikaraishi
Collister
Collister
Cranwell
Cranwell
Davis
Dawson
Dick
Eglinton
Evelyn Krull
Ficken
Fluin
Freeman
Freeman
Gat
Gelin
Gell
Gelpi
Gribble
Gribble
Grice
Grice
Grice
Grice
Grice
Grice
Grice
Grosjean
Grossi
Gschwend
Han
Huang
Ingram
Jaffé
Jiang
Jones
Kingsford
Kliti Grice
Krull
Krull
Lichtfouse
Logan
Mackenzie
Maheshwari
McKirdy
Mee
Metzger
Metzger
Monson
Mynderse
Mügler
Nabbefeld
Nichols
Oró
Paton
Paul Greenwood
Peters
Polissar
Revill
Sachse
Sauer
Schidlowski
Sessions
Shuttleworth
Suzuki
Svenja Tulipani
Takahashi
ten Haven
ten Haven
Tibby
Tomy
Verardo
Volkman
Volkman
Volkman
Volkman
Webster
Winterton
Zhang
Zhou
Štejnarová
Publication venue: 'Elsevier BV'
Publication date: 01/01/2014
Field of study

European settlement and drought have significantly impacted the hydrology of the Coorong, a shallow coastal lagoon complex in South Australia, which is part of a terminal wetland at the mouth of the River Murray. An increased salinity associated with lower water levels and progressive isolation from ocean flushes contributed to a severe decline in ecological diversity over the past decades. Here we have conducted a molecular and stable isotopic study of a sedimentary core from the northern Coorong Lagoon spanning more than 5000 years to investigate the recent palaeoenvironmental history of the ecosystem. Major alterations were evident in many biogeochemical parameters in sediments deposited after the 1950s coinciding with the beginning of intensified water regulations. The most prominent shift occurred in δ13C profiles of C21–C33n-alkanes from average values of −23.5‰ to an average of −28.2‰.Further changes included decreases in carbon preference index (CPI) and average chain length (ACL) of the n-alkane series as well as significant increases in algal (e.g. C20 HBI, long chain alkenes and C29-alkadiene) and bacterial (e.g. 13C depleted short chain n-alkanes and hopanoids, δ13C: −35.9‰ to −30.1‰) derived hydrocarbons. Long chain n-alkanes with a strong odd/even predominance as observed here are typically attributed to terrigenous plants. In the Coorong however, terrigenous input to sedimentary OM is only minor. Therefore changes in the before mentioned parameters were attributed to a source transition from a major contribution of macrophytes towards predominantly microalgae and bacteria.δD values of C21–C33n-alkanes showed a general trend towards more enriched values in younger sediments, indicating an overall rising salinity. However, the most pronounced positive shift in these profiles again occurred after the 1950s. Altogether this study demonstrates that the recent human induced changes of the Coorong hydrology, compounded by a severe drought led to an increase in salinity and alterations of primary production which have been much more significant than natural variations occurring throughout the Holocene over several thousands of years

Crossref

espace@Curtin