1,145 research outputs found

    Hierarchical relative entropy policy search

    Get PDF
    Many reinforcement learning (RL) tasks, especially in robotics, consist of multiple sub-tasks that are strongly structured. Such task structures can be exploited by incorporating hierarchical policies that consist of gating networks and sub-policies. However, this concept has only been partially explored for real world settings and complete methods, derived from first principles, are needed. Real world settings are challenging due to large and continuous state-action spaces that are prohibitive for exhaustive sampling methods. We define the problem of learning sub-policies in continuous state action spaces as finding a hierarchical policy that is composed of a high-level gating policy to select the low-level sub-policies for execution by the agent. In order to efficiently share experience with all sub-policies, also called inter-policy learning, we treat these sub-policies as latent variables which allows for distribution of the update information between the sub-policies. We present three different variants of our algorithm, designed to be suitable for a wide variety of real world robot learning tasks and evaluate our algorithms in two real robot learning scenarios as well as several simulations and comparisons

    Learning sequential motor tasks

    Get PDF
    Many real robot applications require the sequential use of multiple distinct motor primitives. This requirement implies the need to learn the individual primitives as well as a strategy to select the primitives sequentially. Such hierarchical learning problems are commonly either treated as one complex monolithic problem which is hard to learn, or as separate tasks learned in isolation. However, there exists a strong link between the robots strategy and its motor primitives. Consequently, a consistent framework is needed that can learn jointly on the level of the individual primitives and the robots strategy. We present a hierarchical learning method which improves individual motor primitives and, simultaneously, learns how to combine these motor primitives sequentially to solve complex motor tasks. We evaluate our method on the game of robot hockey, which is both difficult to learn in terms of the required motor primitives as well as its strategic elements

    Learning to predict phases of manipulation tasks as hidden states

    Get PDF
    Phase transitions in manipulation tasks often occur when contacts between objects are made or broken. A switch of the phase can result in the robot’s actions suddenly influencing different aspects of its environment. Therefore, the boundaries between phases often correspond to constraints or subgoals of the manipulation task. In this paper, we investigate how the phases of manipulation tasks can be learned from data. The task is modeled as an autoregressive hidden Markov model, wherein the hidden phase transitions depend on the observed states. The model is learned from data using the expectation-maximization algorithm. We demonstrate the proposed method on both a pushing task and a pepper mill turning task. The proposed approach was compared to a standard autoregressive hidden Markov model. The experiments show that the learned models can accurately predict the transitions in phases during the manipulation tasks

    Anodic respiration of Pseudomonas putida KT2440 in a stirred-tank bioreactor

    Get PDF
    Anodic batch production of para-hydroxybenzoic acid (pHBA) from citric acid with a genetically modified Pseudomonas putida KT2440 strain was studied in a bio-electrochemical system (BES) based on a standard lab-scale stirred-tank bioreactor at fully controlled anaerobic reaction conditions. Electron transfer to the anode was mediated by addition of KFe(CN) to the medium. Effects of varying anode surface areas (graphite rod, felt and brush), power input (stirrer speed) and mediator concentrations were investigated. The obligate aerobic P. putida grew anaerobically with mediated anodic respiration and pHBA production was observed. Anodic respiration was best applying the graphite rod electrode which showed a maximal current density of 12.5 mA cm. This is the highest measured for non-porous electrodes in BES until now. Increasing the power input to 2.93 W L (700 rpm) and online control of the redox potential E at 225 mV (vs. Ag/AgCl) in the medium by controlled addition of mediator resulted in a maximal pHBA yield of 9.91 mmolC molC which exceeds pHBA yields in the aerobic batch process by 69 % (5.87 mmolC molC )

    Interaction primitives for human-robot cooperation tasks

    Get PDF
    To engage in cooperative activities with human partners, robots have to possess basic interactive abilities and skills. However, programming such interactive skills is a challenging task, as each interaction partner can have different timing or an alternative way of executing movements. In this paper, we propose to learn interaction skills by observing how two humans engage in a similar task. To this end, we introduce a new representation called Interaction Primitives. Interaction primitives build on the framework of dynamic motor primitives (DMPs) by maintaining a distribution over the parameters of the DMP. With this distribution, we can learn the inherent correlations of cooperative activities which allow us to infer the behavior of the partner and to participate in the cooperation. We will provide algorithms for synchronizing and adapting the behavior of humans and robots during joint physical activities

    Synthetic induction of immunogenic cell death by genetic stimulation of endoplasmic reticulum stress.

    Get PDF
    Cis-diamminedichloridoplatinum(II) (CDDP), commonly referred to as cisplatin, is a chemotherapeutic drug used for the treatment of a wide range of solid cancers. CDDP is a relatively poor inducer of immunogenic cell death (ICD), a cell death modality that converts dying cells into a tumor vaccine, stimulating an immune response against residual cancer cells that permits long-lasting immunity and a corresponding reduction in tumor growth. The incapacity of CDDP to trigger ICD is at least partially due to its failure to stimulate the premortem endoplasmic reticulum (ER)-stress response required for the externalization of the "eat-me" signal calreticulin (CRT) on the surface of dying cancer cells. Here, we developed a murine cancer cell line genetically modified to express the ER resident protein reticulon-1c (Rtn-1c) by virtue of tetracycline induction and showed that enforced Rtn-1c expression combined with CDDP treatment promoted CRT externalization to the surface of cancer cells. In contrast to single agent treatments, the tetracycline-mediated Rtn-1c induction combined with CDDP chemotherapy stimulated ICD as measured by the capacity of dying tumor cells, inoculated into syngenic immunocompetent mice, to mount an immune response to tumor re-challenge 1 week later. More importantly, established tumors, forced to constitutively express Rtn-1c in vivo by continuous treatment with tetracycline, became responsive to CDDP and exhibited a corresponding reduction in the rate of tumor growth. The combined therapeutic effects of Rtn-1c induction with CDDP treatment was only detected in the context of an intact immune system and not in nu/nu mice lacking thymus-dependent T lymphocytes. Altogether, these results indicate that the artificial or "synthetic" induction of immunogenic cell death by genetic manipulation of the ER-stress response can improve the efficacy of chemotherapy with CDDP by stimulating anticancer immunity

    Trial Watch: experimental TLR7/TLR8 agonists for oncological indications

    Get PDF
    Resiquimod (R848) and motolimod (VTX-2337) are second-generation experimental derivatives of imiquimod, an imidazoquinoline with immunostimulatory properties originally approved by the US Food and Drug Administration for the topical treatment of actinic keratosis and genital warts more than 20 years ago. Both resiquimod and motolimod operate as agonists of Toll-like receptor 7 (TLR7) and/or TLR8, in thus far delivering adjuvant-like signals to antigen-presenting cells (APCs). In line with such an activity, these compounds are currently investigated as immunostimulatory agents for the treatment of various malignancies, especially in combination with peptide-based, dendritic cell-based, cancer cell lysate-based, or DNA-based vaccines. Here, we summarize preclinical and clinical evidence recently collected to support the development of resiquimod and motolimod and other TLR7/TLR8 agonists as anticancer agents

    A long non-coding RNA links calreticulin-mediated immunogenic cell removal to RB1 transcription

    Get PDF
    A subset of promoters bidirectionally expresses long non-coding RNAs (ncRNAs) of unknown function and protein-coding genes (PCGs) in parallel. Here, we define a set of 1107 highly conserved human bidirectional promoters that mediate the linked expression of long ncRNAs and PCGs. Depletion of the long ncRNA expressed from the RB1 promoter, ncRNA-RB1, reveals regulatory effects different from the RB1-controlled transcriptional program. ncRNA-RB1 positively regulates the expression of calreticulin (CALR) that in response to certain therapeutic interventions can translocate from the endoplasmic reticulum to the cell surface, hence activating anticancer immune responses. Knockdown of ncRNA-RB1 in tumor cells reduced expression of CALR, impaired the translocation of the protein to the cell surface upon treatment with anthracylines and consequently inhibited the cellular uptake by macrophages. In conclusion, co-transcription of ncRNA-RB1 and RB1 provides a positive link between the expression of the two tumor suppressors RB1 and the immune-relevant CALR protein. This regulatory interplay exemplifies disease-relevant co-regulation of two distinct gene products, in which loss of expression of one oncosuppressor protein entails the abolition of additional tumor-inhibitory mechanisms

    Bim and Bmf Synergize To Induce Apoptosis in Neisseria Gonorrhoeae Infection

    Get PDF
    Bcl-2 family proteins including the pro-apoptotic BH3-only proteins are central regulators of apoptotic cell death. Here we show by a focused siRNA miniscreen that the synergistic action of the BH3-only proteins Bim and Bmf is required for apoptosis induced by infection with Neisseria gonorrhoeae (Ngo). While Bim and Bmf were associated with the cytoskeleton of healthy cells, they both were released upon Ngo infection. Loss of Bim and Bmf from the cytoskeleton fraction required the activation of Jun-N-terminal kinase-1 (JNK-1), which in turn depended on Rac-1. Depletion and inhibition of Rac-1, JNK-1, Bim, or Bmf prevented the activation of Bak and Bax and the subsequent activation of caspases. Apoptosis could be reconstituted in Bim-depleted and Bmf-depleted cells by additional silencing of antiapoptotic Mcl-1 and Bcl-X-L, respectively. Our data indicate a synergistic role for both cytoskeletal-associated BH3-only proteins, Bim, and Bmf, in an apoptotic pathway leading to the clearance of Ngo-infected cells

    Newton's law for Bloch electrons, Klein factors and deviations from canonical commutation relations

    Full text link
    The acceleration theorem for Bloch electrons in a homogenous external field is usually presented using quasiclassical arguments. In quantum mechanical versions the Heisenberg equations of motion for an operator k⃗^(t)\hat {\vec k}(t) are presented mostly without properly defining this operator. This leads to the surprising fact that the generally accepted version of the theorem is incorrect for the most natural definition of k⃗^\hat {\vec k}. This operator is shown not to obey canonical commutation relations with the position operator. A similar result is shown for the phase operators defined via the Klein factors which take care of the change of particle number in the bosonization of the field operator in the description of interacting fermions in one dimension. The phase operators are also shown not to obey canonical commutation relations with the corresponding particle number operators. Implications of this fact are discussed for Tomonaga-Luttinger type models.Comment: 9 pages,1 figur
    • …
    corecore