583 research outputs found

    Batch Reinforcement Learning on the Industrial Benchmark: First Experiences

    Full text link
    The Particle Swarm Optimization Policy (PSO-P) has been recently introduced and proven to produce remarkable results on interacting with academic reinforcement learning benchmarks in an off-policy, batch-based setting. To further investigate the properties and feasibility on real-world applications, this paper investigates PSO-P on the so-called Industrial Benchmark (IB), a novel reinforcement learning (RL) benchmark that aims at being realistic by including a variety of aspects found in industrial applications, like continuous state and action spaces, a high dimensional, partially observable state space, delayed effects, and complex stochasticity. The experimental results of PSO-P on IB are compared to results of closed-form control policies derived from the model-based Recurrent Control Neural Network (RCNN) and the model-free Neural Fitted Q-Iteration (NFQ). Experiments show that PSO-P is not only of interest for academic benchmarks, but also for real-world industrial applications, since it also yielded the best performing policy in our IB setting. Compared to other well established RL techniques, PSO-P produced outstanding results in performance and robustness, requiring only a relatively low amount of effort in finding adequate parameters or making complex design decisions

    A Benchmark Environment Motivated by Industrial Control Problems

    Full text link
    In the research area of reinforcement learning (RL), frequently novel and promising methods are developed and introduced to the RL community. However, although many researchers are keen to apply their methods on real-world problems, implementing such methods in real industry environments often is a frustrating and tedious process. Generally, academic research groups have only limited access to real industrial data and applications. For this reason, new methods are usually developed, evaluated and compared by using artificial software benchmarks. On one hand, these benchmarks are designed to provide interpretable RL training scenarios and detailed insight into the learning process of the method on hand. On the other hand, they usually do not share much similarity with industrial real-world applications. For this reason we used our industry experience to design a benchmark which bridges the gap between freely available, documented, and motivated artificial benchmarks and properties of real industrial problems. The resulting industrial benchmark (IB) has been made publicly available to the RL community by publishing its Java and Python code, including an OpenAI Gym wrapper, on Github. In this paper we motivate and describe in detail the IB's dynamics and identify prototypic experimental settings that capture common situations in real-world industry control problems

    A Benchmark Environment Motivated by Industrial Control Problems

    Full text link
    In the research area of reinforcement learning (RL), frequently novel and promising methods are developed and introduced to the RL community. However, although many researchers are keen to apply their methods on real-world problems, implementing such methods in real industry environments often is a frustrating and tedious process. Generally, academic research groups have only limited access to real industrial data and applications. For this reason, new methods are usually developed, evaluated and compared by using artificial software benchmarks. On one hand, these benchmarks are designed to provide interpretable RL training scenarios and detailed insight into the learning process of the method on hand. On the other hand, they usually do not share much similarity with industrial real-world applications. For this reason we used our industry experience to design a benchmark which bridges the gap between freely available, documented, and motivated artificial benchmarks and properties of real industrial problems. The resulting industrial benchmark (IB) has been made publicly available to the RL community by publishing its Java and Python code, including an OpenAI Gym wrapper, on Github. In this paper we motivate and describe in detail the IB's dynamics and identify prototypic experimental settings that capture common situations in real-world industry control problems

    Achieving near-zero particle generation by simplicity of design—A compliant-mechanism-based gripper for clean-room environments

    Get PDF
    Lab Automation facilitates high-throughput processes and improves reproducibility and efficiency while removing human action, primary source of contaminating particles. Handling poses a risk of contamination due to close contact with the objects. We propose a novel gripper (CrocoGrip) relying on compliant mechanisms to reduce the amount of contaminating particles generated by the gripper rather than preventing their emission, the latter being the common approach in current grippers. Using a structured design approach including simplified motion models and Finite Element Methods, we developed a novel gripper that is actuated by linear solenoids and purely relies on deformation for its motion. As a result, abrasive behavior and, therefore, the generation of particles is reduced without the need for additional sealing. We experimentally proved that the number of particles emitted by the CrocoGrip fulfills the demands of ISO14644 class 5. Due to the monolithic design of the CrocoGrip and, as a result, the need for few components, we achieve a simplicity of design, making cleaning, sterilization and maintenance easy, even for nonexperts. Furthermore, all parts but the two solenoids can be sterilized through autoclaving. The gripping is performed by utilizing the deformation energy of the compliant mechanism, making the gripping energy-efficient and safe. By using interchangeable jaws, the CrocoGrip was able to handle a microplate in SBS-standard, a 50 mL Falcon tube, and a Ø60 mm Petri dish using a robotic arm. CrocoGrip exploits the advantages of compliant mechanisms, especially for applications requiring clean-room environments. This approach of CM-based grippers enables an increase in the cleanliness of handling processes without an increase in system complexity of the gripper to facilitate the lab automation of highly sensitive processes, such as in tissue engineering

    A New Method for Reliable Rotary Tool Shank Use at Speeds above the Critical Speed Range

    Get PDF
    AbstractDynamic tool characteristics affect both the design and the use of rotary tools applied to high-speed cutting. At present, spindle-driven tools commonly operate at speed values below their critical bending speed. Currently, this is the only way to guarantee reliable tool operation. Consequently, the low natural frequency – in particular for tools with a high length-to-diameter ratio – often results in spindle speeds that utilise neither the cutting materials’ nor the machine tools’ potential. The paper elucidates a method to design tool shanks and tools to be run above the critical speed, as well as its implementation. The reliable and safe operation of these special tool shanks is based on the physical effects of self-centering and self-balancing, which come into play above the critical speed

    FDA and EMA Approvals of New Breast Cancer Drugs—A Comparative Regulatory Analysis

    Get PDF
    Breast cancer is the most common cancer in women worldwide and the solid tumor type for which the highest number of drugs have been approved to date. This study examines new drug approvals for breast cancer by the United States Food and Drug Administration (FDA) and the European Medicines Agency (EMA), based on an analysis of regulatory documents from both agencies for the period from 1995 to 2018. Of the 29 breast cancer drugs approved over this time span, 17 received positive decisions from both the FDA and EMA, including all drugs licensed after 2008. Nineteen of the 25 FDA-approved drugs, but none of the EMA approvals, benefited from special regulatory pathways (such as fast track, breakthrough therapy, or priority review). In the U.S.A., four accelerated approvals were granted (of which one, for bevacizumab, was later revoked), while only two drugs received provisional approvals following EMA review. New breast cancer drugs were approved approximately twelve months earlier in the United States than in Europe. These results suggest that a broader use of special regulatory pathways by EMA could help to accelerate access to novel drugs for European breast cancer patients

    Trajectory sampling and finite-size effects in first-principles stopping power calculations

    Full text link
    Real-time time-dependent density functional theory (TDDFT) is presently the most accurate available method for computing electronic stopping powers from first principles. However, obtaining application-relevant results often involves either costly averages over multiple calculations or ad hoc selection of a representative ion trajectory. We consider a broadly applicable, quantitative metric for evaluating and optimizing trajectories in this context. This methodology enables rigorous analysis of the failure modes of various common trajectory choices in crystalline materials. Although randomly selecting trajectories is common practice in stopping power calculations in solids, we show that nearly 30% of random trajectories in an FCC aluminium crystal will not representatively sample the material over the time and length scales feasibly simulated with TDDFT, and unrepresentative choices incur errors of up to 60%. We also show that finite-size effects depend on ion trajectory via "ouroboros" effects beyond the prevailing plasmon-based interpretation, and we propose a cost-reducing scheme to obtain converged results even when expensive core-electron contributions preclude large supercells. This work helps to mitigate poorly controlled approximations in first-principles stopping power calculations, allowing 1-2 order of magnitude cost reductions for obtaining representatively averaged and converged results

    Die Spontanheilungsversuche des vorderen Kreuzbands : eine experimentelle Untersuchung

    Get PDF
    Fragestellung Spontanheilungen von Rupturen des vorderen Kreuzbandes (VKB) wurden wiederholt mittels MRT beschrieben. In grundlegenden tierexperimentellen Untersuchungen wird geschlussfolgert, dass es keine Regeneration gäbe. Die Morphologie der Spontanheilung des VKB wurde im Tiermodell des Kaninchens untersucht. Methoden 32 männliche Kaninchen wurden medial arthrotomiert und das VKB durchtrennt. Die Durchtrennung erfolgte unter Sicht mittels Nr. 15 Skalpell, zusätzlich wurde die vollständige Trennung der Stümpfe durch mehrfache Instrumentenpassage kontrolliert. Eine Resektion von Fasern erfolgte nicht. Je 8 Tiere wurden 2, 4, 8 und 12 Wochen nach dem Eingriff getötet. Die Morphologie der Heilungsverläufe wurde nach Lo et al. [1] klassifiziert, neue Typen der Klassifikation hinzugefügt. Typ B (intrasynoviale Ruptur) und Typ C (knöcherner Ausriss) waren studienbedingt nicht möglich. Die Studie war vom Regierungspräsidium genehmigt. Ergebnisse Die untersuchten Tiere hatten einen unauffälligen postoperativen Verlauf. Bei unterschiedlicher Ausprägung hatten alle Tiere makroskopisch eine Knorpelschädigung. Nach chirurgischer Durchtrennung waren die VKBenden durchschnitten (mop ended Typ A). Dieser Typ wurde bei der Dissektion nicht mehr vorgefunden. Die VKBstümpfe waren bei Dissektion: retrahiert (Typ D), mit dem hinteren Kreuzband verwachsen (E), resorbiert (F), miteinander verbunden = narbengeheilt (G), oder es lag eine Kombination (H) verschiedener Typen vor. Als neue Typen wurden Verwachsungen mit dem Meniskus (I) oder mit der Gelenkkapsel bzw. dem Fettkörper (K) beschrieben. Die VKBstümpfe waren gering (G 1 = 7), deutlich (G2 = 3 bzw. 4 insges. bei zusätzlicher Verwachsung mit dem Innenmeniskus) oder hypertroph (G 3 = 3) miteinander verwachsen. [Tab. 1] Schlussfolgerungen Der VKBriss führt häufig zur Kniegelenksinstabilität und Osteoarthrose. Nach Durchtrennung des VKB wird in Studien der Osteoarthroseforschung im Kaninchenmodell der Befund nie detailliert. In einem systematischen Vergleich der Regenerationsfähigkeit von partiell und komplett durchtrennten VKBs fanden Hefti et al. [2] weder bei skeletall unreifen noch bei jungen erwachsenen Kaninchen eine Regeneration nach kompletter Durchtrennung. Sie berichteten als Regelfall die Resorption nach 3 Monaten, in 2 Fällen Verwachsungen mit dem hinteren Kreuzband. Die hier berichteten Daten sind im Gegensatz zu Hefti's Untersuchungen. Sie stimmen aber gut mit humanen MRT-Untersuchungen und Arthroskopien nach Kreuzbandverletzungen überein. Das VKB versucht durch Verwachsung eine Heilung der VKBstümpfe zu erzielen oder Anschluss an andere Kniegelenksstrukturen zu gewinnen. Somit besteht auch im Tiermodell des Kaninchens eine gewisse Spontanheilungsaktivität des VKB, deren biomechanische Stabilität allerdings oft unzureichend erschien. Eine Kniegelenksarthrose entwickelte sich in den untersuchten Gelenken
    corecore