11 research outputs found

    Toward human-in-the-loop PID control based on CACLA reinforcement learning

    Get PDF
    A self-tuning PID control strategy using a reinforcement learning method, called CACLA (Continuous Actor-critic Learning Automata) is proposed in this paper with the example application of humanin-the-loop physical assistive control. An advantage of using reinforcement learning is that it can be done in an online manner. Moreover, since human is a time-variant system. The demonstration also shows that the reinforcement learning framework would be beneficial to give semi-supervision signal to reinforce the positive learning performance in any time-step

    Mapping structural diversity in networks sharing a given degree distribution and global clustering: Adaptive resolution grid search evolution with Diophantine equation-based mutations

    Get PDF
    Methods that generate networks sharing a given degree distribution and global clustering can induce changes in structural properties other than that controlled for. Diversity in structural properties, in turn, can affect the outcomes of dynamical processes operating on those networks. Since exhaustive sampling is not possible, we propose a novel evolutionary framework for mapping this structural diversity. The three main features of this framework are: (a) subgraph-based encoding of networks, (b) exact mutations based on solving systems of Diophantine equations, and (c) heuristic diversity-driven mechanism to drive resolution changes in the MapElite algorithm.We show that our framework can elicit networks with diversity in their higher-order structure and that this diversity affects the behaviour of the complex contagion model. Through a comparison with state of the art clustered network generation methods, we demonstrate that our approach can uncover a comparably diverse range of networks without needing computationally unfeasible mixing times. Further, we suggest that the subgraph-based encoding provides greater confidence in the diversity of higher-order network structure for low numbers of samples and is the basis for explaining our results with complex contagion model. We believe that this framework could be applied to other complex landscapes that cannot be practically mapped via exhaustive sampling

    On the Tradeoff Between Hardware Protection and Optimization Success: A Case Study in Onboard Evolutionary Robotics for Autonomous Parallel Parking

    No full text
    Making the transition from simulation to reality in evolutionary robotics is known to be challenging. What is known as the reality gap, summa-rizes the set of problems that arises when robot controllers have been evolved in simulation and then are transferred to the real robot. In this paper we study an additional problem that is beyond the reality gap. In simulations, the robot needs no protection against damage, while on the real robot that is essential to stay cost-effective. We investigate how the probability of collisions can be minimized by introducing appropri-ate penalties to the fitness function. A change to the fitness function, however, changes the evolutionary dynamics and can influence the opti-mization success negatively. Therefore, we detect a tradeoff between a required hardware protection and a reduced efficiency of the evolutionary optimization process. We study this tradeoff on the basis of a robotics case study in autonomous parallel parking.

    Adaptive Multi-Objective Local Search Algorithms for the Permutation Flowshop Scheduling Problem

    No full text
    International audienceAutomatic algorithm configuration (AAC) is an increasingly critical factor in the design of efficient metaheuristics. AAC was previously successfully applied to multi-objective local search (MOLS) algorithms using offline tools. However, offline approaches are usually very expensive, draw general recommendations regarding algorithm design for a given set of instances, and does generally not allow per-instance adaptation. Online techniques for automatic algorithm control are usually applied to single-objective evolutionary algorithms. In this work we investigate the impact of including control mechanisms to MOLS algorithms on a classical bi-objective permutation flowshop scheduling problem (PFSP), and demonstrate how even simple control mechanisms can complement traditional offline configuration techniques
    corecore