50 research outputs found

    Greed Is Good: Exploration and Exploitation Trade-offs in Bayesian Optimisation

    Get PDF
    The performance of acquisition functions for Bayesian optimisation to locate the global optimum of continuous functions is investigated in terms of the Pareto front between exploration and exploitation. We show that Expected Improvement (EI) and the Upper Confidence Bound (UCB) always select solutions to be expensively evaluated on the Pareto front, but Probability of Improvement is not guaranteed to do so and Weighted Expected Improvement does so only for a restricted range of weights. We introduce two novel ϵ-greedy acquisition functions. Extensive empirical evaluation of these together with random search, purely exploratory, and purely exploitative search on 10 benchmark problems in 1 to 10 dimensions shows that ϵ-greedy algorithms are generally at least as effective as conventional acquisition functions (e.g. EI and UCB), particularly with a limited budget. In higher dimensions ϵ-greedy approaches are shown to have improved performance over conventional approaches. These results are borne out on a real world computational fluid dynamics optimisation problem and a robotics active learning problem. Our analysis and experiments suggest that the most effective strategy, particularly in higher dimensions, is to be mostly greedy, occasionally selecting a random exploratory solution

    Scientists’ warning on affluence

    Get PDF
    For over half a century, worldwide growth in affluence has continuously increased resource use and pollutant emissions far more rapidly than these have been reduced through better technology. The affluent citizens of the world are responsible for most environmental impacts and are central to any future prospect of retreating to safer environmental conditions. We summarise the evidence and present possible solution approaches. Any transition towards sustainability can only be effective if far-reaching lifestyle changes complement technological advancements. However, existing societies, economies and cultures incite consumption expansion and the structural imperative for growth in competitive market economies inhibits necessary societal change

    Are long-term trends changing?

    No full text

    Beta and return: One-day effect

    No full text

    Towards Learning of Safety Knowledge from Human Demonstrations

    No full text
    Future autonomous service robots are intended to operate in open and complex environments. This in turn implies complications ensuring safe operation. The tenor of few available investigations is the need for dynamically assessing operational risks. Furthermore, a new kind of hazards being implicated by the robot’s capability to manipulate the environment occurs: hazardous environmental object interactions. One of the open questions in safety research is integrating safety knowledge into robotic systems, enabling these systems behaving safety-conscious in hazardous situations. In this paper a safety procedure is described, in which learning of safety knowledge from human demonstration is considered. Within the procedure, a task is demonstrated to the robot, which observes object-to-object relations and labels situational data as commanded by the human. Based on this data, several supervised learning techniques are evaluated used for finally extracting safety knowledge. Results indicate that Decision Trees allow interesting opportunities

    Robust Exploration/Exploitation Trade-Offs in Safety-Critical Applications

    No full text
    With regard to future service robots, unsafe exceptional circumstances can occur in complex systems that are hardly to foresee. In this paper, the assumption of having no knowledge about the environment is investigated using reinforcement learning as an option for learning behavior by trial-and-error. In such a scenario, action-selection decisions are made based on future reward predictions for minimizing costs in reaching a goal. It is shown that the selection of safetycritical actions leading to highly negative costs from the environment is directly related to the exploration/exploitation dilemma in temporal-di erence learning. For this, several exploration policies are investigated with regard to worst- and best-case performance in a dynamic environment. Our results show that in contrast to established exploration policies like epsilon-Greedy and Softmax, the recently proposed VDBE-Softmax policy seems to be more appropriate for such applications due to its robustness of the exploration parameter for unexpected situations

    Conceptual Design of a Dynamic Risk-Assessment Server for Autonomous Robots

    No full text
    Future autonomous service robots are intended to operate in open and complex environments. This in turn implies complications ensuring safe operation. The tenor of few available investigations is the need for dynamically assessing operational risks. Furthermore, there is a new kind of hazards being implicated by the robot’s capability to manipulate the environment: Hazardous environmental object interactions. Therefore, the realization of the Dynamic Risk-Assessment approach with special scope on object-interaction risks is addressed in this paper. A server-based architecture is proposed facilitating a feasible integration into robotic systems and realization of software and hardware redundancy as well

    From Solar Cells to Ocean Buoys: Wide-Bandwidth Limits to Absorption by Metaparticle Arrays

    No full text
    In this paper, we develop an approximate wide-bandwidth upper bound to the absorption enhancement in arrays of metaparticles, applicable to general wave-scattering problems and motivated here by ocean-buoy energy extraction. We show that general limits, including the well-known Yablonovitch result in solar cells, arise from reciprocity conditions. The use of reciprocity in the stochastic regime leads us to a corrected diffusion model from which we derive our main result: an analytical prediction of optimal array absorption that closely matches exact simulations for both random and optimized arrays under angle and frequency averaging. This result also enables us to propose and quantify approaches to increase performance through careful particle design and/or using external reflectors. We show, in particular, that the use of membranes on the water's surface allows substantial enhancement. ©2019 American Physical Society.Army Research Office - Cooperative Agreement (W911NF-18-2-0048
    corecore