Search CORE

3,181 research outputs found

Computation Approaches for Continuous Reinforcement Learning Problems

Author: Effraimidis D.
Effraimidis D.
Publication venue
Publication date: 01/01/2016
Field of study

Optimisation theory is at the heart of any control process, where we seek to control the behaviour of a system through a set of actions. Linear control problems have been extensively studied, and optimal control laws have been identified. But the world around us is highly non-linear and unpredictable. For these dynamic systems, which don’t possess the nice mathematical properties of the linear counterpart, the classic control theory breaks and other methods have to be employed. But nature thrives by optimising non-linear and over-complicated systems. Evolutionary Computing (EC) methods exploit nature’s way by imitating the evolution process and avoid to solve the control problem analytically. Reinforcement Learning (RL) from the other side regards the optimal control problem as a sequential one. In every discrete time step an action is applied. The transition of the system to a new state is accompanied by a sole numerical value, the “reward” that designate the quality of the control action. Even though the amount of feedback information is limited into a sole real number, the introduction of the Temporal Difference method made possible to have accurate predictions of the value-functions. This paved the way to optimise complex structures, like the Neural Networks, which are used to approximate the value functions. In this thesis we investigate the solution of continuous Reinforcement Learning control problems by EC methodologies. The accumulated reward of such problems throughout an episode suffices as information to formulate the required measure, fitness, in order to optimise a population of candidate solutions. Especially, we explore the limits of applicability of a specific branch of EC, that of Genetic Programming (GP). The evolving population in the GP case is comprised from individuals, which are immediately translated to mathematical functions, which can serve as a control law. The major contribution of this thesis is the proposed unification of these disparate Artificial Intelligence paradigms. The provided information from the systems are exploited by a step by step basis from the RL part of the proposed scheme and by an episodic basis from GP. This makes possible to augment the function set of the GP scheme with adaptable Neural Networks. In the quest to achieve stable behaviour of the RL part of the system a modification of the Actor-Critic algorithm has been implemented. Finally we successfully apply the GP method in multi-action control problems extending the spectrum of the problems that this method has been proved to solve. Also we investigated the capability of GP in relation to problems from the food industry. These type of problems exhibit also non-linearity and there is no definite model describing its behaviour

WestminsterResearch

Meta-heuristic algorithms in car engine design: a literature survey

Author: Tayarani-N Mohammad-H.
Xu Hongming
Yao Xin
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/10/2015
Field of study

Meta-heuristic algorithms are often inspired by natural phenomena, including the evolution of species in Darwinian natural selection theory, ant behaviors in biology, flock behaviors of some birds, and annealing in metallurgy. Due to their great potential in solving difficult optimization problems, meta-heuristic algorithms have found their way into automobile engine design. There are different optimization problems arising in different areas of car engine management including calibration, control system, fault diagnosis, and modeling. In this paper we review the state-of-the-art applications of different meta-heuristic algorithms in engine management systems. The review covers a wide range of research, including the application of meta-heuristic algorithms in engine calibration, optimizing engine control systems, engine fault diagnosis, and optimizing different parts of engines and modeling. The meta-heuristic algorithms reviewed in this paper include evolutionary algorithms, evolution strategy, evolutionary programming, genetic programming, differential evolution, estimation of distribution algorithm, ant colony optimization, particle swarm optimization, memetic algorithms, and artificial immune system

University of Birmingham Research Portal

Enlighten

Neural and Genetic Control Approaches in Process Engineering

Author: Alfonso Garcia-Cerezo
Inmaculada Garcia-Moral
Javier Fernandez De Canete
Pablo Del Saz
Publication venue: 'IntechOpen'
Publication date: 25/07/2012
Field of study

IntechOpen

Evolutionary robotics and neuroscience

Author: Husbands Phil
Moioli Renan
O'Shea Michael
Philippides Andy
Shim Yoonsik
Vargas Patricia
Publication venue: 'MIT Press - Journals'
Publication date: 01/03/2014
Field of study

No description supplie

Heriot Watt Pure

Sussex Research Online

Computational intelligence techniques for HVAC systems: a review

Author: Ahmad Muhammad
Mourshed Monjur
Rezgui Yacine
Yuce Baris
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 31/03/2016
Field of study

Buildings are responsible for 40% of global energy use and contribute towards 30% of the total CO2 emissions. The drive to reduce energy use and associated greenhouse gas emissions from buildings has acted as a catalyst in the development of advanced computational methods for energy efficient design, management and control of buildings and systems. Heating, ventilation and air conditioning (HVAC) systems are the major source of energy consumption in buildings and an ideal candidate for substantial reductions in energy demand. Significant advances have been made in the past decades on the application of computational intelligence (CI) techniques for HVAC design, control, management, optimization, and fault detection and diagnosis. This article presents a comprehensive and critical review on the theory and applications of CI techniques for prediction, optimization, control and diagnosis of HVAC systems.The analysis of trends reveals the minimization of energy consumption was the key optimization objective in the reviewed research, closely followed by the optimization of thermal comfort, indoor air quality and occupant preferences. Hardcoded Matlab program was the most widely used simulation tool, followed by TRNSYS, EnergyPlus, DOE–2, HVACSim+ and ESP–r. Metaheuristic algorithms were the preferred CI method for solving HVAC related problems and in particular genetic algorithms were applied in most of the studies. Despite the low number of studies focussing on MAS, as compared to the other CI techniques, interest in the technique is increasing due to their ability of dividing and conquering an HVAC optimization problem with enhanced overall performance. The paper also identifies prospective future advancements and research directions

Online Research @ Cardiff

Springer - Publisher Connector

Adaptive Neuro-Fuzzy Systems

Author: Ahmad Taher
Azar
Publication venue: 'IntechOpen'
Publication date: 01/02/2010
Field of study

IntechOpen

Flight Control System Design Optimisation via Genetic Programming

Author: Anna Bourmistrova
Sergey Khantsis
Publication venue: 'IntechOpen'
Publication date: 01/01/2009
Field of study

IntechOpen

Process control for WAAM using computer vision

Author: Zhang Shiyu
Publication venue: Faculty of Engineering and Information Sciences
Publication date: 01/01/2020
Field of study

This study is mainly about the vision system and control algorithm programming for wire arc additive manufacturing (WAAM). Arc additive manufacturing technology is formed by the principle of heat source cladding produced by welders using molten inert gas shielded welding (MIG), tungsten inert gas shielded welding (TIG) and layered plasma welding power supply (PA). It has high deposition efficiency, short manufacturing cycle, low cost, and easy maintenance. Although WAAM has very good uses in various fields, the inability to control the adding process in real time has led to defects in the weld and reduced quality. Therefore, it is necessary to develop the real-time feedback through computer vision and algorithms for WAAM to ensure that the thickness and the width of each layer during the addition process are the same

Research Online

A Comprehensive Survey on Particle Swarm Optimization Algorithm and Its Applications

Author: Genlin Ji
Shuihua Wang
Yudong Zhang
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2015
Field of study

Particle swarm optimization (PSO) is a heuristic global optimization method, proposed originally by Kennedy and Eberhart in 1995. It is now one of the most commonly used optimization techniques. This survey presented a comprehensive investigation of PSO. On one hand, we provided advances with PSO, including its modifications (including quantum-behaved PSO, bare-bones PSO, chaotic PSO, and fuzzy PSO), population topology (as fully connected, von Neumann, ring, star, random, etc.), hybridization (with genetic algorithm, simulated annealing, Tabu search, artificial immune system, ant colony algorithm, artificial bee colony, differential evolution, harmonic search, and biogeography-based optimization), extensions (to multiobjective, constrained, discrete, and binary optimization), theoretical analysis (parameter selection and tuning, and convergence analysis), and parallel implementation (in multicore, multiprocessor, GPU, and cloud computing forms). On the other hand, we offered a survey on applications of PSO to the following eight fields: electrical and electronic engineering, automation control systems, communication theory, operations research, mechanical engineering, fuel and energy, medicine, chemistry, and biology. It is hoped that this survey would be beneficial for the researchers studying PSO algorithms

Crossref

Directory of Open Access Journals

Hierarchically organised genetic algorithm for fuzzy network synthesis

Author: Filloy-García Enrique Rafael
Publication venue: The University of Edinburgh
Publication date: 01/01/2002
Field of study

Edinburgh Research Archive