7 research outputs found

    Proceedings of the Second Joint Technology Workshop on Neural Networks and Fuzzy Logic, volume 2

    Get PDF
    Documented here are papers presented at the Neural Networks and Fuzzy Logic Workshop sponsored by NASA and the University of Texas, Houston. Topics addressed included adaptive systems, learning algorithms, network architectures, vision, robotics, neurobiological connections, speech recognition and synthesis, fuzzy set theory and application, control and dynamics processing, space applications, fuzzy logic and neural network computers, approximate reasoning, and multiobject decision making

    Evolutionary and Reinforcement Fuzzy Control

    Get PDF
    Many modern and classical techniques exist for the design of control systems. However, many real world applications are inherently complex and the application of traditional design and control techniques is limited. In addition, no single design method exists which can be applied to all types of system. Due to this 'deficiency', recent years have seen an exponential increase in the use of methods loosely termed 'computational intelligent techniques' or 'soft- computing techniques'. Such techniques tend to solve problems using a population of individual elements or potential solutions or the flexibility of a network as opposed to using a rigid, single point of computing. Through use of computational redundancies, soft-computing allows unmatched tractability in practical problem solving. The intelligent paradigm most successfully applied to control engineering, is that of fuzzy logic in the form of fuzzy control. The motivation of using fuzzy control is twofold. First, it allows one to incorporate heuristics into the control strategy, such as the model operator actions. Second, it allows nonlinearities to be defined in an intuitive way using rules and interpolations. Although it is an attractive tool, there still exist many problems to be solved in fuzzy control. To date most applications have been limited to relatively simple problems of low dimensionality. This is primarily due to the fact that the design process is very much a trial and error one and is heavily dependent on the quality of expert knowledge provided by the operator. In addition, fuzzy control design is virtually ad hoc, lacking a systematic design procedure. Other problems include those associated with the curse of dimensionality and the inability to learn and improve from experience. While much work has been carried out to alleviate most of these difficulties, there exists a lack of drive and exploration in the last of these points. The objective of this thesis is to develop an automated, systematic procedure for optimally learning fuzzy logic controllers (FLCs), which provides for autonomous and simple implementations. In pursuit of this goal, a hybrid method is to combine the advantages artificial neural networks (ANNs), evolutionary algorithms (EA) and reinforcement learning (RL). This overcomes the deficiencies of conventional EAs that may omit representation of the region within a variable's operating range and that do not in practice achieve fine learning. This method also allows backpropagation when necessary or feasible. It is termed an Evolutionary NeuroFuzzy Learning Intelligent Control technique (ENFLICT) model. Unlike other hybrids, ENFLICT permits globally structural learning and local offline or online learning. The global EA and local neural learning processes should not be separated. Here, the EA learns and optimises the ENFLICT structure while ENFLICT learns the network parameters. The EA used here is an improved version of a technique known as the messy genetic algorithm (mGA), which utilises flexible cellular chromosomes for structural optimisation. The properties of the mGA as compared with other flexible length EAs, are that it enables the addressing of issues such as the curse of dimensionality and redundant genetic information. Enhancements to the algorithm are in the coding and decoding of the genetic information to represent a growing and shrinking network; the defining of the network properties such as neuron activation type and network connectivity; and that all of this information is represented in a single gene. Another step forward taken in this thesis on neurofuzzy learning is that of learning online. Online in this case refers to learning unsupervised and adapting to real time system parameter changes. It is much more attractive because the alternative (supervised offline learning) demands quality learning data which is often expensive to obtain, and unrepresentative of and inaccurate about the real environment. First, the learning algorithm is developed for the case of a given model of the system where the system dynamics are available or can be obtained through, for example, system identification. This naturally leads to the development of a method for learning by directly interacting with the environment. The motivation for this is that usually real world applications tend to be large and complex, and obtaining a mathematical model of the plant is not always possible. For this purpose the reinforcement learning paradigm is utilised, which is the primary learning method of biological systems, systems that can adapt to their environment and experiences, in this thesis, the reinforcement learning algorithm is based on the advantage learning method and has been extended to deal with continuous time systems and online implementations, and which does not use a lookup table. This means that large databases containing the system behaviour need not be constructed, and the procedure can work online where the information available is that of the immediate situation. For complex systems of higher order dimensions, and where identifying the system model is difficult, a hierarchical method has been developed and is based on a hybrid of all the other methods developed. In particular, the procedure makes use of a method developed to work directly with plant step response, thus avoiding the need for mathematical model fitting which may be time-consuming and inaccurate. All techniques developed and contributions in the thesis are illustrated by several case studies, and are validated through simulations

    Autonomous obstacle avoidance and positioning control of mobile robots using fuzzy neural networks

    Get PDF
    Navigation and obstacle avoidance are important tasks in the research field of au- tonomous mobile robots. The challenge tackled in this work is the navigation of a 4- wheeled car-type robot to a desired parking position while avoiding obstacles on the way. The taken approach to solve this problem is based on neural fuzzy techniques. Earlier works resulted in a controller to navigate the robot in a clear environment. It is extended by considering additional parameters in the training process. The learning method used in this training is dynamic backpropagation. For the obstacle avoidance problem an additional neuro-fuzzy controller is set up and trained. It influences the results from the navigation controller to avoid collisions with objects blocking the path. The controller is trained with dynamic backpropagation and a reinforcement learning algorithm called deep deterministic policy gradient.Tesi

    Adaptive dynamic programming with eligibility traces and complexity reduction of high-dimensional systems

    Get PDF
    This dissertation investigates the application of a variety of computational intelligence techniques, particularly clustering and adaptive dynamic programming (ADP) designs especially heuristic dynamic programming (HDP) and dual heuristic programming (DHP). Moreover, a one-step temporal-difference (TD(0)) and n-step TD (TD(位)) with their gradients are utilized as learning algorithms to train and online-adapt the families of ADP. The dissertation is organized into seven papers. The first paper demonstrates the robustness of model order reduction (MOR) for simulating complex dynamical systems. Agglomerative hierarchical clustering based on performance evaluation is introduced for MOR. This method computes the reduced order denominator of the transfer function by clustering system poles in a hierarchical dendrogram. Several numerical examples of reducing techniques are taken from the literature to compare with our work. In the second paper, a HDP is combined with the Dyna algorithm for path planning. The third paper uses DHP with an eligibility trace parameter (位) to track a reference trajectory under uncertainties for a nonholonomic mobile robot by using a first-order Sugeno fuzzy neural network structure for the critic and actor networks. In the fourth and fifth papers, a stability analysis for a model-free action-dependent HDP(位) is demonstrated with batch- and online-implementation learning, respectively. The sixth work combines two different gradient prediction levels of critic networks. In this work, we provide a convergence proofs. The seventh paper develops a two-hybrid recurrent fuzzy neural network structures for both critic and actor networks. They use a novel n-step gradient temporal-difference (gradient of TD(位)) of an advanced ADP algorithm called value-gradient learning (VGL(位)), and convergence proofs are given. Furthermore, the seventh paper is the first to combine the single network adaptive critic with VGL(位). --Abstract, page iv

    Locomotion training of legged robots using hybrid machine learning techniques

    Get PDF
    In this study artificial neural networks and fuzzy logic are used to control the jumping behavior of a three-link uniped robot. The biped locomotion control problem is an increment of the uniped locomotion control. Study of legged locomotion dynamics indicates that a hierarchical controller is required to control the behavior of a legged robot. A structured control strategy is suggested which includes navigator, motion planner, biped coordinator and uniped controllers. A three-link uniped robot simulation is developed to be used as the plant. Neurocontrollers were trained both online and offline. In the case of on-line training, a reinforcement learning technique was used to train the neurocontroller to make the robot jump to a specified height. After several hundred iterations of training, the plant output achieved an accuracy of 7.4%. However, when jump distance and body angular momentum were also included in the control objectives, training time became impractically long. In the case of off-line training, a three-layered backpropagation (BP) network was first used with three inputs, three outputs and 15 to 40 hidden nodes. Pre-generated data were presented to the network with a learning rate as low as 0.003 in order to reach convergence. The low learning rate required for convergence resulted in a very slow training process which took weeks to learn 460 examples. After training, performance of the neurocontroller was rather poor. Consequently, the BP network was replaced by a Cerebeller Model Articulation Controller (CMAC) network. Subsequent experiments described in this document show that the CMAC network is more suitable to the solution of uniped locomotion control problems in terms of both learning efficiency and performance. A new approach is introduced in this report, viz., a self-organizing multiagent cerebeller model for fuzzy-neural control of uniped locomotion is suggested to improve training efficiency. This is currently being evaluated for a possible patent by NASA, Johnson Space Center. An alternative modular approach is also developed which uses separate controllers for each stage of the running stride. A self-organizing fuzzy-neural controller controls the height, distance and angular momentum of the stride. A CMAC-based controller controls the movement of the leg from the time the foot leaves the ground to the time of landing. Because the leg joints are controlled at each time step during flight, movement is smooth and obstacles can be avoided. Initial results indicate that this approach can yield fast, accurate results

    Intelligent Control Strategies for an Autonomous Underwater Vehicle

    Get PDF
    The dynamic characteristics of autonomous underwater vehicles (AUVs) present a control problem that classical methods cannot often accommodate easily. Fundamentally, AUV dynamics are highly non-linear, and the relative similarity between the linear and angular velocities about each degree of freedom means that control schemes employed within other flight vehicles are not always applicable. In such instances, intelligent control strategies offer a more sophisticated approach to the design of the control algorithm. Neurofuzzy control is one such technique, which fuses the beneficial properties of neural networks and fuzzy logic in a hybrid control architecture. Such an approach is highly suited to development of an autopilot for an AUV. Specifically, the adaptive network-based fuzzy inference system (ANFIS) is discussed in Chapter 4 as an effective new approach for neurally tuning course-changing fuzzy autopilots. However, the limitation of this technique is that it cannot be used for developing multivariable fuzzy structures. Consequently, the co-active ANFIS (CANFIS) architecture is developed and employed as a novel multi variable AUV autopilot within Chapter 5, whereby simultaneous control of the AUV yaw and roll channels is achieved. Moreover, this structure is flexible in that it is extended in Chapter 6 to perform on-line control of the AUV leading to a novel autopilot design that can accommodate changing vehicle pay loads and environmental disturbances. Whilst the typical ANFIS and CANFIS structures prove effective for AUV control system design, the well known properties of radial basis function networks (RBFN) offer a more flexible controller architecture. Chapter 7 presents a new approach to fuzzy modelling and employs both ANFIS and CANFIS structures with non-linear consequent functions of composite Gaussian form. This merger of CANFIS and a RBFN lends itself naturally to tuning with an extended form of the hybrid learning rule, and provides a very effective approach to intelligent controller development.The Sea Systems and Platform Integration Sector, Defence Evaluation and Research Agency, Winfrit

    Quality analysis modelling for development of a process controller in resistance spot welding using neural networks techniques

    Get PDF
    Student Number : 9811923K - PhD thesis - School of Mechanical Engineering - Faculty of Engineering and the Built EnvironmentMethods are presented for obtaining models used for predicting welded sample resistance and effective weld current (RMS) for desired weld diameter (weld quality) in the resistance spot welding process. These models were used to design predictive controllers for the welding process. A suitable process model forms an important step in the development and design of process controllers for achieving good weld quality with good reproducibility. Effective current, dynamic resistance and applied electrode force are identified as important input parameters necessary to predict the output weld diameter. These input parameters are used for the process model and design of a predictive controller. A three parameter empirical model with dependent and independent variables was used for curve fitting the nonlinear halfwave dynamic resistance. The estimates of the parameters were used to develop charts for determining overall resistance of samples for any desired weld diameter. Estimating resistance for samples welded in the machines from which dataset obtained were used to plot the chart yielded accurate results. However using these charts to estimate sample resistance for new and unknown machines yielded high estimation error. To improve the prediction accuracy the same set of data generated from the model were used to train four different neural network types. These were the Generalised Feed Forward (GFF) neural network, Multilayer Perceptron (MLP) network, Radial Basis Function (RBF) and Recurrent neural network (RNN). Of the four network types trained, the MLP had the least mean square error for training and cross validation of 0.00037 and 0.00039 respectively with linear correlation coefficient in testing of 0.999 and maximum estimation error range from 0.1% to 3%. A prediction accuracy of about 97% to 99.9%. This model was selected for the design and implementation of the controller for predicting overall sample resistance. Using this predicted overall sample resistance, and applied electrode force, a second model was developed for predicting required effective weld current for any desired weld diameter. The prediction accuracy of this model was in the range of 94% to 99%. The neural network predictive controller was designed using the MLP neural network models. The controller outputs effective current for any desired weld diameter and is observed to track the desired output accurately with same prediction accuracy of the model used which was about 94% to 99%. The controller works by utilizing the neural network output embedded in Microsoft Excel as a digital link library and is able to generate outputs for given inputs on activating the process by the push of a command button
    corecore