50 research outputs found
Pattern Recognition
A wealth of advanced pattern recognition algorithms are emerging from the interdiscipline between technologies of effective visual features and the human-brain cognition process. Effective visual features are made possible through the rapid developments in appropriate sensor equipments, novel filter designs, and viable information processing architectures. While the understanding of human-brain cognition process broadens the way in which the computer can perform pattern recognition tasks. The present book is intended to collect representative researches around the globe focusing on low-level vision, filter design, features and image descriptors, data mining and analysis, and biologically inspired algorithms. The 27 chapters coved in this book disclose recent advances and new ideas in promoting the techniques, technology and applications of pattern recognition
Computational intelligence techniques for missing data imputation
Despite considerable advances in missing data imputation techniques over the last three decades, the
problem of missing data remains largely unsolved. Many techniques have emerged in the literature
as candidate solutions, including the Expectation Maximisation (EM), and the combination of autoassociative
neural networks and genetic algorithms (NN-GA). The merits of both these techniques
have been discussed at length in the literature, but have never been compared to each other. This
thesis contributes to knowledge by firstly, conducting a comparative study of these two techniques..
The significance of the difference in performance of the methods is presented. Secondly, predictive
analysis methods suitable for the missing data problem are presented. The predictive analysis in
this problem is aimed at determining if data in question are predictable and hence, to help in
choosing the estimation techniques accordingly. Thirdly, a novel treatment of missing data for online
condition monitoring problems is presented. An ensemble of three autoencoders together with
hybrid Genetic Algorithms (GA) and fast simulated annealing was used to approximate missing
data. Several significant insights were deduced from the simulation results. It was deduced that for
the problem of missing data using computational intelligence approaches, the choice of optimisation
methods plays a significant role in prediction. Although, it was observed that hybrid GA and Fast
Simulated Annealing (FSA) can converge to the same search space and to almost the same values
they differ significantly in duration. This unique contribution has demonstrated that a particular
interest has to be paid to the choice of optimisation techniques and their decision boundaries.
iii
Another unique contribution of this work was not only to demonstrate that a dynamic programming
is applicable in the problem of missing data, but to also show that it is efficient in addressing the
problem of missing data. An NN-GA model was built to impute missing data, using the principle
of dynamic programing. This approach makes it possible to modularise the problem of missing
data, for maximum efficiency. With the advancements in parallel computing, various modules of
the problem could be solved by different processors, working together in parallel. Furthermore, a
method for imputing missing data in non-stationary time series data that learns incrementally even
when there is a concept drift is proposed. This method works by measuring the heteroskedasticity
to detect concept drift and explores an online learning technique. New direction for research, where
missing data can be estimated for nonstationary applications are opened by the introduction of this
novel method. Thus, this thesis has uniquely opened the doors of research to this area. Many
other methods need to be developed so that they can be compared to the unique existing approach
proposed in this thesis.
Another novel technique for dealing with missing data for on-line condition monitoring problem was
also presented and studied. The problem of classifying in the presence of missing data was addressed,
where no attempts are made to recover the missing values. The problem domain was then extended
to regression. The proposed technique performs better than the NN-GA approach, both in accuracy
and time efficiency during testing. The advantage of the proposed technique is that it eliminates
the need for finding the best estimate of the data, and hence, saves time. Lastly, instead of using
complicated techniques to estimate missing values, an imputation approach based on rough sets is
explored. Empirical results obtained using both real and synthetic data are given and they provide a
valuable and promising insight to the problem of missing data. The work, has significantly confirmed
that rough sets can be reliable for missing data estimation in larger and real databases
Multiple adaptive mechanisms for predictive models on streaming data.
Making predictions on non-stationary streaming data remains a challenge in many application areas. Changes in data may cause a decrease in predictive accuracy, which in a streaming setting require a prompt response. In recent years many adaptive predictive models have been proposed for dealing with these issues. Most of these methods use more than one adaptive mechanism, deploying all of them at the same time at regular intervals or in some other fixed manner. However, this manner is often determined in an ad-hoc way, as the effects of adaptive mechanisms are largely unexplored. This thesis therefore investigates different aspects of adaptation with multiple adaptive mechanisms with the aim to increase knowledge in the area, and propose heuristic approaches for more accurate adaptive predictive models. This is done by systematising and formalising the “adaptive mechanism” notion, proposing a categorisation of adaptive mechanisms and a metric to measure their usefulness, comparing the results after deployment of different orders of adaptive mechanisms during the run of the predictive method, and suggesting techniques on how to select the most appropriate adaptive mechanisms. The literature review suggests that during the prediction process, adaptive mechanisms are selected to be deployed in a certain order which is usually fixed beforehand at the design time of the algorithm. For this reason, it was investigated whether changing the selection method for the adaptive mechanisms significantly affects predictive accuracy and whether there are certain deployment orders which provide better results than others. Commonly used adaptive mechanism selection methods are then examined and new methods are proposed. A novel regression ensemble method which uses several common adaptive mechanisms has been developed to be used as a vehicle for the experimentation. The predictive accuracy and behaviour of adaptive mechanisms while predicting on different real world datasets from the process industry were analysed. Empirical results suggest that different selection of adaptive mechanisms result in significantly different performance. It has been found that while some adaptive mechanisms adapt the predictive model better than others, there is none which is the best at all times. Finally, flexible orders of adaptive mechanisms generated using the proposed selection techniques often result in significantly more accurate models than fixed orders commonly used in literature
Neuroengineering of Clustering Algorithms
Cluster analysis can be broadly divided into multivariate data visualization, clustering algorithms, and cluster validation. This dissertation contributes neural network-based techniques to perform all three unsupervised learning tasks. Particularly, the first paper provides a comprehensive review on adaptive resonance theory (ART) models for engineering applications and provides context for the four subsequent papers. These papers are devoted to enhancements of ART-based clustering algorithms from (a) a practical perspective by exploiting the visual assessment of cluster tendency (VAT) sorting algorithm as a preprocessor for ART offline training, thus mitigating ordering effects; and (b) an engineering perspective by designing a family of multi-criteria ART models: dual vigilance fuzzy ART and distributed dual vigilance fuzzy ART (both of which are capable of detecting complex cluster structures), merge ART (aggregates partitions and lessens ordering effects in online learning), and cluster validity index vigilance in fuzzy ART (features a robust vigilance parameter selection and alleviates ordering effects in offline learning). The sixth paper consists of enhancements to data visualization using self-organizing maps (SOMs) by depicting in the reduced dimension and topology-preserving SOM grid information-theoretic similarity measures between neighboring neurons. This visualization\u27s parameters are estimated using samples selected via a single-linkage procedure, thereby generating heatmaps that portray more homogeneous within-cluster similarities and crisper between-cluster boundaries. The seventh paper presents incremental cluster validity indices (iCVIs) realized by (a) incorporating existing formulations of online computations for clusters\u27 descriptors, or (b) modifying an existing ART-based model and incrementally updating local density counts between prototypes. Moreover, this last paper provides the first comprehensive comparison of iCVIs in the computational intelligence literature --Abstract, page iv
Machine Learning
Machine Learning can be defined in various ways related to a scientific domain concerned with the design and development of theoretical and implementation tools that allow building systems with some Human Like intelligent behavior. Machine learning addresses more specifically the ability to improve automatically through experience
Combined optimization algorithms applied to pattern classification
Accurate classification by minimizing the error on test samples is the main
goal in pattern classification. Combinatorial optimization is a well-known
method for solving minimization problems, however, only a few examples of
classifiers axe described in the literature where combinatorial optimization is
used in pattern classification. Recently, there has been a growing interest
in combining classifiers and improving the consensus of results for a greater
accuracy. In the light of the "No Ree Lunch Theorems", we analyse the combination
of simulated annealing, a powerful combinatorial optimization method
that produces high quality results, with the classical perceptron algorithm.
This combination is called LSA machine. Our analysis aims at finding paradigms
for problem-dependent parameter settings that ensure high classifica,
tion results. Our computational experiments on a large number of benchmark
problems lead to results that either outperform or axe at least competitive to
results published in the literature. Apart from paxameter settings, our analysis
focuses on a difficult problem in computation theory, namely the network
complexity problem. The depth vs size problem of neural networks is one of
the hardest problems in theoretical computing, with very little progress over
the past decades. In order to investigate this problem, we introduce a new
recursive learning method for training hidden layers in constant depth circuits.
Our findings make contributions to a) the field of Machine Learning, as the
proposed method is applicable in training feedforward neural networks, and to
b) the field of circuit complexity by proposing an upper bound for the number
of hidden units sufficient to achieve a high classification rate. One of the major
findings of our research is that the size of the network can be bounded by
the input size of the problem and an approximate upper bound of 8 + √2n/n
threshold gates as being sufficient for a small error rate, where n := log/SL
and SL is the training set
Evolutionary multi-objective optimization in uncertain environments
Ph.DDOCTOR OF PHILOSOPH
Time series forecasting methodologies for electricity supply systems
Forecasting is an essential function in the electricity supply industry. Electricity demand forecasting is performed on number of different time-scales depending on the function for which they are required. In the short term (hourly) forecasts of electricity demand are required for the safe and efficient operation of the power system. Medium term forecasts (weekly) are needed for economic planning and long term (yearly) forecasts are required for deciding on system generation and transmission expansion plans. In recent years the electricity supply industry in some countries has undergone significant changes mainly due to a levelling off in the growth of electricity demand and also due to technological advances. There has been a move toward the existence of a number of smaller generating companies and the emergence of a competitors market has resulted. These changes in the structure of the industry have led to new requirements in the area of forecasting, where forecasts are now required on a small time-scale over a longer forecasting horizon, for example, the production of hourly forecasts over a period of a month.
The thesis presents a novel approach to the solution of the production of short term forecasts over a relatively long term forecast horizon. The mathematical formulation of the technique is presented and an application procedure is developed. Two applications of the technique are given and the issues involved in the implementation investigated. In addition, the production of weekly electricity demand forecasts using the optimal form of the available weather variables is investigated. The value of using such a variable in cases where it is not a dominant influencing factor in the system is assessed. The application of neural networks to the problem of weekly electricity demand forecasting is examined. Neural networks are also applied to the problem of the production of both aggregate and disaggregate electricity sales forecasts for up to five years in advance. Conclusions regarding the methodologies presented in the thesis are drawn and directions for future works are considered
Study of Adaptation Methods Towards Advanced Brain-computer Interfaces
Ph.DDOCTOR OF PHILOSOPH