18 research outputs found
EDEN: Evolutionary Deep Networks for Efficient Machine Learning
Deep neural networks continue to show improved performance with increasing
depth, an encouraging trend that implies an explosion in the possible
permutations of network architectures and hyperparameters for which there is
little intuitive guidance. To address this increasing complexity, we propose
Evolutionary DEep Networks (EDEN), a computationally efficient
neuro-evolutionary algorithm which interfaces to any deep neural network
platform, such as TensorFlow. We show that EDEN evolves simple yet successful
architectures built from embedding, 1D and 2D convolutional, max pooling and
fully connected layers along with their hyperparameters. Evaluation of EDEN
across seven image and sentiment classification datasets shows that it reliably
finds good networks -- and in three cases achieves state-of-the-art results --
even on a single GPU, in just 6-24 hours. Our study provides a first attempt at
applying neuro-evolution to the creation of 1D convolutional networks for
sentiment analysis including the optimisation of the embedding layer.Comment: 7 pages, 3 figures, 3 tables and see video
https://vimeo.com/23451009
Data classification using genetic programming.
Master of Science in Computer Science.Genetic programming (GP), a field of artificial intelligence, is an evolutionary algorithm
which evolves a population of trees which represent programs. These programs
are used to solve problems. This dissertation investigates the use of genetic programming
for data classification. In machine learning, data classification is the process
of allocating a class label to an instance of data. A classifier is created in order to
perform these allocations. Several studies have investigated the use of GP to solve
data classification problems. These studies have shown that GP is able to create
classifiers with high classification accuracies. However, there are certain aspects
which have not previously been investigated.
Five areas were investigated in this dissertation. The first was an investigation
into how discretisation could be incorporated into a GP algorithm. An adaptive
discretisation algorithm was proposed, and outperformed certain existing methods.
The second was a comparison of GP representations for binary data classification.
The findings indicated that from the representations examined (arithmetic trees,
decision trees, and logical trees), the decision trees performed the best. The third
was to investigate the use of the encapsulation genetic operator and its effect on
data classification. The findings revealed that an improvement in both training and
test results was achieved when encapsulation was incorporated. The fourth was an
investigative analysis of several hybridisations of a GP algorithm with a genetic algorithm
in order to evolve a population of ensembles. Four methods were proposed and
these methods outperformed certain existing GP and ensemble methods. Finally,
the fifth area was to investigate an ensemble construction method for classification.
In this approach GP evolved a single ensemble. The proposed method resulted in
an improvement in training and test accuracy when compared to the standard GP
algorithm.
The methods proposed in this dissertation were tested on publicly available data
sets, and the results were statistically tested in order to determine the effectiveness
of the proposed approaches
Evolutionary deep learning
The primary objective of this thesis is to investigate whether evolutionary concepts can improve the performance, speed and convenience of algorithms in various active areas of machine learning research. Deep neural networks are exhibiting an explosion in the number of parameters that need to be trained, as well as the number of permutations of possible network architectures and hyper-parameters. There is little guidance on how to choose these and brute-force experimentation is prohibitively time consuming. We show that evolutionary algorithms can help tame this explosion of freedom, by developing an algorithm that robustly evolves near optimal deep neural network architectures and hyper-parameters across a wide range of image and sentiment classification problems. We further develop an algorithm that automatically determines whether a given data science problem is of classification or regression type, successfully choosing the correct problem type with more than 95% accuracy. Together these algorithms show that a great deal of the current "art" in the design of deep learning networks - and in the job of the data scientist - can be automated. Having discussed the general problem of optimising deep learning networks the thesis moves on to a specific application: the automated extraction of human sentiment from text and images of human faces. Our results reveal that our approach is able to outperform several public and/or commercial text sentiment analysis algorithms using an evolutionary algorithm that learned to encode and extend sentiment lexicons. A second analysis looked at using evolutionary algorithms to estimate text sentiment while simultaneously compressing text data. An extensive analysis of twelve sentiment datasets reveal that accurate compression is possible with 3.3% loss in classification accuracy even with 75% compression of text size, which is useful in environments where data volumes are a problem. Finally, the thesis presents improvements to automated sentiment analysis of human faces to identify emotion, an area where there has been a tremendous amount of progress using convolutional neural networks. We provide a comprehensive critique of past work, highlight recommendations and list some open, unanswered questions in facial expression recognition using convolutional neural networks. One serious challenge when implementing such networks for facial expression recognition is the large number of trainable parameters which results in long training times. We propose a novel method based on evolutionary algorithms, to reduce the number of trainable parameters whilst simultaneously retaining classification performance, and in some cases achieving superior performance. We are robustly able to reduce the number of parameters on average by 95% with no loss in classification accuracy. Overall our analyses show that evolutionary algorithms are a valuable addition to machine learning in the deep learning era: automating, compressing and/or improving results significantly, depending on the desired goal