21 research outputs found
Cascade of classifier ensembles for reliable medical image classification
Medical image analysis and recognition is one of the most important tools in modern medicine. Different types of imaging technologies such as X-ray, ultrasonography, biopsy, computed tomography and optical coherence tomography have been widely used in clinical diagnosis for various kinds of diseases. However, in clinical applications, it is usually time consuming to examine an image manually. Moreover, there is always a subjective element related to the pathological examination of an image. This produces the potential risk of a doctor to make a wrong decision. Therefore, an automated technique will provide valuable assistance for physicians. By utilizing techniques from machine learning and image analysis, this thesis aims to construct reliable diagnostic models for medical image data so as to reduce the problems faced by medical experts in image examination. Through supervised learning of the image data, the diagnostic model can be constructed automatically. The process of image examination by human experts is very difficult to simulate, as the knowledge of medical experts is often fuzzy and not easy to be quantified. Therefore, the problem of automatic diagnosis based on images is usually converted to the problem of image classification. For the image classification tasks, using a single classifier is often hard to capture all aspects of image data distributions. Therefore, in this thesis, a classifier ensemble based on random subspace method is proposed to classify microscopic images. The multi-layer perceptrons are used as the base classifiers in the ensemble. Three types of feature extraction methods are selected for microscopic image description. The proposed method was evaluated on two microscopic image sets and showed promising results compared with the state-of-art results. In order to address the classification reliability in biomedical image classification problems, a novel cascade classification system is designed. Two random subspace based classifier ensembles are serially connected in the proposed system. In the first stage of the cascade system, an ensemble of support vector machines are used as the base classifiers. The second stage consists of a neural network classifier ensemble. Using the reject option, the images whose classification results cannot achieve the predefined rejection threshold at the current stage will be passed to the next stage for further consideration. The proposed cascade system was evaluated on a breast cancer biopsy image set and two UCI machine learning datasets, the experimental results showed that the proposed method can achieve high classification reliability and accuracy with small rejection rate. Many computer aided diagnosis systems face the problem of imbalance data. The datasets used for diagnosis are often imbalanced as the number of normal cases is usually larger than the number of the disease cases. Classifiers that generalize over the data are not the most appropriate choice in such an imbalanced situation. To tackle this problem, a novel one-class classifier ensemble is proposed. The Kernel Principle Components are selected as the base classifiers in the ensemble; the base classifiers are trained by different types of image features respectively and then combined using a product combining rule. The proposed one-class classifier ensemble is also embedded into the cascade scheme to improve classification reliability and accuracy. The proposed method was evaluated on two medical image sets. Favorable results were obtained comparing with the state-of-art results
Pattern Recognition
Pattern recognition is a very wide research field. It involves factors as diverse as sensors, feature extraction, pattern classification, decision fusion, applications and others. The signals processed are commonly one, two or three dimensional, the processing is done in real- time or takes hours and days, some systems look for one narrow object class, others search huge databases for entries with at least a small amount of similarity. No single person can claim expertise across the whole field, which develops rapidly, updates its paradigms and comprehends several philosophical approaches. This book reflects this diversity by presenting a selection of recent developments within the area of pattern recognition and related fields. It covers theoretical advances in classification and feature extraction as well as application-oriented works. Authors of these 25 works present and advocate recent achievements of their research related to the field of pattern recognition
Pattern Recognition
A wealth of advanced pattern recognition algorithms are emerging from the interdiscipline between technologies of effective visual features and the human-brain cognition process. Effective visual features are made possible through the rapid developments in appropriate sensor equipments, novel filter designs, and viable information processing architectures. While the understanding of human-brain cognition process broadens the way in which the computer can perform pattern recognition tasks. The present book is intended to collect representative researches around the globe focusing on low-level vision, filter design, features and image descriptors, data mining and analysis, and biologically inspired algorithms. The 27 chapters coved in this book disclose recent advances and new ideas in promoting the techniques, technology and applications of pattern recognition
Influence modelling and learning between dynamic bayesian networks using score-based structure learning
A Ph.D. thesis submitted to the Faculty of Science, University of the Witwatersrand,
in fulfillment of the requirements for the degree of Doctor of Philosophy in Computer
Science
May 2018Although partially observable stochastic processes are ubiquitous in many fields of science,
little work has been devoted to discovering and analysing the means by which several such
processes may interact to influence each other. In this thesis we extend probabilistic structure
learning between random variables to the context of temporal models which represent
partially observable stochastic processes. Learning an influence structure and distribution
between processes can be useful for density estimation and knowledge discovery.
A common approach to structure learning, in observable data, is score-based structure
learning, where we search for the most suitable structure by using a scoring metric to value
structural configurations relative to the data. Most popular structure scores are variations on
the likelihood score which calculates the probability of the data given a potential structure.
In observable data, the decomposability of the likelihood score, which is the ability to
represent the score as a sum of family scores, allows for efficient learning procedures and
significant computational saving. However, in incomplete data (either by latent variables or
missing samples), the likelihood score is not decomposable and we have to perform
inference to evaluate it. This forces us to use non-linear optimisation techniques to optimise
the likelihood function. Furthermore, local changes to the network can affect other parts of
the network, which makes learning with incomplete data all the more difficult.
We define two general types of influence scenarios: direct influence and delayed influence
which can be used to define influence around richly structured spaces; consisting of
multiple processes that are interrelated in various ways. We will see that although it is
possible to capture both types of influence in a single complex model by using a setting of
the parameters, complex representations run into fragmentation issues. This is handled by
extending the language of dynamic Bayesian networks to allow us to construct single
compact models that capture the properties of a system’s dynamics, and produce influence
distributions dynamically.
The novelty and intuition of our approach is to learn the optimal influence structure in
layers. We firstly learn a set of independent temporal models, and thereafter, optimise a
structure score over possible structural configurations between these temporal models. Since
the search for the optimal structure is done using complete data we can take advantage of
efficient learning procedures from the structure learning literature. We provide the
following contributions: we (a) introduce the notion of influence between temporal models;
(b) extend traditional structure scores for random variables to structure scores for temporal
models; (c) provide a complete algorithm to recover the influence structure between
temporal models; (d) provide a notion of structural assembles to relate temporal models for
types of influence; and finally, (e) provide empirical evidence for the effectiveness of our
method with respect to generative ground-truth distributions.
The presented results emphasise the trade-off between likelihood of an influence structure to
the ground-truth and the computational complexity to express it. Depending on the
availability of samples we might choose different learning methods to express influence
relations between processes. On one hand, when given too few samples, we may choose to
learn a sparse structure using tree-based structure learning or even using no influence
structure at all. On the other hand, when given an abundant number of samples, we can use
penalty-based procedures that achieve rich meaningful representations using local search
techniques.
Once we consider high-level representations of dynamic influence between temporal models,
we open the door to very rich and expressive representations which emphasise the
importance of knowledge discovery and density estimation in the temporal setting.MT 201
Free-text keystroke dynamics authentication with a reduced need for training and language independency
This research aims to overcome the drawback of the large amount of training data required
for free-text keystroke dynamics authentication. A new key-pairing method, which is based
on the keyboard’s key-layout, has been suggested to achieve that. The method extracts
several timing features from specific key-pairs. The level of similarity between a user’s
profile data and his or her test data is then used to decide whether the test data was provided
by the genuine user. The key-pairing technique was developed to use the smallest amount of
training data in the best way possible which reduces the requirement for typing long text in
the training stage. In addition, non-conventional features were also defined and extracted
from the input stream typed by the user in order to understand more of the users typing
behaviours. This helps the system to assemble a better idea about the user’s identity from the
smallest amount of training data. Non-conventional features compute the average of users
performing certain actions when typing a whole piece of text. Results were obtained from the
tests conducted on each of the key-pair timing features and the non-conventional features,
separately. An FAR of 0.013, 0.0104 and an FRR of 0.384, 0.25 were produced by the timing
features and non-conventional features, respectively. Moreover, the fusion of these two
feature sets was utilized to enhance the error rates. The feature-level fusion thrived to reduce
the error rates to an FAR of 0.00896 and an FRR of 0.215 whilst decision-level fusion
succeeded in achieving zero FAR and FRR. In addition, keystroke dynamics research suffers
from the fact that almost all text included in the studies is typed in English. Nevertheless, the
key-pairing method has the advantage of being language-independent. This allows for it to be
applied on text typed in other languages. In this research, the key-pairing method was applied
to text in Arabic. The results produced from the test conducted on Arabic text were similar to
those produced from English text. This proves the applicability of the key-pairing method on
a language other than English even if that language has a completely different alphabet and
characteristics. Moreover, experimenting with texts in English and Arabic produced results
showing a direct relation between the users’ familiarity with the language and the
performance of the authentication system
Epilepsy
With the vision of including authors from different parts of the world, different educational backgrounds, and offering open-access to their published work, InTech proudly presents the latest edited book in epilepsy research, Epilepsy: Histological, electroencephalographic, and psychological aspects. Here are twelve interesting and inspiring chapters dealing with basic molecular and cellular mechanisms underlying epileptic seizures, electroencephalographic findings, and neuropsychological, psychological, and psychiatric aspects of epileptic seizures, but non-epileptic as well