4 research outputs found
Clustering-based Source-aware Assessment of True Robustness for Learning Models
We introduce a novel validation framework to measure the true robustness of
learning models for real-world applications by creating source-inclusive and
source-exclusive partitions in a dataset via clustering. We develop a
robustness metric derived from source-aware lower and upper bounds of model
accuracy even when data source labels are not readily available. We clearly
demonstrate that even on a well-explored dataset like MNIST, challenging
training scenarios can be constructed under the proposed assessment framework
for two separate yet equally important applications: i) more rigorous learning
model comparison and ii) dataset adequacy evaluation. In addition, our findings
not only promise a more complete identification of trade-offs between model
complexity, accuracy and robustness but can also help researchers optimize
their efforts in data collection by identifying the less robust and more
challenging class labels.Comment: Submitted to UAI 201
Multi-Model Network Intrusion Detection System Using Distributed Feature Extraction and Supervised Learning
Intrusion Detection Systems (IDSs) monitor network traffic and system activities to identify any unauthorized or malicious behaviors. These systems usually leverage the principles of data science and machine learning to detect any deviations from normalcy by learning from the data associated with normal and abnormal patterns. The IDSs continue to suffer from issues like distributed high-dimensional data, inadequate robustness, slow detection, and high false-positive rates (FPRs). We investigate these challenges, determine suitable strategies, and propose relevant solutions based on the appropriate mathematical and computational concepts.
To handle high-dimensional data in a distributed network, we optimize the feature space in a distributed manner using the PCA-based feature extraction method. The experimental results display that the classifiers built upon the features so extracted perform well by giving a similar level of accuracy as given by the ones that use the centrally extracted features. This method also significantly reduces the cumulative time needed for extraction. By utilizing the extracted features, we construct a distributed probabilistic classifier based on Naïve Bayes. Each node counts the local frequencies and passes those on to the central coordinator. The central coordinator accumulates the local frequencies and computes the global frequencies, which are used by the nodes to compute the required prior probabilities to perform classifications. Each node, being evenly trained, is capable of detecting intrusions individually to improve the overall robustness of the system.
We also propose a similarity measure-based classification (SMC) technique that works by computing the cosine similarities between the class-specific frequential weights of the values in an observed instance and the average frequency-based data centroid. An instance is classified into the class whose weights for the values in it share the highest level of similarity with the centroid. SMC contributes alongside Naïve Bayes in a multi-model classification approach, which we introduce to reduce the FPR and improve the detection accuracy. This approach utilizes the similarities associated with each class label determined by SMC and the probabilities associated with each class label determined by Naïve Bayes. The similarities and probabilities are aggregated, separately, to form new features that are used to train and validate a tertiary classifier. We demonstrate that such a multi-model approach can attain a higher level of accuracy compared with the single-model classification techniques.
The contributions made by this dissertation to enhance the scalability, robustness, and accuracy can help improve the efficacy of IDSs
Hierarchical age estimation using enhanced facial features.
Doctor of Philosopy in Computer Science, University of KwaZulu-Natal, Westville, 2018.Ageing is a stochastic, inevitable and uncontrollable process that constantly affect
shape, texture and general appearance of the human face. Humans can easily determine
ones’ gender, identity and ethnicity with highest accuracy as compared to
age. This makes development of automatic age estimation techniques that surpass
human performance an attractive yet challenging task. Automatic age estimation
requires extraction of robust and reliable age discriminative features. Local binary
patterns (LBP) sensitivity to noise makes it insufficiently reliable in capturing age
discriminative features. Although local ternary patterns (LTP) is insensitive to noise,
it uses a single static threshold for all images regardless of varied image conditions.
Local directional patterns (LDP) uses k directional responses to encode image gradient
and disregards not only central pixel in the local neighborhood but also 8 k
directional responses. Every pixel in an image carry subtle information. Discarding
8 k directional responses lead to lose of discriminative texture features. This
study proposes two variations of LDP operator for texture extraction. Significantorientation
response LDP (SOR-LDP) encodes image gradient by grouping eight
directional responses into four pairs. Each pair represents orientation of an edge
with respect to central reference pixel. Values in each pair are compared and the
bit corresponding to the maximum value in the pair is set to 1 while the other is
set to 0. The resultant binary code is converted to decimal and assigned to the central
pixel as its’ SOR-LDP code. Texture features are contained in the histogram of
SOR-LDP encoded image. Local ternary directional patterns (LTDP) first gets the
difference between neighboring pixels and central pixel in 3 3 image region. These
differential values are convolved with Kirsch edge detectors to obtain directional
responses. These responses are normalized and used as probability of an edge occurring
towards a respective direction. An adaptive threshold is applied to derive
LTDP code. The LTDP code is split into its positive and negative LTDP codes. Histograms
of negative and positive LTDP encoded images are concatenated to obtain
texture feature. Regardless of there being evidence of spatial frequency processing
in primary visual cortex, biologically inspired features (BIF) that model visual cortex
uses only scale and orientation selectivity in feature extraction. Furthermore,
these BIF are extracted using holistic (global) pooling across scale and orientations
leading to lose of substantive information. This study proposes multi-frequency BIF
(MF-BIF) where frequency selectivity is introduced in BIF modelling. Local statistical
BIF (LS-BIF) uses local pooling within scale, orientation and frequency in n n
region for BIF extraction. Using Leave-one-person-out (LOPO) validation protocol,
this study investigated performance of proposed feature extractors in age estimation
in a hierarchical way by performing age-group classification using Multi-layer
Perceptron (MLP) followed by within age-group exact age regression using support
vector regression (SVR). Mean absolute error (MAE) and cumulative score (CS) were
used to evaluate performance of proposed face descriptors. Experimental results on
FG-NET ageing dataset show that SOR-LDP, LTDP, MF-BIF and LS-BIF outperform
state-of-the-art feature descriptors in age estimation. Experimental results show that
performing gender discrimination before age-group and age estimation further improves
age estimation accuracies. Shape, appearance, wrinkle and texture features
are simultaneously extracted by visual system in primates for the brain to process
and understand an image or a scene. However, age estimation systems in the literature
use a single feature for age estimation. A single feature is not sufficient enough
to capture subtle age discriminative traits due to stochastic and personalized nature
of ageing. This study propose fusion of different facial features to enhance their
discriminative power. Experimental results show that fusing shape, texture, wrinkle
and appearance result into robust age discriminative features that achieve lower
MAE compared to single feature performance