4 research outputs found
Manifold Alignment Aware Ants:a Markovian process for manifold extraction
Dimensionality reduction and clustering are often used as preliminary steps for many complex machine learning tasks. The presence of noise and outliers can deteriorate the performance of such preprocessing and therefore impair the subsequent analysis tremendously. In manifold learning, several studies indicate solutions for removing background noise or noise close to the structure when the density is substantially higher than that exhibited by the noise. However, in many applications, including astronomical datasets, the density varies alongside manifolds that are buried in a noisy background. We propose a novel method to extract manifolds in the presence of noise based on the idea of Ant colony optimization. In contrast to the existing random walk solutions, our technique captures points which are locally aligned with major directions of the manifold. Moreover, we empirically show that the biologically inspired formulation of ant pheromone reinforces this behavior enabling it to recover multiple manifolds embedded in extremely noisy data clouds. The algorithm's performance is demonstrated in comparison to the state-of-the-art approaches, such as Markov Chain, LLPD, and Disperse, on several synthetic and real astronomical datasets stemming from an N-body simulation of a cosmological volum
Novel techniques of computational intelligence for analysis of astronomical structures
Gravitational forces cause the formation and evolution of a variety of cosmological structures. The detailed investigation and study of these structures is a crucial step towards our understanding of the universe. This thesis provides several solutions for the detection and classification of such structures. In the first part of the thesis, we focus on astronomical simulations, and we propose two algorithms to extract stellar structures. Although they follow different strategies (while the first one is a downsampling method, the second one keeps all samples), both techniques help to build more effective probabilistic models. In the second part, we consider observational data, and the goal is to overcome some of the common challenges in observational data such as noisy features and imbalanced classes. For instance, when not enough examples are present in the training set, two different strategies are used: a) nearest neighbor technique and b) outlier detection technique. In summary, both parts of the thesis show the effectiveness of automated algorithms in extracting valuable information from astronomical databases
Recommended from our members
User-centric anomaly detection in activities of daily living
The current system for providing care to older adults is not sustainable due to its excessive cost. It places an unbearable financial burden on the government and families and pressure on the workforce due to the demand for human carers. Studies have also shown that older adults prefer to be looked after in their homes rather than in a care facility. An automated system of monitoring can provide much-needed support at a lower cost and give peace of mind to relatives.
The focus of the research reported in this thesis is to investigate the concept of abnormality detection in activities of daily living. More precisely, this work is aimed at proposing a dynamic approach for anomaly detection capable of adapting to changes in human behaviour. Abnormalities in daily activities can be an early indication of health decline. Therefore, early detection can inform the families of the need for intervention. Anomalies are often detected by modelling the existing activity data representing the usual behavioural routine of an individual to serve as a baseline model. Subsequent activities deviating from the baseline are then classified as outliers or anomalies. However, existing approaches suffer from a high rate of false prediction due to the static nature and the inability of the approaches to adapt to the changing human behaviour.
The contributions of the research are reported in four main categories. First, a novel ensemble approach termed "Consensus Novelty Detection Ensemble" is proposed. The outlying activities are predicted by computing their normality score using the internal and external consensus vote and the estimated weights of the models in the ensemble. Activities with a score exceeding a threshold estimated using a statistical method based on data distribution are predicted as outliers and vice versa.
Secondly, a similarity measure approach for identifying the likely sources of the ADL anomalies is proposed. While the models can detect anomalous activities, they are unable to identify the source (cause) of the anomaly. Identifying the anomaly source allows for the development of an adaptive system. The approach is based on a pairwise distance measurement of the features extracted from the activity data. Two approaches for performing the similarity measures are presented, namely, One vs One Similarity Measure (OOSM) and One vs All Similarity Measure (OASM). Features of the data with a higher dissimilarity rate are predicted as the source.
To make the proposed model adaptive to the changes in human behaviour, a novel adaptive approach is proposed based on the concept of forgetting factors. This allows the model to forget (discard) outdated activity data and adapt to the current behavioural patterns by incorporating newly verified data. The data verification can be performed by incorporating human feedback into the system. Two forgetting factor approaches are proposed namely; Forgetting Factor based on Data Ageing (FFDD) and Forgetting Factor based on Data Dissimilarity (FFDA). The data ageing forgetting factor discard old behavioural routine based on the age of the activity data, while in the data dissimilarity approach, this is achieved by measuring the similarity of the activity data.
Lastly, the means of utilising an assistive robot as a communication intermediary is explored for incorporating human feedback into the learning process using hand gestures as a communication modality. Experimental data used for the gesture recognition model is collected using a wearable sensor and a 2D camera. The feasibility of utilising the robotic platform as an exercise coach to encourage physical activity and promote a healthy lifestyle is explored. To this end, an exercise training solution is developed for the robotic platform to coach, motivate and assess the older adults in the recommended physical activities