Search CORE

256 research outputs found

Machine Learning for the Diagnosis of Autism Spectrum Disorder

Author: Mostafa Sakib 1994-
Publication venue: 'University of Saskatchewan Library'
Publication date: 18/10/2019
Field of study

Autism Spectrum Disorder (ASD) is a neurological disorder. It refers to a wide range of behavioral and social abnormality and causes problems with social skills, repetitive behaviors, speech, and nonverbal communication. Even though there is no exact cure to ASD, an early diagnosis can help the patient take precautionary steps. Diagnosis of ASD has been of great interest recently, as researchers are yet to find a specific biomarker to detect the disease successfully. For the diagnosis of ASD, subjects need to go through behavioral observation and interview, which are not accurate sometimes. Also, there is a lack of dissimilarity between neuroimages of ASD subjects and healthy control (HC) subjects which make the use of neuroimages difficult for the diagnosis. So, machine learning-based approaches to diagnose ASD are becoming popular day by day. In the machine learning-based approach, features are extracted either from the functional MRI images or the structural MRI images to build the models. In this study at first, I created brain networks from the resting-state functional MRI (rs-fMRI) images, by using the 264 regions based parcellation scheme. These 264 regions capture the functional activity of the brain more accurately compared to regions defined in other parcellation schemes. Next, I extracted spectrum as a raw feature and combined it with other network based topological centralities: assortativity, clustering coefficient, the average degree. By applying a feature selection algorithm on the extracted features, I reduced the dimension of the features to cope up with overfitting. Then I used the selected features in support vector machine (SVM), K-nearest neighbor (KNN), linear discriminant analysis (LDA), and logistic regression (LR) for the diagnosis of ASD. Using the proposed method on Autism Brain Imaging Data Exchange (ABIDE) I achieved the classification accuracy of 78.4% for LDA, 77.0% for LR, 73.5% for SVM, and 73.8% for KNN. Next, I built a deep neural network model for the classification and feature selection using the autoencoder. In this approach, I used the previously defined features to build the DNN classifier. The DNN classifier is pre-trained using the autoencoder. Due to the pre-training, there has been a significant increase in the performance of the DNN classifier. I also proposed an autoencoder based feature selector. The latent space representation of the autoencoder is used to create a discriminate and compressed representation of the features. To make a more discriminate representation, the autoencoder is pre-trained with the DNN classifier. The classification accuracy of the DNN classifier and the autoencoder based feature selector is 79.2% and 74.6%, respectively. Finally, I studied the structural MRI images and proposed a convolutional autoencoder (CAE) based classification model. The T1-weighted MRI images without any pre-processing are used in this study. As the effect of age is very important when studying the structural images for the diagnosis of ASD, I used the ABIDE 1 dataset, which covers subjects with a wide range of ages. Using the proposed CAE based diagnosis method, I achieved a classification accuracy of 96.6%, which is better than any other study for the diagnosis of ASD using the ABIDE 1 dataset. The results of this thesis demonstrate that the spectrum of the brain networks is an essential feature for the diagnosis of ASD and rather than extracting features from the structural MRI image a more efficient way is to use the images directly into deep learning models. The proposed studies in this thesis can help to build an early diagnosis model for ASD

University of Saskatchewan Research Archive

Constructive Incremental Learning for Fault Diagnosis of Rolling Bearings with Ensemble Domain Adaptation

Author: Dai Wei
Liu Jiang
Publication venue
Publication date: 28/08/2023
Field of study

Given the prevalence of rolling bearing fault diagnosis as a practical issue across various working conditions, the limited availability of samples compounds the challenge. Additionally, the complexity of the external environment and the structure of rolling bearings often manifests faults characterized by randomness and fuzziness, hindering the effective extraction of fault characteristics and restricting the accuracy of fault diagnosis. To overcome these problems, this paper presents a novel approach termed constructive Incremental learning-based ensemble domain adaptation (CIL-EDA) approach. Specifically, it is implemented on stochastic configuration networks (SCN) to constructively improve its adaptive performance in multi-domains. Concretely, a cloud feature extraction method is employed in conjunction with wavelet packet decomposition (WPD) to capture the uncertainty of fault information from multiple resolution aspects. Subsequently, constructive Incremental learning-based domain adaptation (CIL-DA) is firstly developed to enhance the cross-domain learning capability of each hidden node through domain matching and construct a robust fault classifier by leveraging limited labeled data from both target and source domains. Finally, fault diagnosis results are obtained by a majority voting of CIL-EDA which integrates CIL-DA and parallel ensemble learning. Experimental results demonstrate that our CIL-DA outperforms several domain adaptation methods and CIL-EDA consistently outperforms state-of-art fault diagnosis methods in few-shot scenarios

arXiv.org e-Print Archive

High Definition Metrology based Process Control: Measurement System Analysis and Process Monitoring.

Author: Puchala Saumuy Suriano
Publication venue
Publication date: 01/01/2013
Field of study

Process control in high precision machining necessitates high-definition metrology (HDM) systems that provide fine resolution data needed to characterize surface shape. HDM data is critical for the evaluation of process surface variation, as it reveals local surface patterns that are undetectable using low definition metrology (LDM) systems. Monitoring of the part-to-part variation of these patterns identified by HDM enables the detection of abnormal surface variation and the degradation of process conditions. HDM systems present many opportunities for surface variation reduction. However, there are challenges to using HDM data for process control. Conventional HDM systems are inefficient and may take a long time to measure a part, such that sufficient samples cannot be obtained for process control purposes. In addition, conventional monitoring methods are difficult to implement due to the high density of data. A new study uncovered significant cross-correlations between part surface height and process variables in an automotive engine milling process. This dissertation aims to apply new insights gained from HDM to develop algorithms and methods for surface variation control, specifically: - Surface modeling through fusion of process variables and HDM data: An improved surface model is developed by incorporating process and multi-resolution data through spatial and cross-correlation to increase prediction accuracy and reduce the amount of HDM measurements necessary for process control. - Measurement system analysis for HDM using: A method to effectively estimate the gage capability for HDM systems is proposed. - Surface variation monitoring using HDM data: A sequential monitoring framework is developed to monitor surface variations as reflected by HDM data. Based on the surface data-process fusion model, a progressive monitoring algorithm under a Bayesian framework is developed to monitor surface variations when limited HDM measurements are available. - Multistage modeling and monitoring of HDM Data: A morphing-based approach is proposed to model process multistage interdependence. A new multistage monitoring procedure is developed based on the morphing model. The research presented in this dissertation will aid in transforming quality control practices from dimensional variation reduction to surface shape variation control. The proposed HDM data monitoring algorithms can be extended to other high precision manufacturing processes.PHDIndustrial & Operations EngineeringUniversity of Michigan, Horace H. Rackham School of Graduate Studieshttp://deepblue.lib.umich.edu/bitstream/2027.42/99874/1/ssuriano_1.pd

Data mining methodologies for supporting engineers during system identification

Author: Saitta Sandro
Publication venue: Lausanne, EPFL
Publication date: 07/02/2008
Field of study

Data alone are worth almost nothing. While data collection is increasing exponentially worldwide, a clear distinction between retrieving data and obtaining knowledge has to be made. Data are retrieved while measuring phenomena or gathering facts. Knowledge refers to data patterns and trends that are useful for decision making. Data interpretation creates a challenge that is particularly present in system identification, where thousands of models may explain a given set of measurements. Manually interpreting such data is not reliable. One solution is to use data mining. This thesis thus proposes an integration of techniques from data mining, a field of research where the aim is to find knowledge from data, into an existing multiple-model system identification methodology. It is shown that, within a framework for decision support, data mining techniques constitute a valuable tool for engineers performing system identification. For example, clustering techniques group similar models together in order to guide subsequent decisions since they might indicate possible states of a structure. A main issue concerns the number of clusters, which, usually, is unknown. For determining the correct number of clusters in data and estimating the quality of a clustering algorithm, a score function is proposed. The score function is a reliable index for estimating the number of clusters in a given data set, thus increasing understanding of results. Furthermore, useful information for engineers who perform system identification is achieved through the use of feature selection techniques. They allow selection of relevant parameters that explain candidate models. The core algorithm is a feature selection strategy based on global search. In addition to providing information about the candidate model space, data mining is found to be a valuable tool for supporting decisions related to subsequent sensor placement. When integrated into a methodology for iterative sensor placement, clustering is found to provide useful support through providing a rational basis for decisions related to subsequent sensor placement on existing structures. Greedy and global search strategies should be selected according to the context. Experiments show that whereas global search is more efficient for initial sensor placement, a greedy strategy is more suitable for iterative sensor placement

MACHINERY ANOMALY DETECTION UNDER INDETERMINATE OPERATING CONDITIONS

Author: Tian Jing
Publication venue
Publication date: 01/01/2018
Field of study

Anomaly detection is a critical task in system health monitoring. Current practice of anomaly detection in machinery systems is still unsatisfactory. One issue is with the use of features. Some features are insensitive to the change of health, and some are redundant with each other. These insensitive and redundant features in the data mislead the detection. Another issue is from the influence of operating conditions, where a change in operating conditions can be mistakenly detected as an anomalous state of the system. Operating conditions are usually changing, and they may not be readily identified. They contribute to false positive detection either from non-predictive features driven by operating conditions, or from influencing predictive features. This dissertation contributes to the reduction of false detection by developing methods to select predictive features and use them to span a space for anomaly detection under indeterminate operating conditions. Available feature selection methods fail to provide consistent results when some features are correlated. A method was developed in this dissertation to explore the correlation structure of features and group correlated features into the same clusters. A representative feature from each cluster is selected to form a non-correlated set of features, where an optimized subset of predictive features is selected. After feature selection, the influence of operating conditions through non-predictive variables are removed. To remove the influence on predictive features, a clustering-based anomaly detection method is developed. Observations are collected when the system is healthy, and these observations are grouped into clusters corresponding to the states of operating conditions with automatic estimation of clustering parameters. Anomalies are detected if the test data are not members of the clusters. Correct partitioning of clusters is an open challenge due to the lack of research on the clustering of the machinery health monitoring data. This dissertation uses unimodality of the data as a criterion for clustering validation, and a unimodality-based clustering method is developed. Methods of this dissertation were evaluated by simulated data, benchmark data, experimental study and field data. These methods provide consistent results and outperform representatives of available methods. Although the focus of this dissertation is on the application of machinery systems, the methods developed in this dissertation can be adapted for other application scenarios for anomaly detection, feature selection, and clustering