194 research outputs found

    One-Class Classification: Taxonomy of Study and Review of Techniques

    Full text link
    One-class classification (OCC) algorithms aim to build classification models when the negative class is either absent, poorly sampled or not well defined. This unique situation constrains the learning of efficient classifiers by defining class boundary just with the knowledge of positive class. The OCC problem has been considered and applied under many research themes, such as outlier/novelty detection and concept learning. In this paper we present a unified view of the general problem of OCC by presenting a taxonomy of study for OCC problems, which is based on the availability of training data, algorithms used and the application domains applied. We further delve into each of the categories of the proposed taxonomy and present a comprehensive literature review of the OCC algorithms, techniques and methodologies with a focus on their significance, limitations and applications. We conclude our paper by discussing some open research problems in the field of OCC and present our vision for future research.Comment: 24 pages + 11 pages of references, 8 figure

    Novel Semi-Supervised Learning Models to Balance Data Inclusivity and Usability in Healthcare Applications

    Get PDF
    abstract: Semi-supervised learning (SSL) is sub-field of statistical machine learning that is useful for problems that involve having only a few labeled instances with predictor (X) and target (Y) information, and abundance of unlabeled instances that only have predictor (X) information. SSL harnesses the target information available in the limited labeled data, as well as the information in the abundant unlabeled data to build strong predictive models. However, not all the included information is useful. For example, some features may correspond to noise and including them will hurt the predictive model performance. Additionally, some instances may not be as relevant to model building and their inclusion will increase training time and potentially hurt the model performance. The objective of this research is to develop novel SSL models to balance data inclusivity and usability. My dissertation research focuses on applications of SSL in healthcare, driven by problems in brain cancer radiomics, migraine imaging, and Parkinson’s Disease telemonitoring. The first topic introduces an integration of machine learning (ML) and a mechanistic model (PI) to develop an SSL model applied to predicting cell density of glioblastoma brain cancer using multi-parametric medical images. The proposed ML-PI hybrid model integrates imaging information from unbiopsied regions of the brain as well as underlying biological knowledge from the mechanistic model to predict spatial tumor density in the brain. The second topic develops a multi-modality imaging-based diagnostic decision support system (MMI-DDS). MMI-DDS consists of modality-wise principal components analysis to incorporate imaging features at different aggregation levels (e.g., voxel-wise, connectivity-based, etc.), a constrained particle swarm optimization (cPSO) feature selection algorithm, and a clinical utility engine that utilizes inverse operators on chosen principal components for white-box classification models. The final topic develops a new SSL regression model with integrated feature and instance selection called s2SSL (with “s2” referring to selection in two different ways: feature and instance). s2SSL integrates cPSO feature selection and graph-based instance selection to simultaneously choose the optimal features and instances and build accurate models for continuous prediction. s2SSL was applied to smartphone-based telemonitoring of Parkinson’s Disease patients.Dissertation/ThesisDoctoral Dissertation Industrial Engineering 201

    Automatic Diagnosis of Schizophrenia and Attention Deficit Hyperactivity Disorder in rs-fMRI Modality using Convolutional Autoencoder Model and Interval Type-2 Fuzzy Regression

    Get PDF
    Nowadays, many people worldwide suffer from brain disorders, and their health is in danger. So far, numerous methods have been proposed for the diagnosis of Schizophrenia (SZ) and attention deficit hyperactivity disorder (ADHD), among which functional magnetic resonance imaging (fMRI) modalities are known as a popular method among physicians. This paper presents an SZ and ADHD intelligent detection method of resting-state fMRI (rs-fMRI) modality using a new deep learning method. The University of California Los Angeles dataset, which contains the rs-fMRI modalities of SZ and ADHD patients, has been used for experiments. The FMRIB software library toolbox first performed preprocessing on rs-fMRI data. Then, a convolutional Autoencoder model with the proposed number of layers is used to extract features from rs-fMRI data. In the classification step, a new fuzzy method called interval type-2 fuzzy regression (IT2FR) is introduced and then optimized by genetic algorithm, particle swarm optimization, and gray wolf optimization (GWO) techniques. Also, the results of IT2FR methods are compared with multilayer perceptron, k-nearest neighbors, support vector machine, random forest, and decision tree, and adaptive neuro-fuzzy inference system methods. The experiment results show that the IT2FR method with the GWO optimization algorithm has achieved satisfactory results compared to other classifier methods. Finally, the proposed classification technique was able to provide 72.71% accuracy

    Parameter identification in networks of dynamical systems

    Get PDF
    Mathematical models of real systems allow to simulate their behavior in conditions that are not easily or affordably reproducible in real life. Defining accurate models, however, is far from trivial and there is no one-size-fits-all solution. This thesis focuses on parameter identification in models of networks of dynamical systems, considering three case studies that fall under this umbrella: two of them are related to neural networks and one to power grids. The first case study is concerned with central pattern generators, i.e. small neural networks involved in animal locomotion. In this case, a design strategy for optimal tuning of biologically-plausible model parameters is developed, resulting in network models able to reproduce key characteristics of animal locomotion. The second case study is in the context of brain networks. In this case, a method to derive the weights of the connections between brain areas is proposed, utilizing both imaging data and nonlinear dynamics principles. The third and last case study deals with a method for the estimation of the inertia constant, a key parameter in determining the frequency stability in power grids. In this case, the method is customized to different challenging scenarios involving renewable energy sources, resulting in accurate estimations of this parameter

    Diagnosis of Autism Spectrum Disorder Based on Brain Network Clustering

    Get PDF
    Developments in magnetic resonance imaging (MRI) provide new non-invasive approach—functional MRI (fMRI)—to study functions of brain. With the help of fMRI, I can build functional brain networks (FBN) to model correlations of brain activities between cortical regions. Studies focused on brain diseases, including autism spectrum disorder (ASD), have been conducted based on analyzing alterations in FBNs of patients. New biomarkers are identified, and new theories and assumptions are proposed on pathology of brain diseases. Considering that traditional clinical ASD diagnosis instruments, which greatly rely on interviews and observations, can yield large variance, recent studies start to combine machine learning methods and FBN to perform auto-classification of ASD. Such studies have achieved relatively good accuracy. However, in most of these studies, features they use are extracted from the whole brain networks thus the dimension of the features can be high. High-dimensional features may yield overfitting issues and increase computational complexity. Therefore, I need a feature selection strategy that effectively reduces feature dimensions while keeping a good classification performance. In this study, I present a new feature selection strategy that extracting features from functional modules but not the whole brain networks. I will show that my strategy not only reduces feature dimensions, but also improve performances of auto-classifications of ASD. The whole study can be separated into 4 stages: building FBNs, identification of functional modules, statistical analysis of modular alterations and, finally, training classifiers with modular features for auto-classification of ASD. I firstly demonstrate the whole procedure to build FBNs from fMRI images. To identify functional module, I propose a new network clustering algorithm based on joint non-negative matrix factorization. Different from traditional brain network clustering algorithms that mostly perform on an average network of all subjects, I design my algorithm to factorize multiple brain networks simultaneously because the clustering results should be valid not only on the average network but also on each individual network. I show the modules I find are more valid in both views. Then I statistically analyze the alterations in functional modules between ASD and typically developed (TD) group to determine from which modules I extract features from. Several indices based on graph theory are calculated to measure modular properties. I find significant alterations in two modules. With features from these two modules, I train several widely-used classifiers and validate the classifiers on a real-world dataset. The performances of classifiers trained by modular features are better than those with whole-brain features, which demonstrates the effectiveness of my feature selection strategy
    • …
    corecore