3 research outputs found

    Multi-View Robust Tensor-Based Subspace Clustering

    No full text
    In this era of technology advancement, huge amount of data is collected from different disciplines. This data needs to be stored, processed and analyzed to understand its nature. Networks or graphs arise to model real-world systems in the different fields. Early work in network theory adopted simple graphs to model systems where the system’s entities and interactions among them are modeled as nodes and static, single-type edges, respectively. However, this representation is considered limited when the system’s entities interact through different sources. Multi-view networks have recently attracted attention due to its ability to consider the different interactions between entities explicitly. An important tool to understand the structure of multi-view networks is community detection. Community detection or clustering reveals the significant communities in the network which provides dimensionality reduction and a better understanding of the network. In this paper, a new robust clustering algorithm is proposed to detect the community structure in multi-view networks. In particular, the proposed approach constructs a 3-mode tensor from the normalized adjacency matrices that represent the different views. The constructed tensor is decomposed into a self-representation and error components where the extracted self-representation tensor is used to detect the community structure of the multi-view network. Moreover, a common subspace is computed among all views where the contribution of each view to the common subspace is optimized. The proposed method is applied to several real-world data sets and the results show that the proposed method achieves the best performance compared to other state-of-the-art algorithms

    Multivariate and Dimensionality-Reduction-Based Machine Learning Techniques for Tumor Classification of RNA-Seq Data

    No full text
    Cancer, a genetic disease, is considered one of the leading causes of death globally and affects people of all ages. Ribonucleic acid sequencing (RNA-Seq) is a technique used to quantify the expression of genes of interest and can be used to classify cancer tumor types. This paper describes a machine learning technique to classify cancer tissue samples by tumor type, such as breast cancer, lung cancer, colon cancer, and others. More than 60,000 RNA-Seq features were analyzed using six different machine learning classification algorithms, both individually and as an ensemble. Numerous dimensionality reduction techniques addressed the challenges of working with enormous amounts of genetic data. In particular, we were able to reduce the number of features from over 60,000 to 660 in the random forest feature selection and to 68 factor features using factor analysis with an accuracy of 99% in classifying tumor types

    Discovering Community Structure in Multiplex Networks via a Co-Regularized Robust Tensor-Based Spectral Approach

    No full text
    Complex networks arise in various fields, such as biology, sociology and communication, to model interactions among entities. Entities in many real-world systems exhibit different types of interactions, which requires modeling these type of systems properly. Multiplex networks are used to model these systems, as they can reflect the nodes’ pair-wise interactions as multiple distinct types of links across layers. Community detection is a widely studied application in network analysis as it provides insights into the structure and organization of the network. Even though multiple algorithms have been developed in the community detection field, many of them have a limited performance in the presence of noise. In this article, we develop a novel algorithm that combines tensor low-rank representation, spectral clustering and distance regularization to improve the accuracy in discovering communities in multiplex networks. The low-rank representation leads to reducing the noise and errors existing in the network and the optimization of an accurate consensus set of eigenvectors that reveals the communities in the network. Moreover, the proposed approach balances the agreement between the eigenvectors of each layer, i.e., individual subspaces, and the consensus set of eigenvectors, i.e., common subspaces, by minimizing the projection distance between them. The common and individual subspaces are computed efficiently through Tucker decomposition and modified spectral clustering, respectively. Finally, multiple experiments are conducted on real and simulated networks to evaluate the proposed approach and compare it to state-of-the-art algorithms. The proposed approach shows its robustness and efficiency in discovering the communities in multiplex networks
    corecore