Search CORE

42,623 research outputs found

Unsupervised Feature Selection with Adaptive Structure Learning

Author: Alelyani S.
He X.
Hou C.
Krzanowski W.
Li Z.
Liu J.
Liu X.
Nie F.
Nie F.
Qian M.
Takeuchi I.
Yang Y.
Zhao Z.
Publication venue
Publication date: 02/04/2015
Field of study

The problem of feature selection has raised considerable interests in the past decade. Traditional unsupervised methods select the features which can faithfully preserve the intrinsic structures of data, where the intrinsic structures are estimated using all the input features of data. However, the estimated intrinsic structures are unreliable/inaccurate when the redundant and noisy features are not removed. Therefore, we face a dilemma here: one need the true structures of data to identify the informative features, and one need the informative features to accurately estimate the true structures of data. To address this, we propose a unified learning framework which performs structure learning and feature selection simultaneously. The structures are adaptively learned from the results of feature selection, and the informative features are reselected to preserve the refined structures of data. By leveraging the interactions between these two essential tasks, we are able to capture accurate structures and select more informative features. Experimental results on many benchmark data sets demonstrate that the proposed method outperforms many state of the art unsupervised feature selection methods

arXiv.org e-Print Archive

CiteSeerX

Crossref

A Novel Deep Learning Framework to Identify Latent Neuroendophenotypes from Multimodal Brain Imaging Data

Author: Wu Yitao
Publication venue
Publication date: 13/06/2021
Field of study

The expertise required to ensure adequate treatment for patients with complex cases is significantly deficient, which leads to the high demand for subtyping or clustering analysis on different clinical situations. The identification and refinement of disease-related subtypes will support both medical treatments and pathological research. Clinically, clustering can narrow down the possible causes and provide effective treatment options. However, the clustering on non-invasive multimodal brain imaging data has not been well addressed. In this thesis, we explore this clustering issue using a deep unsupervised embedded clustering (DEMC) method on multimodal brain imaging data. T1-weighted magnetic resonance imaging (MRI) features and resting-state functional MRI-derived brain networks are learned by a sparse autoencoder and a stacked autoencoder separately and then transformed into the embedding space. Then, the K-Means approach was adopted to set the initial center of the deeply embedded clustering structure (DEC) as the centroids, after which DEC clusters with the KL divergence. In the entire processing, the deep embedding and clustering are optimized simultaneously. This new framework was tested on 994 subjects from Human Connectome Project (HCP) and the results show that this new framework has better clustering performance in comparison with other benchmark algorithms

D-Scholarship@Pitt

Large Scale Spectral Clustering Using Approximate Commute Time Embedding

Author: C. Fowlkes
D. Achlioptas
D. Mavroeidis
D.A. Spielman
F. Fouss
H. Qiu
I. Koutis
L. Wang
P.G. Doyle
U. von Luxburg
W.Y. Chen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

Spectral clustering is a novel clustering method which can detect complex shapes of data clusters. However, it requires the eigen decomposition of the graph Laplacian matrix, which is proportion to

O(n^3)

and thus is not suitable for large scale systems. Recently, many methods have been proposed to accelerate the computational time of spectral clustering. These approximate methods usually involve sampling techniques by which a lot information of the original data may be lost. In this work, we propose a fast and accurate spectral clustering approach using an approximate commute time embedding, which is similar to the spectral embedding. The method does not require using any sampling technique and computing any eigenvector at all. Instead it uses random projection and a linear time solver to find the approximate embedding. The experiments in several synthetic and real datasets show that the proposed approach has better clustering quality and is faster than the state-of-the-art approximate spectral clustering methods

arXiv.org e-Print Archive

Crossref