2 research outputs found
Revisiting Large Scale Distributed Machine Learning
Nowadays, with the widespread of smartphones and other portable gadgets
equipped with a variety of sensors, data is ubiquitous available and the focus
of machine learning has shifted from being able to infer from small training
samples to dealing with large scale high-dimensional data. In domains such as
personal healthcare applications, which motivates this survey, distributed
machine learning is a promising line of research, both for scaling up learning
algorithms, but mostly for dealing with data which is inherently produced at
different locations. This report offers a thorough overview of and
state-of-the-art algorithms for distributed machine learning, for both
supervised and unsupervised learning, ranging from simple linear logistic
regression to graphical models and clustering. We propose future directions for
most categories, specific to the potential personal healthcare applications.
With this in mind, the report focuses on how security and low communication
overhead can be assured in the specific case of a strictly client-server
architectural model. As particular directions we provides an exhaustive
presentation of an empirical clustering algorithm, k-windows, and proposed an
asynchronous distributed machine learning algorithm that would scale well and
also would be computationally cheap and easy to implement
Sparse Representation Based Augmented Multinomial Logistic Extreme Learning Machine with Weighted Composite Features for Spectral Spatial Hyperspectral Image Classification
Although extreme learning machine (ELM) has been successfully applied to a
number of pattern recognition problems, it fails to pro-vide sufficient good
results in hyperspectral image (HSI) classification due to two main drawbacks.
The first is due to the random weights and bias of ELM, which may lead to
ill-posed problems. The second is the lack of spatial information for
classification. To tackle these two problems, in this paper, we propose a new
framework for ELM based spectral-spatial classification of HSI, where
probabilistic modelling with sparse representation and weighted composite
features (WCF) are employed respectively to derive the op-timized output
weights and extract spatial features. First, the ELM is represented as a
concave logarithmic likelihood function under statistical modelling using the
maximum a posteriori (MAP). Second, the sparse representation is applied to the
Laplacian prior to effi-ciently determine a logarithmic posterior with a unique
maximum in order to solve the ill-posed problem of ELM. The variable splitting
and the augmented Lagrangian are subsequently used to further reduce the
computation complexity of the proposed algorithm and it has been proven a more
efficient method for speed improvement. Third, the spatial information is
extracted using the weighted compo-site features (WCFs) to construct the
spectral-spatial classification framework. In addition, the lower bound of the
proposed method is derived by a rigorous mathematical proof. Experimental
results on two publicly available HSI data sets demonstrate that the proposed
methodology outperforms ELM and a number of state-of-the-art approaches.Comment: 16 pages, 6 figuers and 4 table