Search CORE

71,992 research outputs found

Growing Regression Forests by Classification: Applications to Object Pose Estimation

Author: Chellappa Rama
Hara Kota
Publication venue
Publication date: 01/01/2014
Field of study

In this work, we propose a novel node splitting method for regression trees and incorporate it into the regression forest framework. Unlike traditional binary splitting, where the splitting rule is selected from a predefined set of binary splitting rules via trial-and-error, the proposed node splitting method first finds clusters of the training data which at least locally minimize the empirical loss without considering the input space. Then splitting rules which preserve the found clusters as much as possible are determined by casting the problem into a classification problem. Consequently, our new node splitting method enjoys more freedom in choosing the splitting rules, resulting in more efficient tree structures. In addition to the Euclidean target space, we present a variant which can naturally deal with a circular target space by the proper use of circular statistics. We apply the regression forest employing our node splitting to head pose estimation (Euclidean target space) and car direction estimation (circular target space) and demonstrate that the proposed method significantly outperforms state-of-the-art methods (38.5% and 22.5% error reduction respectively).Comment: Paper accepted by ECCV 201

arXiv.org e-Print Archive

Crossref

Robust EM algorithm for model-based curve clustering

Author: Chamroukhi Faicel
Publication venue
Publication date: 25/12/2013
Field of study

Model-based clustering approaches concern the paradigm of exploratory data analysis relying on the finite mixture model to automatically find a latent structure governing observed data. They are one of the most popular and successful approaches in cluster analysis. The mixture density estimation is generally performed by maximizing the observed-data log-likelihood by using the expectation-maximization (EM) algorithm. However, it is well-known that the EM algorithm initialization is crucial. In addition, the standard EM algorithm requires the number of clusters to be known a priori. Some solutions have been provided in [31, 12] for model-based clustering with Gaussian mixture models for multivariate data. In this paper we focus on model-based curve clustering approaches, when the data are curves rather than vectorial data, based on regression mixtures. We propose a new robust EM algorithm for clustering curves. We extend the model-based clustering approach presented in [31] for Gaussian mixture models, to the case of curve clustering by regression mixtures, including polynomial regression mixtures as well as spline or B-spline regressions mixtures. Our approach both handles the problem of initialization and the one of choosing the optimal number of clusters as the EM learning proceeds, rather than in a two-fold scheme. This is achieved by optimizing a penalized log-likelihood criterion. A simulation study confirms the potential benefit of the proposed algorithm in terms of robustness regarding initialization and funding the actual number of clusters.Comment: In Proceedings of the 2013 International Joint Conference on Neural Networks (IJCNN), 2013, Dallas, TX, US

arXiv.org e-Print Archive

Crossref

Disease Mapping via Negative Binomial Regression M-quantiles

Author: Chambers Ray
Dreassi Emanuela
Salvati Nicola
Publication venue: 'Wiley'
Publication date: 01/01/2013
Field of study

We introduce a semi-parametric approach to ecological regression for disease mapping, based on modelling the regression M-quantiles of a Negative Binomial variable. The proposed method is robust to outliers in the model covariates, including those due to measurement error, and can account for both spatial heterogeneity and spatial clustering. A simulation experiment based on the well-known Scottish lip cancer data set is used to compare the M-quantile modelling approach and a random effects modelling approach for disease mapping. This suggests that the M-quantile approach leads to predicted relative risks with smaller root mean square error than standard disease mapping methods. The paper concludes with an illustrative application of the M-quantile approach, mapping low birth weight incidence data for English Local Authority Districts for the years 2005-2010.Comment: 23 pages, 7 figure

arXiv.org e-Print Archive

CiteSeerX

Crossref

Research Online

Provable Sparse Tensor Decomposition

Author: Cheng Guang
Liu Han
Lu Junwei
Sun Will Wei
Publication venue
Publication date: 02/05/2016
Field of study

We propose a novel sparse tensor decomposition method, namely Tensor Truncated Power (TTP) method, that incorporates variable selection into the estimation of decomposition components. The sparsity is achieved via an efficient truncation step embedded in the tensor power iteration. Our method applies to a broad family of high dimensional latent variable models, including high dimensional Gaussian mixture and mixtures of sparse regressions. A thorough theoretical investigation is further conducted. In particular, we show that the final decomposition estimator is guaranteed to achieve a local statistical rate, and further strengthen it to the global statistical rate by introducing a proper initialization procedure. In high dimensional regimes, the obtained statistical rate significantly improves those shown in the existing non-sparse decomposition methods. The empirical advantages of TTP are confirmed in extensive simulated results and two real applications of click-through rate prediction and high-dimensional gene clustering.Comment: To Appear in JRSS-

arXiv.org e-Print Archive

Princeton University Open Access Repository

Crossref

University of Miami: Scholarship Miami