Search CORE

6 research outputs found

A parallel formulation of the spatial autoregression model for mining large geo-spatial datasets

Author: Baris M. Kazar
Daniel Boley
David J. Lilja
Shashi Shekhar
Publication venue
Publication date
Field of study

The spatial auto-regression model (SAM) is a popular spatial data mining technique which has been used in many applications with geo-spatial datasets. However, serial procedures for estimating SAM parameters are computationally expensive due to the need to compute all the eigenvalues of a very large matrix. We propose a parallel formulation of the SAM parameter estimation procedure in this paper using data parallelism and hybrid programming technique. Experimental results on an IBM Regatta show that the proposed parallel formulation achieves a speedup of up to 7 on 8 processors. We are developing algebraic cost models to analyze the experimental results to further improve the speedups

CiteSeerX

Comparing exact and approximate spatial auto-regression model solutions for spatial data analysis

Author: Baris M. Kazar
David J. Lilja
R. Kelley Pace
Ranga R. Vatsavai
Shashi Shekhar
Publication venue
Publication date: 01/01/2004
Field of study

Abstract. The spatial auto-regression (SAR) model is a popular spatial data analysis technique, which has been used in many applications with geo-spatial datasets. However, exact solutions for estimating SAR parameters are computationally expensive due to the need to compute all the eigenvalues of a very large matrix. Recently we developed a dense-exact parallel formulation of the SAR parameter estimation procedure using data parallelism and a hybrid programming technique. Though this parallel implementation showed scalability up to eight processors, the exact solution still suffers from high computational complexity and memory requirements. These limitations have led us to investigate approximate solutions for SAR model parameter estimation with the main objective of scaling the SAR model for large spatial data analysis problems. In this paper we present two candidate approximate-semi-sparse solutions of the SAR model based on Taylor series expansion and Chebyshev polynomials. Our initial experiments showed that these new techniques scale well for very large data sets, such as remote sensing images having millions of pixels. The results also show that the differences between exact and approximate SAR parameter estimates are within 0.7 % and 8.2 % for Chebyshev polynomials and Taylor series expansion, respectively, and have no significant effect on the prediction accuracy.

CiteSeerX