5 research outputs found
Parallelization of Partitioning Around Medoids (PAM) in K-Medoids Clustering on GPU
K-medoids clustering is categorized as partitional clustering. K-medoids offers better result when dealing with outliers and arbitrary distance metric also in the situation when the mean or median does not exist within data. However, k-medoids suffers a high computational complexity. Partitioning Around Medoids (PAM) has been developed to improve k-medoids clustering, consists of build and swap steps and uses the entire dataset to find the best potential medoids. Thus, PAM produces better medoids than other algorithms. This research proposes the parallelization of PAM in k-medoids clustering on GPU to reduce computational time at the swap step of PAM. The parallelization scheme utilizes shared memory, reduction algorithm, and optimization of the thread block configuration to maximize the occupancy. Based on the experiment result, the proposed parallelized PAM k-medoids is faster than CPU and Matlab implementation and efficient for large dataset
Piece‐wise constant cluster modelling of dynamics of upwelling patterns
A comprehensive approach is presented to analyse season's coastal upwelling represented by weekly sea surface temperature (SST) image grids. Our three-stage data recovery clustering method assumes that the season's upwelling can be divided into shorter periods of stability, ranges, each to be represented by a constant core and variable shell parts. Corresponding clustering algorithms parameters are automatically derived by using the least-squares clustering criterion. The approach has been successfully applied to real-world SST data covering two distinct regions: Portuguese coast and Morocco coast, for 16 years each.LA/P/0101/2020info:eu-repo/semantics/publishedVersio
Parallelization of Partitioning Around Medoids (PAM) in K-Medoids Clustering on GPU
K-medoids clustering is categorized as partitional clustering. K-medoids
offers better result when dealing with outliers and arbitrary distance metric
also in the situation when the mean or median does not exist within data.
However, k-medoids suffers a high computational complexity. Partitioning
Around Medoids (PAM) has been developed to improve k-medoids
clustering, consists of build and swap steps and uses the entire dataset to find
the best potential medoids. Thus, PAM produces better medoids than other
algorithms. This research proposes the parallelization of PAM in k-medoids
clustering on GPU to reduce computational time at the swap step of PAM.
The parallelization scheme utilizes shared memory, reduction algorithm, and
optimization of the thread block configuration to maximize the occupancy.
Based on the experiment result, the proposed parallelized PAM k-medoids is
faster than CPU and Matlab implementation and efficient for large dataset