A Parallel Space Saving Algorithm For Frequent Items and the Hurwitz
  zeta distribution

Cafaro, Massimo; Pulimeno, Marco; Tempesta, Piergiulio

slides

A Parallel Space Saving Algorithm For Frequent Items and the Hurwitz zeta distribution

Authors: Massimo Cafaro
Marco Pulimeno
Piergiulio Tempesta
Publication date: 11 September 2015
Publisher: 'Elsevier BV'
Doi

Abstract

We present a message-passing based parallel version of the Space Saving algorithm designed to solve the

k

--majority problem. The algorithm determines in parallel frequent items, i.e., those whose frequency is greater than a given threshold, and is therefore useful for iceberg queries and many other different contexts. We apply our algorithm to the detection of frequent items in both real and synthetic datasets whose probability distribution functions are a Hurwitz and a Zipf distribution respectively. Also, we compare its parallel performances and accuracy against a parallel algorithm recently proposed for merging summaries derived by the Space Saving or Frequent algorithms.Comment: Accepted for publication. To appear in Information Sciences, Elsevier. http://www.sciencedirect.com/science/article/pii/S002002551500657

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Archivio Istituzionale della Ricerca- Università del Salento

oai:iris.unisalento.it:11587/3...

Last time updated on 07/05/2019