8,268 research outputs found

A New Algorithm for Join Processing with the Internet Transfer Delays

Authors: Dandamudi S.; Imasaki K.
Publication venue: John von Neumann Institute for Computing
Publication date: 01/01/2006

Frequent Itemsets Mining for Big Data: A Comparative Analysis

Authors: Apiletti Daniele; Baralis Elena Maria; Cerquitelli Tania; Garza Paolo; Pulvirenti Fabio; Venturini Luca
Publication venue: Elsevier BV
Publication date: 01/09/2017

Itemset mining is a well-known exploratory data mining technique used to discover interesting correlations hidden in a data collection. Since it supports different targeted analyses, it is profitably exploited in a wide range of domains, from network traffic data to medical records. With the increasing amount of generated data, different scalable algorithms have been developed, exploiting the advantages of distributed computing frameworks such as Apache Hadoop and Spark. This paper reviews Hadoop- and Spark-based scalable algorithms addressing the frequent itemset mining problem in the Big Data domain through both theoretical and experimental comparative analyses. Since the itemset mining task is computationally expensive, its distribution and parallelization strategies heavily affect memory usage, load balancing, and communication costs. A detailed discussion of the algorithmic choices of the distributed methods for frequent itemset mining is followed by an experimental analysis comparing the performance of state-of-the-art distributed implementations on both synthetic and real datasets. The strengths and weaknesses of the algorithms are thoroughly discussed with respect to the dataset features (e.g., data distribution, average transaction length, number of records) and specific parameter settings. Finally, based on theoretical and experimental analyses, open research directions for the parallelization of the itemset mining problem are presented.

Crossref

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

Energy-aware data processing techniques for wireless sensor networks: a review

Authors: Chong S.; Gaber M.; Krishnaswamy S.; Loke S.
Publication date: 01/06/2011

Portsmouth University Research Portal (Pure)