3 research outputs found

A distributed computing model for big data anonymization in the networks.

Authors: Farough Ashkouti, Keyhan Khamforoosh
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2023

Big data and its applications have recently grown sharply in fields such as IoT, bioinformatics, e-commerce, and social media. The huge volume of data poses enormous challenges to the architecture, infrastructure, and computing capacity of IT systems, so the scientific and industrial communities have a compelling need for large-scale, robust computing systems. Since one of the characteristics of big data is value, data should be published so that analysts can extract useful patterns from it. However, data publishing may lead to the disclosure of individuals' private information. Among modern parallel computing platforms, Apache Spark is a fast, in-memory computing framework for large-scale data processing that provides high scalability through resilient distributed datasets (RDDs). Because of its in-memory computations, it can be up to 100 times faster than Hadoop. Apache Spark is therefore one of the essential frameworks for implementing distributed methods for privacy-preserving big data publishing (PPBDP). This paper uses the RDD programming model of Apache Spark to propose an efficient parallel implementation of a new computing model for big data anonymization. The computing model has three phases of in-memory computation to address the runtime, scalability, and performance of large-scale data anonymization. It supports partition-based data clustering algorithms that preserve the λ-diversity privacy model by using transformations and actions on RDDs. The authors accordingly investigate a Spark-based implementation that preserves the λ-diversity privacy model with two purpose-designed distance functions, City block and Pearson. The results provide a comprehensive guideline that allows researchers to apply Apache Spark in their own research.
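
The abstract names two distance functions and a diversity-based privacy model without defining them. The sketch below illustrates the textbook versions in plain Python; it is not the paper's Spark implementation. It assumes that the Pearson distance is one minus the Pearson correlation coefficient and that the privacy check is the "distinct" form of the diversity model (every cluster contains at least `l` distinct sensitive values). The function names are hypothetical.

```python
from collections import defaultdict
from math import sqrt


def city_block(a, b):
    """City block (Manhattan) distance: sum of absolute coordinate differences."""
    return sum(abs(x - y) for x, y in zip(a, b))


def pearson_distance(a, b):
    """Assumed definition: 1 - Pearson correlation coefficient of a and b."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        # Correlation is undefined for a constant vector; treating the pair
        # as uncorrelated is an assumption made for this sketch.
        return 1.0
    return 1.0 - cov / sqrt(var_a * var_b)


def satisfies_l_diversity(records, l):
    """Check distinct l-diversity.

    records: iterable of (cluster_id, sensitive_value) pairs.
    Returns True if every cluster holds at least l distinct sensitive values.
    """
    values_per_cluster = defaultdict(set)
    for cluster_id, sensitive_value in records:
        values_per_cluster[cluster_id].add(sensitive_value)
    return all(len(values) >= l for values in values_per_cluster.values())
```

In a Spark setting, the distance functions would typically be applied inside RDD transformations and the diversity check over the clustered records, but how the paper arranges this across its three phases is not stated in the abstract.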

Directory of Open Access Journals

Cooperative multi-agent actor–critic control of traffic network flow based on edge computing

Authors: Alibaba, Ashkouti, Bu, Chen, Chu, Elaziz, Foerster, Ge, Khan, Li, Lillicrap, Liu, Liu, Lowe, Mnih, Mukherjee, Nord, Rasheed, Rumelhart, Saleem, Shi, Silver, Sodhro, Sunehag, Talavera-Llames, Tan, Tan, Tang, Wu, Wu, Yang, Zhang, Zhang
Publication venue: 'Elsevier BV'

Crossref

DI-Mondrian: Distributed improved Mondrian for satisfaction of the L-diversity privacy model using Apache Spark

Authors: Abdelhameed, Al-Zobbi, Ali, Amir Sheikhahmadi, Ayyub, Canbay, Clifton, de Montjoye, Farough Ashkouti, Fung, Fung, Han, Jain, Keyhan Khamforoosh, LeFevre, Li, Mehmood, Meier, Nayahi, Nergiz, Ninghui, Puri, Salloum, Sweeney, Temuujin, Xiao, Xu, Xu, Yaseen, Yu, Zaharia, Zakerzadeh, Zhang, Zhang, Zhang, Zheng, Zigomitros
Publication venue: 'Elsevier BV'

Crossref