Critical Analysis of Solutions to Hadoop Small File Problem

Abstract

The Hadoop big data platform is designed to process large volumes of data, and the small file problem is a well-known performance bottleneck in Hadoop processing. Files smaller than the Hadoop block size create substantial storage overhead at the NameNode and also waste computational resources, since each small file spawns its own map task. Various solutions, such as merging small files and mapping multiple map threads to the same Java virtual machine instance, have been proposed to address the small file problem in Hadoop. This survey presents a critical analysis of existing works addressing the small file problem in Hadoop and variant platforms such as Spark. The aim is to understand their effectiveness in reducing storage and computational overhead and to identify open issues for further research.
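As a concrete illustration of the merging approach mentioned above, the following is a minimal sketch, not any specific surveyed technique, of packing a directory of small files into a single Hadoop SequenceFile keyed by file name. The class name SmallFileMerger and the command-line arguments are hypothetical; the Hadoop FileSystem and SequenceFile APIs shown are standard.

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataInputStream;
import org.apache.hadoop.fs.FileStatus;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.BytesWritable;
import org.apache.hadoop.io.SequenceFile;
import org.apache.hadoop.io.Text;

public class SmallFileMerger {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        FileSystem fs = FileSystem.get(conf);
        Path inputDir = new Path(args[0]);   // directory of small files (hypothetical argument)
        Path outputFile = new Path(args[1]); // merged SequenceFile (hypothetical argument)

        try (SequenceFile.Writer writer = SequenceFile.createWriter(conf,
                SequenceFile.Writer.file(outputFile),
                SequenceFile.Writer.keyClass(Text.class),
                SequenceFile.Writer.valueClass(BytesWritable.class))) {
            for (FileStatus status : fs.listStatus(inputDir)) {
                if (!status.isFile()) {
                    continue;
                }
                byte[] contents = new byte[(int) status.getLen()];
                try (FSDataInputStream in = fs.open(status.getPath())) {
                    in.readFully(contents);
                }
                // Key each record by the original file name so individual
                // files remain addressable after the merge.
                writer.append(new Text(status.getPath().getName()),
                              new BytesWritable(contents));
            }
        }
    }
}
```

The effect is that the NameNode tracks one file's metadata instead of thousands, and a MapReduce job over the merged file launches map tasks per block rather than per original small file.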
