Addressing Name Node Scalability Issue in  Hadoop Distributed File System using Cache Approach

Chetan Agrawal; Pooja Yedale; Devesh Maru; Pranav Gadekar

Published Sep 11, 2014

Download

PDF

Statistic

Downloads

Download data is not yet available.

Volume 3, Issue 9, September 2014

Chetan Agrawal

Pooja Yedale

Devesh Maru

Pranav Gadekar

Abstract

Hadoop is a distributed batch processing infrastructure which is currently being used for big data management. At the foundation of Hadoop lies Hadoop Distributed File System (HDFS). HDFS presents a client-server architecture comprised of a NameNode and many DataNodes. The NameNode stores the metadata for the DataNodes and DataNode stores application data. The NameNode holds file system metadata in memory, and thus the limit to the number of files in a file system is governed by the amount of memory on the NameNode. Thus when the memory on NameNode is full there is no further chance of increasing the cluster capacity. In this paper we have used the concept of cache memory for handling the issue of NameNode scalability.

About Journal

Open Access Policy

Addressing Name Node Scalability Issue in Hadoop Distributed File System using Cache Approach

Downloads

Abstract

About Journal

Open Access Policy

##plugins.themes.academic_pro.article.sidebar##

Downloads

##plugins.themes.academic_pro.article.main##

Abstract

##plugins.themes.academic_pro.article.details##