UCSY's Research Repository

Replication Based on Data Locality for Hadoop Distributed File System

Show simple item record

dc.contributor.author Thu, May Phyo
dc.contributor.author Nwe, Khine Moe
dc.contributor.author Aye, Kyar Nyo
dc.date.accessioned 2019-07-16T07:10:51Z
dc.date.available 2019-07-16T07:10:51Z
dc.date.issued 2019-06-15
dc.identifier.isbn 978-981-14-1684-2
dc.identifier.uri https://onlineresource.ucsy.edu.mm/handle/123456789/918
dc.description.abstract Replication plays an important role for storage system to improve data availability, throughput and response time for user and control storage cost. Due to different nature of data access pattern, data popularity is important in replication because of the unstable and unpredictable nature of popular files. Also, replicas placement is important in consideration of system's performance. In data-parallel applications, data locality is a key issue and this consequence of this issue occurs the decrement of system’ performance. Therefore, this paper proposes a data locality-based replication for Hadoop Distributed File System (HDFS). In replica allocation, data popularity is considered for maintaining less replicas for unpopular data and also, disk bandwidth, CPU utilization and disk utilization are considered in the proposed replica placement algorithm in order to get better data locality and more effective storage utilization. Our proposed scheme will be effective for HDFS. en_US
dc.language.iso en en_US
dc.publisher 9th International Workshop on Computer Science and Engineering (WCSE 2019), Hong Kong en_US
dc.subject Replication en_US
dc.subject Data Locality en_US
dc.subject Data Popularity en_US
dc.title Replication Based on Data Locality for Hadoop Distributed File System en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository



Browse

My Account

Statistics