UCSY's Research Repository

Merging Small Files Based on Agglomerative Hierarchical Clustering on HDFS for Cloud Storage

Show simple item record

dc.contributor.author Wai, Khin Su Su
dc.contributor.author Myint, Julia
dc.contributor.author Yee, Tin Tin
dc.date.accessioned 2019-07-03T06:38:10Z
dc.date.available 2019-07-03T06:38:10Z
dc.date.issued 2018-02-22
dc.identifier.uri http://onlineresource.ucsy.edu.mm/handle/123456789/234
dc.description.abstract Hadoop distributed file system (HDFS) was originally designed for large files. HDFS stores each small file as one separate block although the size of several small files is lesser than the size of block size. Therefore, a large number of blocks are created with massive small files. When the large number of small files is accessed, NameNode often becomes the bottleneck. The problem of storing and accessing large number of small files is named as small file problem. In order to solve this issue in HDFS, an approach of merging small files on HDFS is proposed. In this paper, small files are merged into a larger file based on the agglomerative hierarchical clustering mechanism to reduce NameNode memory consumption. This approach will provide small files for cloud storage. en_US
dc.language.iso en en_US
dc.publisher Sixteenth International Conferences on Computer Applications(ICCA 2018) en_US
dc.title Merging Small Files Based on Agglomerative Hierarchical Clustering on HDFS for Cloud Storage en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository



Browse

My Account

Statistics