UCSY's Research Repository

Improving the performance of Hadoop MapReduce Applications via Optimization of Concurrent Containers Per Node

Show simple item record

dc.contributor.author Htay, Than Than
dc.contributor.author Phyu, Sabai
dc.date.accessioned 2021-01-31T11:26:29Z
dc.date.available 2021-01-31T11:26:29Z
dc.date.issued 2020-02-28
dc.identifier.uri https://onlineresource.ucsy.edu.mm/handle/123456789/2562
dc.description.abstract Apache Hadoop is a distributed platform for storing, processing and analyzing of big data on commodity machines. Hadoop has tunable parameters and they affect the performance of MapReduce applications significantly. In order to improve the performance, tuning the Hadoop configuration parameters is an effective approach. Performance optimization is usually based on memory utilization, disk I/O rate, CPU utilization and network traffic. In this paper, the effect of MapReduce performance is experimented and analyzed by varying the number of concurrent containers (cc) per machine on yarn-based pseudo-distributed mode. In this experiment, we also measure the impact of performance by using different suitable Hadoop Distributed File System (HDFS) block size. From our experiment, we found that tuning cc per node improve performance compared to default parameter setting. We also observed the further performance improvement via optimizing cc along with different HDFS block size. en_US
dc.language.iso en en_US
dc.publisher Proceedings of the Eighteenth International Conference On Computer Applications (ICCA 2020) en_US
dc.subject MapReduce en_US
dc.subject parameter tuning en_US
dc.subject concurrent containers en_US
dc.subject block size en_US
dc.title Improving the performance of Hadoop MapReduce Applications via Optimization of Concurrent Containers Per Node en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository



Browse

My Account

Statistics