Efficient Performance Optimization on Yarn-Based MapReduce Hadoop Framework

dc.contributor.author	Htay, Than Than
dc.contributor.author	Phyu, Sabai
dc.date.accessioned	2019-07-22T04:39:14Z
dc.date.available	2019-07-22T04:39:14Z
dc.date.issued	2019-02-27
dc.identifier.uri	http://onlineresource.ucsy.edu.mm/handle/123456789/1124
dc.description.abstract	Apache Hadoop exposes 180+ configuration parameters for all types of applications and clusters, 10-20% of which has a great impact on performance and efficiency of the execution. The optimal configuration settings for one application may not be suitable for another one leading to poor system resources utilization and long application completion time. Further, optimizing many parameters is a time consuming and a challenging job because configuration parameters and search space are huge, and users require good knowledge of Hadoop framework. The issue is that the user should adjust at least the important parameters, e.g. the number of map tasks that can run in parallel for a given application. This paper introduces the parameter optimization algorithm to the key application level parameter based on input data size and dynamic resource capabilities at any given time for a given application to improve execution time and resource utilization with nearly zero optimization overhead.	en_US
dc.language.iso	en	en_US
dc.publisher	Seventeenth International Conference on Computer Applications(ICCA 2019)	en_US
dc.title	Efficient Performance Optimization on Yarn-Based MapReduce Hadoop Framework	en_US
dc.type	Article	en_US