UCSY's Research Repository

Performance Analysis of a Scalable Naïve Bayes Classifier on MapReduce and Beyond MapReduce

Show simple item record

dc.contributor.author Oo, Myat Cho Mon
dc.contributor.author Thein, Thandar
dc.date.accessioned 2019-07-03T06:49:26Z
dc.date.available 2019-07-03T06:49:26Z
dc.date.issued 2018-02-22
dc.identifier.uri http://onlineresource.ucsy.edu.mm/handle/123456789/247
dc.description.abstract Many real world areas from different sources generate the big data with large volume of high velocity, complex and variable data. Big data becomes a challenge when they are difficult to process and extract knowledge using traditional analysis tools. Therefore the scalable machine learning algorithms are needed for processing such big data. Recently Hadoop MapReduce framework has been adapted for parallel computing. MapReduce may not fit for most of the real world data applications. For large scale machine learning on distributed system, Spark has finally become much more viable beyond MapReduce. Although both of these frameworks are Apache-hosted data analytic framework, their performance varies significantly based on the use case under their implementation. This paper aims to analyze the performance of scalable Naïve Bayes classifier (SNB) which is implemented on MapReduce and Beyond MapReduce over different real world datasets. The comparison results show that SNB on Beyond MapReduce provides minimal processing time than SNB on MapReduce for efficiently big data classification. en_US
dc.language.iso en en_US
dc.publisher Sixteenth International Conferences on Computer Applications(ICCA 2018) en_US
dc.subject Bid Data en_US
dc.subject Beyond MapReduce en_US
dc.subject MapReduce en_US
dc.subject scalable Naive Bayes en_US
dc.subject Spark en_US
dc.title Performance Analysis of a Scalable Naïve Bayes Classifier on MapReduce and Beyond MapReduce en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository



Browse

My Account

Statistics