Performance Analysis of a Scalable Naïve Bayes Classifier on MapReduce and Beyond MapReduce

Oo, Myat Cho Mon; Thein, Thandar

dc.contributor.author	Oo, Myat Cho Mon
dc.contributor.author	Thein, Thandar
dc.date.accessioned	2019-07-03T06:49:26Z
dc.date.available	2019-07-03T06:49:26Z
dc.date.issued	2018-02-22
dc.identifier.uri	http://onlineresource.ucsy.edu.mm/handle/123456789/247
dc.description.abstract	Many real-world domains generate big data: large volumes of high-velocity, complex, and variable data from different sources. Big data becomes a challenge when it is difficult to process and to extract knowledge from using traditional analysis tools. Scalable machine learning algorithms are therefore needed to process such data. The Hadoop MapReduce framework has recently been adopted for parallel computing, but MapReduce may not suit most real-world data applications. For large-scale machine learning on distributed systems, Spark has become a more viable option beyond MapReduce. Although both are Apache-hosted data analytics frameworks, their performance varies significantly with the use case being implemented. This paper analyzes the performance of a scalable Naïve Bayes classifier (SNB) implemented on MapReduce and on Beyond MapReduce over different real-world datasets. The comparison results show that SNB on Beyond MapReduce requires less processing time than SNB on MapReduce, making big data classification more efficient.	en_US
dc.language.iso	en	en_US
dc.publisher	Sixteenth International Conference on Computer Applications (ICCA 2018)	en_US
dc.subject	Big Data	en_US
dc.subject	Beyond MapReduce	en_US
dc.subject	MapReduce	en_US
dc.subject	Scalable Naïve Bayes	en_US
dc.subject	Spark	en_US
dc.title	Performance Analysis of a Scalable Naïve Bayes Classifier on MapReduce and Beyond MapReduce	en_US
dc.type	Article	en_US