UCSY's Research Repository

Clustering Analogous Words in Myanmar Language using Word Embedding Model

Show simple item record

dc.contributor.author Mon, Aye Myat
dc.contributor.author Soe, Khin Mar
dc.date.accessioned 2019-07-23T03:41:18Z
dc.date.available 2019-07-23T03:41:18Z
dc.date.issued 2019-02-27
dc.identifier.uri http://onlineresource.ucsy.edu.mm/handle/123456789/1199
dc.description.abstract Word embedding represents the words in terms of vectors. It is influenced on different NLP research areas such as document classification, author identification, sentiment analysis, etc. One of the most popular embedding models is Word2Vec model. It provides efficient representations of words by using Continuous Bag of Words model (CBOW) and Skip Gram model. In English language, word embedding model can be applied for data preprocessing well but there is a very little amount of work done in Myanmar language. Text preprocessing is important part to build embedding model and it is a significantly effect on final results. This paper tries to extract the analogous words between Myanmar news articles focus on the bag of words (CBOW) model using different features vector sizes. By analyzing word embedding model are obtained the better results with a high dimensional vectors than a low dimensional vectors to cluster the words based on its relatedness. en_US
dc.language.iso en en_US
dc.publisher Seventeenth International Conference on Computer Applications(ICCA 2019) en_US
dc.subject Word2Vec en_US
dc.subject Continuous Bag of Words Model (CBOW) en_US
dc.subject Myanmar Language en_US
dc.subject Word Embedding en_US
dc.title Clustering Analogous Words in Myanmar Language using Word Embedding Model en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository



Browse

My Account

Statistics