UCSY's Research Repository

Word Segmentation and New Word Identification of Myanmar Language

dc.contributor.author Soe, Ei Phyu Phyu
dc.date.accessioned 2019-09-25T05:55:00Z
dc.date.available 2019-09-25T05:55:00Z
dc.date.issued 2012-02-28
dc.identifier.uri http://onlineresource.ucsy.edu.mm/handle/123456789/2262
dc.description.abstract Myanmar text differs from English text in that it has no spaces to mark word boundaries, and this lack of word delimiters makes Myanmar word segmentation difficult. To segment Myanmar words, systems typically use knowledge-based methods and large lexicons. This paper presents the ability of linear-chain conditional random fields (CRFs) to perform Myanmar word segmentation when provided with a lexicon. It also presents a probabilistic new word detection method based on access variety (AV) statistics and the forward-backward algorithm. The system constructs a lexicon to improve new word identification. en_US
dc.language.iso en_US en_US
dc.publisher Tenth International Conference On Computer Applications (ICCA 2012) en_US
dc.subject Myanmar language en_US
dc.subject Lexicon en_US
dc.subject Word Segmentation en_US
dc.subject Conditional Random Fields en_US
dc.title Word Segmentation and New Word Identification of Myanmar Language en_US
dc.type Article en_US
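
The abstract names access variety (AV) statistics without giving a formula. As a rough illustration only, the sketch below follows the definition commonly used for accessor variety in the word segmentation literature: the AV of a candidate string is the smaller of its number of distinct left neighbours and its number of distinct right neighbours in a corpus. The function name `accessor_variety`, the `max_len` parameter, the boundary markers, and the toy Latin-letter corpus are all assumptions made for this example and are not taken from the paper, which would more plausibly operate on Myanmar syllables.

```python
from collections import defaultdict

# Hypothetical sentinels for sentence boundaries. Being multi-character,
# they cannot collide with any single-character neighbour.
BOS, EOS = "<BOS>", "<EOS>"


def accessor_variety(corpus, max_len=3):
    """Return {substring: AV} for all substrings up to max_len units.

    AV(s) = min(distinct left neighbours of s, distinct right neighbours of s).
    Simplification (an assumption of this sketch): every sentence start
    counts as one shared left neighbour and every sentence end as one
    shared right neighbour.
    """
    left = defaultdict(set)
    right = defaultdict(set)
    for sent in corpus:
        n = len(sent)
        for i in range(n):
            for j in range(i + 1, min(i + max_len, n) + 1):
                s = sent[i:j]
                left[s].add(sent[i - 1] if i > 0 else BOS)
                right[s].add(sent[j] if j < n else EOS)
    return {s: min(len(left[s]), len(right[s])) for s in left}


# Toy corpus: "bc" appears after {a, x} and before {d, y, e}.
av = accessor_variety(["abcd", "xbcy", "abce"])
print(av["bc"], av["ab"])
```

A string that occurs in many different contexts gets a high AV and is a stronger word candidate; how the paper combines such scores with the forward-backward algorithm is not specified in the abstract, so that step is left out here.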
