UCSY's Research Repository

Joint Word Segmentation and Stemming for Myanmar Language Based on Conditional Random Fields

Show simple item record

dc.contributor.author Oo, Yadanar
dc.contributor.author Soe, Khin Mar
dc.date.accessioned 2019-07-03T08:12:22Z
dc.date.available 2019-07-03T08:12:22Z
dc.date.issued 2018-02-22
dc.identifier.uri http://onlineresource.ucsy.edu.mm/handle/123456789/325
dc.description.abstract In this paper, we describe a joint work on word segmentation and stemming of Myanmar sentences with syllabled-based tagging under Conditional Random Fields(CRF) framework. A manuallysegmented corpus was developed to train the segmenter, and we implement it as a 7-tag syllablebased tagging and stemming with conditional random fields(CRF). And then, the trained CRF segmenter was compared to a baseline approached based on longest matching that used a dictionary extracted from manually segmented corpus. In our approach, we can achieve comparative performances compared to 4-tag syllable tagging approach. The experimental results show that the CRF with 7-tag set and word feature improve the stemming performance. en_US
dc.language.iso en en_US
dc.publisher Sixteenth International Conferences on Computer Applications(ICCA 2018) en_US
dc.subject segmentation en_US
dc.subject stemming en_US
dc.subject syllable tagging en_US
dc.subject conditional random fields en_US
dc.title Joint Word Segmentation and Stemming for Myanmar Language Based on Conditional Random Fields en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository



Browse

My Account

Statistics