UCSY's Research Repository

Chunk Tagged Corpus Creation for Myanmar Language

Show simple item record

dc.contributor.author Myint, Phyu Hninn
dc.contributor.author Htwe, Tin Myat
dc.contributor.author Thein, Ni Lar
dc.date.accessioned 2019-07-03T03:42:13Z
dc.date.available 2019-07-03T03:42:13Z
dc.date.issued 2011-05-05
dc.identifier.uri http://onlineresource.ucsy.edu.mm/handle/123456789/160
dc.description.abstract In the applications of Natural language processing (NLP), sentence analysis is one of the important phases for machine translation systems. Currently, no mature deep analysis that has been worked done is available for Myanmar language. To perform shallow parsing on sentences, the chunk identification is a fundamental task. The POS tagged corpus creation has been proposed in [8] and in this paper, we have proposed a methodology for building chunk tagged corpus for Myanmar Language. We use the POS tagged corpus that is proposed in [8] and identify chunks in Myanmar POS tagged texts. Our approach uses rule-based on how to identify all chunks in a Myanmar sentence. As a preprocessing step, normalization of POS tags is needed to perform in order to produce finer tags. Hence, normalization rules are also developed. After normalization, chunk rules are applied to tag chunk for these finer tags. Our chunk tagged corpus is very useful in Myanmar to English machine translation system. en_US
dc.language.iso en en_US
dc.publisher Ninth International Conference On Computer Applications (ICCA 2011) en_US
dc.title Chunk Tagged Corpus Creation for Myanmar Language en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository



Browse

My Account

Statistics