UCSY's Research Repository

Building Word-Aligned Bilingual Corpus for Statistical Myanmar-English Translation

Show simple item record

dc.contributor.author Nwet, Khin Thandar
dc.date.accessioned 2019-07-25T04:30:22Z
dc.date.available 2019-07-25T04:30:22Z
dc.date.issued 2010-12-16
dc.identifier.uri http://onlineresource.ucsy.edu.mm/handle/123456789/1264
dc.description.abstract In recent years statistical word alignment models have been widely used for various Natural Language Processing (NLP) problems. In this paper we describe our work in constructing an aligned English-Myanmar parallel corpus. Corpora are not available for Myanmar language and our work in developing parallel corpus will also hopefully be very useful in many natural language applications. Word alignment plays a crucial role in statistical machine translation, since word-aligned corpora have been found to be an excellent source of translation-related knowledge. If there were errors in alignment, this will cause subsequence failure NLP processes. The alignments produced when the training on word-aligned data are dramatically better than when training on sentence-aligned data. The main purpose of this system is to provide as part of translation machine in Myanmar-English machine translation. The proposed system is combination of corpus based approach and dictionary lookup approach. The corpus based approach is based on the first three IBM models. en_US
dc.language.iso en en_US
dc.publisher Fifth Local Conference on Parallel and Soft Computing en_US
dc.title Building Word-Aligned Bilingual Corpus for Statistical Myanmar-English Translation en_US
dc.type Article en_US

Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository


My Account