UCSY's Research Repository

English-Myanmar (Burmese) Phrase-Based SMT with One-to-One and One-to-Multiple Translations Corpora

Show simple item record

dc.contributor.author Htun, Honey
dc.contributor.author Thu, Ye Kyaw
dc.contributor.author Oo, Nyein Nyein
dc.contributor.author Supnithi, Thepchai
dc.date.accessioned 2021-01-31T10:44:00Z
dc.date.available 2021-01-31T10:44:00Z
dc.date.issued 2020-02-28
dc.identifier.uri https://onlineresource.ucsy.edu.mm/handle/123456789/2556
dc.description.abstract This paper contributes the first investigation of machine translation (MT) performance differences between Myanmar and English languages with the use of several possible Myanmar translations for the specific primary educational domain. We also developed both one-to-one and many Myanmar translations corpora (over 8K and 46K sentences) based on old and new English textbooks (including Grade 1 to 3) which are published by the Ministry of Education. Our developing parallel corpora were used for phrase-based statistical machine translation (PBSMT) which is the de facto standard of statistical machine translation. We measured machine translation performance differences among one-tomany English to Myanmar translation corpora. The differences range between 19.68 and 52.38 BLEU scores from English to Myanmar and between 50.17 and 75.12 BLEU scores from Myanmar to English translation. We expect this study can be applied in Myanmar-to-English automatic speech recognition (ASR) development for primary English textbooks. The main purpose is to translate primary English textbooks data correctly even if the children use in several Myanmar conversation styles. en_US
dc.language.iso en en_US
dc.publisher Proceedings of the Eighteenth International Conference On Computer Applications (ICCA 2020) en_US
dc.subject Phrase-based Statistical Machine Translation (PBSMT) en_US
dc.subject One-to-Many Parallel Corpus en_US
dc.subject Myanmar-English Machine Translation en_US
dc.subject Primary English Textbooks of Myanmar en_US
dc.subject Word Error Rate (WER) en_US
dc.title English-Myanmar (Burmese) Phrase-Based SMT with One-to-One and One-to-Multiple Translations Corpora en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository



Browse

My Account

Statistics