Abstract:
In Natural Language Processing (NLP), machine
translation (MT) is the application of computers to
the task of translating texts from one natural
language to another. In the Myanmar and Pa-Oh
languages, word boundaries are not easy to identify
because spaces do not reliably separate words.
This work proposes a Pa-Oh to Myanmar language
translation framework. Word tokenization plays a
vital role in most NLP tasks, and syllabification is
also an important task for Pa-Oh. Working directly
with characters does not help, so it is useful to
syllabify texts first. The
first step is to enter the Pa-Oh words. The system
then syllabifies the input by looking up syllable
files. After the input is tokenized into words, each
word is quickly checked against a word
list/dictionary. Finally, the system displays the
correct Myanmar currency words that have the same
meaning as the Pa-Oh input.
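
The pipeline described above (syllabify, tokenize, look up) can be illustrated with a minimal sketch. It assumes greedy longest-match at both the syllable and the word level, which the abstract does not specify, and it uses placeholder Latin strings rather than real Pa-Oh or Myanmar entries. The names `syllabify`, `tokenize`, `translate`, `SYLLABLES`, and `DICTIONARY` are hypothetical and do not come from the original system.

```python
def syllabify(text, syllables):
    """Split text into syllables by greedy longest match (an assumed strategy)."""
    max_len = max(map(len, syllables))
    out, i = [], 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if piece in syllables:
                out.append(piece)
                i += size
                break
        else:
            # Unknown character: emit it alone so the scan always advances.
            out.append(text[i])
            i += 1
    return out


def tokenize(sylls, dictionary, max_word_sylls=4):
    """Group syllables into the longest words found in the dictionary."""
    words, i = [], 0
    while i < len(sylls):
        for n in range(min(max_word_sylls, len(sylls) - i), 0, -1):
            cand = "".join(sylls[i:i + n])
            # A single syllable is accepted even if it is not in the dictionary.
            if n == 1 or cand in dictionary:
                words.append(cand)
                i += n
                break
    return words


def translate(text, syllables, dictionary):
    """Syllabify, tokenize, then map each word to its dictionary entry."""
    words = tokenize(syllabify(text, syllables), dictionary)
    # Words missing from the dictionary are passed through unchanged.
    return [dictionary.get(w, w) for w in words]


# Placeholder data standing in for the syllable file and the word list.
SYLLABLES = {"ka", "la", "ma"}
DICTIONARY = {"kala": "W1", "ma": "W2"}

result = translate("kalama", SYLLABLES, DICTIONARY)
```

Storing the word list as a hash map keeps each lookup close to constant time, which is one plausible way to meet the "quickly checked" requirement. A real system would load the syllable and dictionary files from disk and would need a policy for ambiguous segmentations.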