Grapheme Cluster Segmentation Tool for Myanmar Text Based on Positional Prediction Text Input Concept

Thu, Ye Kyaw; URANO, Yoshiyori

Conference: Ninth International Conference on Computer Applications (ICCA 2011)

URI: http://onlineresource.ucsy.edu.mm/handle/123456789/276

Date: 2011-05-05

Abstract:

We present a grapheme cluster segmentation tool for Myanmar text based on our proposed Positional Prediction text input concept. The motivation of this research is to build a Positional Prediction database of Myanmar consonants from existing Myanmar electronic documents such as PDF e-books and Microsoft Word documents. In this paper, we introduce the segmentation rule and the implementation process of Positional Prediction combination pattern segmentation and character segmentation. We also present the difficulties of encoding conversion from old ASCII-based fonts to Unicode fonts, the development process, and initial study results obtained from the content of a 62-page Myanmar e-book. This grapheme cluster segmentation approach is useful not only for creating a Positional Prediction database but also for statistical analysis of the distributions of characters and Positional Prediction patterns in the Myanmar language.
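
As a rough illustration of what grapheme cluster segmentation of Unicode Myanmar text can look like, the sketch below groups each base character with the combining marks that follow it and then counts cluster frequencies. It is not the segmentation rule from the paper, which the abstract does not spell out. The function name `segment_clusters` and the grouping rule are assumptions made for this example: a new cluster starts at every non-mark character, except that a letter following MYANMAR SIGN VIRAMA (U+1039) is treated as a stacked consonant and kept in the current cluster.

```python
import unicodedata
from collections import Counter

VIRAMA = "\u1039"  # MYANMAR SIGN VIRAMA: stacks the consonant that follows it


def segment_clusters(text):
    """Split text into approximate grapheme clusters (illustrative rule only)."""
    clusters = []
    for ch in text:
        category = unicodedata.category(ch)
        # Combining marks (Mn, Mc, Me) attach to the preceding base character.
        is_mark = category.startswith("M")
        # A letter directly after a virama is a stacked (subjoined) consonant.
        is_stacked = (
            bool(clusters)
            and clusters[-1].endswith(VIRAMA)
            and category.startswith("L")
        )
        if clusters and (is_mark or is_stacked):
            clusters[-1] += ch
        else:
            clusters.append(ch)
    return clusters


# The word "မြန်မာ" written with code point escapes.
word = "\u1019\u103c\u1014\u103a\u1019\u102c"
clusters = segment_clusters(word)
print(clusters)           # three clusters: မြ, န်, မာ
print(Counter(clusters))  # frequency of each cluster
```

Counting clusters with `Counter` over a whole document gives the kind of distribution statistics the abstract mentions. This simplified rule assumes the input is already in Unicode encoding; text in legacy ASCII-based fonts would need conversion first.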
