UCSY's Research Repository

Myanmar Word Segmentation using Hybrid Approach

dc.contributor.author Myat, Khine Myint
dc.date.accessioned 2019-09-23T04:55:43Z
dc.date.available 2019-09-23T04:55:43Z
dc.date.issued 2018-05
dc.identifier.uri http://onlineresource.ucsy.edu.mm/handle/123456789/2247
dc.description.abstract Word segmentation is a basic task and an important problem in natural language processing. In the Myanmar language, words consist of one or more syllables and are usually not separated by white space. Myanmar word segmentation is the task of determining word boundaries in text whose orthography has no word separators. This system uses a two-step longest-matching approach: the first step is syllable segmentation, and the second applies a hybrid of left-to-right syllable maximum matching and hierarchical expectation maximization. The system is intended for use as a pre-processing tool in Myanmar text processing tasks such as machine translation, information retrieval, and search engines. The experimental results show 93% accuracy on a collection of 300 articles (nearly 35,000 words) from the business, entertainment, and sports sections of the Myanmar newspaper. The proposed word segmentation is implemented as a web-based tool in C# .NET. en_US
dc.language.iso en_US en_US
dc.publisher University of Computer Studies, Yangon en_US
dc.title Myanmar Word Segmentation using Hybrid Approach en_US
dc.type Thesis en_US
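
The left-to-right syllable maximum matching step named in the abstract can be illustrated with a minimal sketch. This is not the thesis implementation (which is written in C# .NET): it is written in Python for brevity, it assumes the input has already been split into syllables, and it leaves out the hierarchical expectation maximization step entirely. The function name `max_match`, the `max_len` limit, and the romanized placeholder syllables and lexicon are all hypothetical, chosen only for illustration.

```python
def max_match(syllables, lexicon, max_len=4):
    """Greedy left-to-right longest matching over a syllable sequence.

    syllables: list of syllable strings (assumed already segmented).
    lexicon:   set of tuples of syllables, one tuple per known word.
    max_len:   longest word, in syllables, to try (an assumed limit).
    """
    words = []
    i = 0
    while i < len(syllables):
        # Try the longest candidate first and shrink until the lexicon matches.
        for n in range(min(max_len, len(syllables) - i), 0, -1):
            candidate = tuple(syllables[i:i + n])
            # A single syllable is always accepted as a fallback word.
            if n == 1 or candidate in lexicon:
                words.append("".join(candidate))
                i += n
                break
    return words


# Hypothetical romanized syllables stand in for real Myanmar script.
lexicon = {("myan", "mar"), ("sa", "ga")}
syllables = ["myan", "mar", "sa", "ga", "pyaw"]
print(max_match(syllables, lexicon))
```

Because the match is greedy, an early long match can hide a better segmentation later in the sentence, which is one common motivation for combining dictionary matching with a statistical model.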

