Lexicalized HMM-based Part-of-Speech Tagger for Myanmar Language

Myint, Phyu Hninn; Htwe, Tin Myat; Thein, Ni Lar

UCSYRR Home
/
Conferences
/
International Conference on Computer Applications (ICCA)
/
Tenth International Conference On Computer Applications (ICCA 2012)
/
View Item

dc.contributor.author	Myint, Phyu Hninn
dc.contributor.author	Htwe, Tin Myat
dc.contributor.author	Thein, Ni Lar
dc.date.accessioned	2019-09-25T14:12:20Z
dc.date.available	2019-09-25T14:12:20Z
dc.date.issued	2012-02-28
dc.identifier.uri	http://onlineresource.ucsy.edu.mm/handle/123456789/2281
dc.description.abstract	Part-of-speech (POS) tagging is the process of assigning the part-of-speech tag or other lexical class marker to each and every word in a sentence. In many Natural Language Processing (NLP) applications such as word sense disambiguation, information retrieval, information processing, parsing, question answering, and machine translation, POS tagging is considered as the one of the basic necessary tools. Identifying the ambiguities in lexical items is the challenging objective in the process of developing an efficient and accurate POS Tagger. This paper proposes the developments for POS-tagger and POS-tagset of Myanmar language, which is very essential computational linguistic tool needed for many natural language processing (NLP) applications. Since most previous works for HMM-based tagging consider only part-of-speech information in contexts, their models cannot utilize lexical information which is crucial for resolving some morphological ambiguity. In this paper, a simple method to build Lexicalized Hidden Markov Models (L-HMMs) is introduced for improving the precision of part-of-speech tagging in Myanmar language. In experiments, lexicalized models achieve higher accuracy than non-lexicalized models.	en_US
dc.language.iso	en_US	en_US
dc.publisher	Tenth International Conference On Computer Applications (ICCA 2012)	en_US
dc.title	Lexicalized HMM-based Part-of-Speech Tagger for Myanmar Language	en_US
dc.type	Article	en_US