UCSY's Research Repository

Dependency Head Annotation for Myanmar Dependency Treebank


dc.contributor.author Aye, Hnin Thu Zar
dc.contributor.author Pa, Win Pa
dc.date.accessioned 2020-12-30T05:58:07Z
dc.date.available 2020-12-30T05:58:07Z
dc.date.issued 2020-11
dc.identifier.issn 2415-6698
dc.identifier.uri https://onlineresource.ucsy.edu.mm/handle/123456789/2545
dc.description.abstract Fully manual annotation of a dependency treebank requires resources such as annotators and annotation tools, takes a long time, and carries a high risk of inconsistent annotations for free word order languages such as Myanmar. This paper describes a dependency head annotation scheme for a Myanmar dependency treebank based on Universal part-of-speech tags and Universal Dependencies. So far, 22,810 sentences and 680,218 tokens from three corpora have been annotated for the treebank. Some language-specific issues are described with examples. Raw syntactic structures were first annotated automatically by UDPipe according to Universal Dependencies, using the Universal part-of-speech tag scheme. The automatically annotated dependency head structures were then corrected manually in post-processing. To make post-processing faster and more reliable and to reduce errors in manual correction, selected sentences were added to the training data after being corrected. The model was then retrained, and the remaining sentences were parsed by UDPipe. Post-processing was repeated until all sentences had been corrected. Some specifications of the dependency annotation scheme for sentences encountered in post-processing are presented with examples. To assess parsing performance on the annotated data, cross-validation tests and parsing experiments were performed. The annotated treebank data were also evaluated with the CoNLL 2017 evaluation script. The results of the parsing experiments and evaluation are reported as unlabeled and labeled attachment scores, and they indicate that the proposed method is a suitable way to build Myanmar dependency trees. In addition, the syntactic structures of the treebank are analyzed and the resulting syntactic information is presented. As far as we know, this is the first work on dependency head annotation for a Myanmar dependency treebank. en_US
dc.language.iso en en_US
dc.publisher Special Issue on Multidisciplinary Sciences and Engineering in Advances in Science, Technology and Engineering Systems Journal en_US
dc.relation.ispartofseries Volume 5, Issue 6; pp. 788-800
dc.subject Dependency head en_US
dc.subject Universal Dependencies en_US
dc.subject Treebank en_US
dc.subject Annotation schemes en_US
dc.title Dependency Head Annotation for Myanmar Dependency Treebank en_US
dc.type Article en_US
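
The abstract reports parsing quality as unlabeled and labeled attachment scores (UAS and LAS). The sketch below illustrates how these two scores are commonly defined; it is not the CoNLL 2017 evaluation script the authors used, which also handles tokenization alignment. It assumes gold and predicted parses cover the same tokens and that each token is given as a hypothetical `(head, deprel)` pair; the function name `attachment_scores` and the sample data are illustrative only.

```python
from typing import List, Tuple

# One arc per token: (index of the head token, dependency relation label).
# Head index 0 denotes the artificial root, as in CoNLL-U.
Arc = Tuple[int, str]


def attachment_scores(gold: List[Arc], pred: List[Arc]) -> Tuple[float, float]:
    """Return (UAS, LAS) for two parses over the same token sequence.

    UAS counts tokens whose predicted head matches the gold head.
    LAS additionally requires the relation label to match.
    """
    if len(gold) != len(pred):
        raise ValueError("gold and predicted parses must have the same length")
    if not gold:
        raise ValueError("cannot score an empty parse")

    total = len(gold)
    head_hits = sum(g[0] == p[0] for g, p in zip(gold, pred))
    labeled_hits = sum(g == p for g, p in zip(gold, pred))
    return head_hits / total, labeled_hits / total


# Invented three-token example: every head is correct, one label is wrong.
gold = [(2, "nsubj"), (0, "root"), (2, "obj")]
pred = [(2, "nsubj"), (0, "root"), (2, "obl")]
uas, las = attachment_scores(gold, pred)
print(f"UAS = {uas:.3f}, LAS = {las:.3f}")
```

In this toy case all three heads agree while one label differs, so LAS is lower than UAS; LAS can never exceed UAS, because every labeled match is also a head match.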

