dc.contributor.author |
Fam, Rashel
|
|
dc.contributor.author |
Purwarianti, Ayu
|
|
dc.contributor.author |
Lepage, Yves
|
|
dc.date.accessioned |
2019-07-04T04:25:50Z |
|
dc.date.available |
2019-07-04T04:25:50Z |
|
dc.date.issued |
2018-02-22 |
|
dc.identifier.uri |
http://onlineresource.ucsy.edu.mm/handle/123456789/421 |
|
dc.description |
This work was supported by grant number
15K00317 from the Japanese Society for the
Promotion of Science (JSPS): ‘Language
productivity: fast extraction of productive analogical
clusters and their evaluation using statistical machine
translation.’ |
en_US |
dc.description.abstract |
The vocabulary of a natural language
processing (NLP) system is usually limited by the
word forms learnt by the system in the preliminary
step, for example, word forms seen in the training
corpus. Thus, out-of-vocabulary (OOV) problem is
an important issue in NLP. In this paper, we study
the plausibility of unseen word forms generated from
analogical grids on Indonesian, a language known
for its richness in derivational morphology. We
construct analogical grids from a list of word forms
contained in an annotated Indonesian corpus. We
generate new word forms by filling the empty cells in
the analogical grids. We verify these generated word
forms using morphological analyzer and count how
many of them are valid Indonesian word forms. |
en_US |
dc.language.iso |
en |
en_US |
dc.publisher |
Sixteenth International Conferences on Computer Applications(ICCA 2018) |
en_US |
dc.title |
Plausibility of Word Forms Generated from Analogical Grids in Indonesian |
en_US |
dc.type |
Article |
en_US |