UCSY's Research Repository

Text Normalization and Classification System for Internet Forum

Show simple item record

dc.contributor.author Htet, Ya Min
dc.contributor.author Nyunt, Thi Thi Soe
dc.date.accessioned 2019-07-25T04:11:39Z
dc.date.available 2019-07-25T04:11:39Z
dc.date.issued 2010-12-16
dc.identifier.uri http://onlineresource.ucsy.edu.mm/handle/123456789/1255
dc.description.abstract Internet forum is one of the most common modes of knowledge sharing through text. An internet forum is an online discussion site. From a technical point of view, forums are web applications managing user-generated text contents. Text normalization is converting „informally inputted‟ text into the canonical form, by eliminating „noises‟ in the text and detecting paragraph and sentence boundaries in the text and take case restoration and suggest valid words for each invalid word in the text by using dictionary. Text classification is the process of grouping text item into related predefined classes or categories to make it easier for the user to find it. The system intends to normalize and classify the internet forum. For text normalization, Cascaded approach is used and for classification Naïve Bayes (NB) method is used. In the system, hold out method is used to evaluate the system‟s performance. en_US
dc.language.iso en en_US
dc.publisher Fifth Local Conference on Parallel and Soft Computing en_US
dc.title Text Normalization and Classification System for Internet Forum en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository



Browse

My Account

Statistics