UCSY's Research Repository

A Cascaded Approach to Text Normalization for Email Data Cleaning

Show simple item record

dc.contributor.author Theint, Aye Aye
dc.contributor.author Tun, Myint Thu Zar
dc.date.accessioned 2019-08-06T03:58:19Z
dc.date.available 2019-08-06T03:58:19Z
dc.date.issued 2009-12-30
dc.identifier.uri http://onlineresource.ucsy.edu.mm/handle/123456789/1842
dc.description.abstract Email is one of the commonest modes of communication via text. By using email, people are sending and receiving many messages per day and communicating with partners and friends. Most of email data is very noisy. Thus, text normalization is the most popular and it is necessary to clean up email data. Text cleaning and normalization is a significant aspect in developing many text processing and information extraction applications in email data cleaning processes. Many text normalization applications need to take email as input. Text normalization has many methods to find the useful information. Among these methods, a Cascaded Approach is very suitable for cleaning email data. Our proposed system is to convert the canonical form from the “informally inputted” text by using text normalization. Moreover, this paper is to eliminate “noises” in the text and to detect paragraph and sentence boundaries in the text. en_US
dc.language.iso en en_US
dc.publisher Fourth Local Conference on Parallel and Soft Computing en_US
dc.subject text normalization en_US
dc.subject email data cleaning en_US
dc.subject information extraction en_US
dc.subject canonical form en_US
dc.title A Cascaded Approach to Text Normalization for Email Data Cleaning en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository



Browse

My Account

Statistics