UCSY's Research Repository

Page Segmentation and Document Layout Analysis for Scanned Image by using Smearing Algorithm

Show simple item record

dc.contributor.author Htun, Nay Win
dc.contributor.author Ko, Lin Min
dc.date.accessioned 2019-07-22T08:09:07Z
dc.date.available 2019-07-22T08:09:07Z
dc.date.issued 2010-12-16
dc.identifier.uri http://onlineresource.ucsy.edu.mm/handle/123456789/1173
dc.description.abstract This paper presents a feature-based system which utilizes domain knowledge to segment and classify scanned image documents. Documents usually consists of a mixture of text and image. Text block possesses an interesting property that the x-profile or y-profile of text block is a periodic pattern. Image block possesses generate the connectivity histogram by summing the number of dark pixels with the same connectivity value. Initially, one-scan run-length smearing algorithm (RLSA) with block merging is proposed to segment the document. After segmentation process, the next task is to classify the segmented block. The classification task is then performed based on the rules induced from the features or primitives associated with each document. In this system, proper use of domain knowledge is proved to be effective in accelerating the segmentation speed and decreasing the classification error. en_US
dc.language.iso en en_US
dc.publisher Fifth Local Conference on Parallel and Soft Computing en_US
dc.subject one-scan run-length smearing en_US
dc.subject block merging en_US
dc.subject connectivity histogram en_US
dc.subject text block en_US
dc.subject image block en_US
dc.title Page Segmentation and Document Layout Analysis for Scanned Image by using Smearing Algorithm en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository



Browse

My Account

Statistics