dc.description.abstract |
This paper proposes a system that converts Myanmar Portable Document Format (PDF) files into machine-editable Word documents while preserving the original formatting. It uses Myanmar Intelligent Character Recognition (MICR) to recognize characters. MICR is a kind of Intelligent Character Recognition (ICR) system based on statistical and semantic information about the characters. This information is obtained by measuring features such as the width-to-height ratio, black stroke counts, number of loops, open directions, and histogram values. The final decision is made by a voting system. MICR has been successfully applied in car license plate recognition, speed-limit road sign recognition, voucher recognition, digit recognition, and online handwritten Myanmar combined-word recognition. The main idea of this paper is to format the output page like the original PDF document, including alignment (left, right, center), bold, italic, and underlined text, color, and pictures. |
en_US |