UCSY's Research Repository

Myanmar Web Pages Crawler


dc.contributor.author Khine, Su Mon
dc.contributor.author Thein, Yadana
dc.date.accessioned 2019-08-13T15:23:58Z
dc.date.available 2019-08-13T15:23:58Z
dc.date.issued 2015
dc.identifier.uri http://onlineresource.ucsy.edu.mm/handle/123456789/2131
dc.description.abstract Nowadays, web pages are written in many languages, and web crawlers are important components of search engines. Language-specific crawlers traverse and collect relevant web pages by following the successive URLs found in each page. There has been very little research on crawling Myanmar-language web sites. Most language-specific crawlers are based on n-gram character sequences, which require training documents; the proposed crawler differs from those crawlers. The proposed system focuses only on the crawler component, which searches for and retrieves Myanmar web pages for a Myanmar-language search engine. The proposed crawler detects Myanmar characters and uses a rule-based syllable threshold to judge the relevance of pages. According to experimental results, the proposed crawler performs better, achieves good accuracy, and reduces the storage space required by search engines, since it crawls only the relevant documents from Myanmar web sites. en_US
dc.language.iso en en_US
dc.publisher International Conference on Natural Language Processing en_US
dc.subject Language specific crawler en_US
dc.subject Myanmar Language en_US
dc.subject rule base syllable segmentation en_US
dc.title Myanmar Web Pages Crawler en_US
dc.type Article en_US
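
The abstract describes detecting Myanmar characters and applying a rule-based syllable threshold to decide whether a page is relevant. The record does not give the authors' rules or threshold, so the following is only a minimal sketch under stated assumptions: it treats any code point in the Unicode Myanmar block (U+1000 to U+109F) as a Myanmar character, approximates a syllable start as a consonant that is neither stacked nor a final, and uses a hypothetical threshold of 5 syllables. The names `count_myanmar_syllables`, `is_myanmar_page`, and `syllable_threshold` are illustrative and do not come from the paper.

```python
import re

# Any character in the Unicode Myanmar block.
MYANMAR_CHAR = re.compile(r"[\u1000-\u109F]")

# Simplified rule (an assumption, not the paper's rule set):
# a syllable starts at a consonant (U+1000..U+1021) that is
#   - not stacked under a preceding virama (U+1039), and
#   - not a final, i.e. not followed by asat (U+103A) or virama (U+1039).
SYLLABLE_START = re.compile(r"(?<!\u1039)[\u1000-\u1021](?![\u103A\u1039])")


def count_myanmar_syllables(text: str) -> int:
    """Approximate the number of Myanmar syllables in the text."""
    return len(SYLLABLE_START.findall(text))


def is_myanmar_page(text: str, syllable_threshold: int = 5) -> bool:
    """Judge a page relevant if it holds enough Myanmar syllables.

    The default threshold is a hypothetical value chosen for illustration.
    """
    if not MYANMAR_CHAR.search(text):
        return False  # no Myanmar characters at all
    return count_myanmar_syllables(text) >= syllable_threshold
```

A crawler built along these lines would call `is_myanmar_page` on the extracted text of each fetched page and store or follow only the pages that pass, which is how skipping irrelevant pages could reduce storage.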

