UCSY's Research Repository

Ordering URL for Focused Web Crawlers using Effective Prioritization

Show simple item record

dc.contributor.author Min, Nandar Win
dc.contributor.author Hlaing, Aye Nandar
dc.date.accessioned 2019-07-04T03:38:43Z
dc.date.available 2019-07-04T03:38:43Z
dc.date.issued 2012-02-28
dc.identifier.uri http://onlineresource.ucsy.edu.mm/handle/123456789/395
dc.description.abstract Obtaining important pages rapidly is very useful when a crawler cannot visit the entire Web in a reasonable amount of time. One approach is using focused crawler because it tries to download only pages with pre-defined topic to avoid the irrelevant web documents and reduce network traffic. It can also minimize the overall number of downloaded Web pages for processing and maximize the percentage of relevant pages. In this paper, we present in what order a focused crawler should visit the URLs it has seen, in order to obtain more “important” pages first. During crawling,Naive Bayes Classifier with four feature representations is used to enhancecorrectness of a specific topic. To provide sorting URLs, we use the Priority equation that gives every page a score. en_US
dc.language.iso en en_US
dc.publisher Tenth International Conference On Computer Applications (ICCA 2012) en_US
dc.subject focused crawler en_US
dc.subject learningfocused crawler en_US
dc.subject Naive Bayes classifier en_US
dc.subject similarity space model en_US
dc.title Ordering URL for Focused Web Crawlers using Effective Prioritization en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository



Browse

My Account

Statistics