Ordering URL for Focused Web Crawlers using Effective Prioritization

dc.contributor.author	Min, Nandar Win
dc.contributor.author	Hlaing, Aye Nandar
dc.date.accessioned	2019-07-04T03:38:43Z
dc.date.available	2019-07-04T03:38:43Z
dc.date.issued	2012-02-28
dc.identifier.uri	http://onlineresource.ucsy.edu.mm/handle/123456789/395
dc.description.abstract	Obtaining important pages rapidly is very useful when a crawler cannot visit the entire Web in a reasonable amount of time. One approach is using focused crawler because it tries to download only pages with pre-defined topic to avoid the irrelevant web documents and reduce network traffic. It can also minimize the overall number of downloaded Web pages for processing and maximize the percentage of relevant pages. In this paper, we present in what order a focused crawler should visit the URLs it has seen, in order to obtain more “important” pages first. During crawling,Naive Bayes Classifier with four feature representations is used to enhancecorrectness of a specific topic. To provide sorting URLs, we use the Priority equation that gives every page a score.	en_US
dc.language.iso	en	en_US
dc.publisher	Tenth International Conference On Computer Applications (ICCA 2012)	en_US
dc.subject	focused crawler	en_US
dc.subject	learningfocused crawler	en_US
dc.subject	Naive Bayes classifier	en_US
dc.subject	similarity space model	en_US
dc.title	Ordering URL for Focused Web Crawlers using Effective Prioritization	en_US
dc.type	Article	en_US