UCSY's Research Repository

Topic Maps Extraction on Focused Web Pages By Clustering with Web Structure and Contents

Show simple item record

dc.contributor.author Rar, Sa Mie
dc.contributor.author Khaing, Myo Kay
dc.date.accessioned 2019-07-19T14:19:19Z
dc.date.available 2019-07-19T14:19:19Z
dc.date.issued 2017-12-27
dc.identifier.uri http://onlineresource.ucsy.edu.mm/handle/123456789/1089
dc.description.abstract Web mining is the use of data mining techniques to automatically discover and extract information from Web documents and services. This area of research is so huge today partly due to the interests of various research communities, the tremendous growth of information sources available on the Web and the recent interest in ecommerce. Web mining is often associated with IR and IE.Web page clustering is one of the major preprocessing steps in web mining analysis.In this paper, clustering method is proposed to semiautomatically extract Topic Maps from a set of web pages. Firstly, the web pages are downloaded from the internet. To extract contents, remove stopwords and stem. And Second calculate the TF_IDF, content similarity, link similarity. Finally, calculating the Newman’s method with the weighting based on the similarities by contents of web pages and types of links is applied to develop the potential clusters. Then the system generates the topic map by assuming the clusters as topics, the edges as associations, the web pages related to the topic as occurrences from the result of clustering. As the experimental result, more compact and denser cluster. en_US
dc.language.iso en en_US
dc.publisher Eighth Local Conference on Parallel and Soft Computing en_US
dc.title Topic Maps Extraction on Focused Web Pages By Clustering with Web Structure and Contents en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository



Browse

My Account

Statistics