Documents Clustering using Partitional Clustering Methods

Wai, Khin Myo; San, Khin Cho

UCSYRR Home
/
Conferences
/
Local Conference on Parallel and Soft Computing
/
Fourth Local Conference on Parallel and Soft Computing
/
View Item

dc.contributor.author	Wai, Khin Myo
dc.contributor.author	San, Khin Cho
dc.date.accessioned	2019-07-29T07:12:39Z
dc.date.available	2019-07-29T07:12:39Z
dc.date.issued	2009-12-30
dc.identifier.uri	http://onlineresource.ucsy.edu.mm/handle/123456789/1455
dc.description.abstract	Document clustering is text processing that groups documents with similar concept. Clustering is defined as a process of partitioning a set of objects (patterns) into a set disjoined group (clusters). Its goal is to reduce the amount of data by categorizing or grouping similar data items together and obtain useful information. Clustering methods can be divided into two basic types: hierarchical and partitional clustering. This system used two partitional clustering methods. They are Self-Organizing Map (SOM) and K-Means. Self-Organization Maps is an artificial neural network model that is well suited for mapping high dimensional data into a two-dimensional representation space. SOM clustering is one of the well-known unsupervised clustering techniques. The goal of K-Means is to find k points of a dataset that can best represent the dataset in a certain mathematical sense (to be detailed later). These k points are also known as cluster centers, prototypes, centroids, or code words, and so on. The most known class of partitioned clustering algorithms is the K-Means algorithm and its variants. In this paper, documents are clustered by SOM algorithm how these are related to each other and K-Means start by randomly selecting k point cluster means; then assigns each document to its nearest cluster mean.	en_US
dc.language.iso	en	en_US
dc.publisher	Fourth Local Conference on Parallel and Soft Computing	en_US
dc.title	Documents Clustering using Partitional Clustering Methods	en_US
dc.type	Article	en_US