Nasser, Alissar (2019) Investigating K-means and Kernel K-means Algorithms with Internal Validity Indices for Cluster Identification. Journal of Advances in Mathematics and Computer Science, 30 (2). pp. 1-12. ISSN 24569968
Nasser3022018JAMCS45837.pdf - Published Version
Download (303kB)
Abstract
Clustering is an unsupervised method where the number of clusters is not known by users. Therefore, the outcomes of a clustering algorithm depend on the input number of clusters specified by users. Consequently it is very important to evaluate the result of the clustering algorithms according to the number of clusters and choose the one that optimize a certain criterion. We present in this paper several clustering validity indices used in the literature. Using several synthetic and real datasets, these indices are then compared based on clustering results provided by the well known k-means clustering algorithm and its non-linear version the kernel K-means algorithm. The results showed that none of the validity indices is superior to the others; in the other hand, the kernel k-means failed to improve clustering accuracy of the dataset from the number of clusters perspective.
Item Type: | Article |
---|---|
Subjects: | West Bengal Archive > Mathematical Science |
Depositing User: | Unnamed user with email support@westbengalarchive.com |
Date Deposited: | 06 Apr 2023 13:01 |
Last Modified: | 03 Sep 2024 05:21 |
URI: | http://article.stmacademicwriting.com/id/eprint/456 |