CFP last date
20 May 2024
Reseach Article

An Accurate Revelation of the Similarity between Clusters

by A. Veera Mahendra, S. M. Farooq
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 78 - Number 10
Year of Publication: 2013
Authors: A. Veera Mahendra, S. M. Farooq
10.5120/13525-1220

A. Veera Mahendra, S. M. Farooq . An Accurate Revelation of the Similarity between Clusters. International Journal of Computer Applications. 78, 10 ( September 2013), 16-20. DOI=10.5120/13525-1220

@article{ 10.5120/13525-1220,
author = { A. Veera Mahendra, S. M. Farooq },
title = { An Accurate Revelation of the Similarity between Clusters },
journal = { International Journal of Computer Applications },
issue_date = { September 2013 },
volume = { 78 },
number = { 10 },
month = { September },
year = { 2013 },
issn = { 0975-8887 },
pages = { 16-20 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume78/number10/13525-1220/ },
doi = { 10.5120/13525-1220 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:51:18.024109+05:30
%A A. Veera Mahendra
%A S. M. Farooq
%T An Accurate Revelation of the Similarity between Clusters
%J International Journal of Computer Applications
%@ 0975-8887
%V 78
%N 10
%P 16-20
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The structure of the data set playing a vital role in datamining. In concept of datamining information recovery and pattern identification nothing but data clustering. There are multiple clustering algorithms have been commenced to clustering categorical data. Unfortunately these algorithms created an incomplete information. In recent times cluster ensembles have come out as an essential solution to overcome these limitations and to get the excellence results for clustering. A Link-Based similarity measure is proposed to guess unknown values from a link network of clusters and bridges the gap among the task of data clustering and that link examination. It also improves the ability of ensemble methodology for categorical data. A new Link-Based cluster ensemble approach is commenced which is well-organized than the previous model, where a binary cluster association matrix, like matrix is used to create the cluster ensembles. These cluster ensembles have impurity information, to overcome these problem Link-Based similarity algorithm is used to generate an accurate pure clusters.

References
  1. A Link-Based Cluster Ensemble Approach for Categorical Data Clustering Natthakan Iam-On, Tossapon Boongoen, Simon Garrett, and Chris Price
  2. L. Kaufman and P. J. Rousseeuw, Finding Groups in Data: An Introduction to Cluster Analysis. Wiley Publishers, 1990.
  3. A. K. Jain and R. C. Dubes, Algorithms for Clustering. Prentice-Hall, 1998.
  4. P. Zhang, X. Wang, and P. X. Song, "Clustering Categorical Data Based on Distance Vectors," The J. Am. Statistical Assoc. , vol. 101, no. 473, pp. 355-367, 2006.
  5. S. Guha, R. Rastogi, and K. Shim, "ROCK: A Robust Clustering Algorithm for Categorical Attributes," Information Systems, vol. 25, no. 5, pp. 345-366, 2000.
  6. G. Jeh and J. Widom, "Simrank: A Measure of Structural-Context Similarity," Proc. ACM SIGKDD Int'l Conf. Knowledge Discovery and Data Mining (KDD), pp. 538-543, 2002.
  7. Z. Huang, "Extensions to the K-Means Algorithm for Clustering Large Data Sets with Categorical Values," Data Mining and Knowledge Discovery, vol. 2, pp. 283-304, 1998.
  8. C. Domeniconi and M. Al-Razgan, "Weighted Cluster Ensembles:Methods and Analysis," ACM Trans. Knowledge Discovery from Data, vol. 2, no. 4, pp. 1-40, 2009.
  9. X. Z. Fern and C. E. Brodley, "Solving Cluster Ensemble Problems by Bipartite Graph Partitioning," Proc. Int'l Conf. Machine Learning (ICML), pp. 36-43, 2004.
  10. A. Strehl and J. Ghosh, "Cluster Ensembles: A Knowledge Reuse Framework for Combining Multiple Partitions," J. Machine Learning Research, vol. 3, pp. 583-617, 2002.
  11. G. Karypis and V. Kumar, "Multilevel K-Way Partitioning Scheme for Irregular Graphs," J. Parallel Distributed Computing, vol. 48, no. 1, pp. 96-129, 1998.
  12. A. Ng, M. Jordan, and Y. Weiss, "On Spectral Clustering: Analysis and an Algorithm," Advances in Neural Information Processing Systems, vol. 14, pp. 849-856, 2001.
Index Terms

Computer Science
Information Sciences

Keywords

Data clustering Cluster ensemble Link-based similarity measure Data sets.