CFP last date
20 May 2024
Reseach Article

Finding the Number of Clusters in Unlabeled Datasets using Extended Dark Block Extraction

by Srinivasulu Asadi, Dr Ch D V Subba Rao, V Saikrishna
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 7 - Number 3
Year of Publication: 2010
Authors: Srinivasulu Asadi, Dr Ch D V Subba Rao, V Saikrishna
10.5120/1148-1503

Srinivasulu Asadi, Dr Ch D V Subba Rao, V Saikrishna . Finding the Number of Clusters in Unlabeled Datasets using Extended Dark Block Extraction. International Journal of Computer Applications. 7, 3 ( September 2010), 1-4. DOI=10.5120/1148-1503

@article{ 10.5120/1148-1503,
author = { Srinivasulu Asadi, Dr Ch D V Subba Rao, V Saikrishna },
title = { Finding the Number of Clusters in Unlabeled Datasets using Extended Dark Block Extraction },
journal = { International Journal of Computer Applications },
issue_date = { September 2010 },
volume = { 7 },
number = { 3 },
month = { September },
year = { 2010 },
issn = { 0975-8887 },
pages = { 1-4 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume7/number3/1148-1503/ },
doi = { 10.5120/1148-1503 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T19:55:55.924102+05:30
%A Srinivasulu Asadi
%A Dr Ch D V Subba Rao
%A V Saikrishna
%T Finding the Number of Clusters in Unlabeled Datasets using Extended Dark Block Extraction
%J International Journal of Computer Applications
%@ 0975-8887
%V 7
%N 3
%P 1-4
%D 2010
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Clustering analysis is the problem of partitioning a set of objects O = {o1… on} into c self-similar subsets based on available data. In general, clustering of unlabeled data poses three major problems: 1) assessing cluster tendency, i.e., how many clusters to seek? 2) Partitioning the data into c meaningful groups, and 3) validating the c clusters that are discovered. We address the first problem, i.e., determining the number of clusters c prior to clustering. Many clustering algorithms require number of clusters as an input parameter, so the quality of the clusters mainly depends on this value. Most methods are post clustering measures of cluster validity i.e., they attempt to choose the best partition from a set of alternative partitions.

References
  1. R.F. Ling, Comm. ACM, vol. 16, pp. 355-361, 1973, “A Computer Generated Aid for Cluster Analysis,”
  2. J.C. Bezdek and R. Hathaway,” Proc. Int’l Joint Conf. Neural Networks (IJCNN ’02), pp. 2225-2230, 2002,
  3. J. Huband, J.C. Bezdek, and R. Hathaway, Pattern Recognition, vol. 38, no. 11, pp. 1875-1886, 2005, “bigVAT: Visual Assessment of Cluster Tendency for Large Data Sets”.
  4. R. Hathaway, J.C. Bezdek, and J. Huband, Pattern Recognition, vol. 39, pp. 1315-1324, 2006, “Scalable Visual Assessment of Cluster Tendency”.
  5. W.S. Cleveland, Visualizing Data. Hobart Press, 1993. J.C. Bezdek, R.J. Hathaway, and J. Huband, IEEE Trans. Fuzzy Systems, vol. 15, no. 5, pp. 890-903, 2007, “Visual Assessment of Clustering Tendency for Rectangular Dissimilarity Matrices”.
  6. R.C. Gonzalez and R.E. Woods, Prentice Hall, 2002, Digital Image Processing.
  7. I. Dhillon, D. Modha, and W. Spangler, Proc. 30th Symp. Interface: Computing Science and Statistics, 1998, “Visualizing Class Structure of Multidimensional Data”.
  8. R.F. Ling, Comm. ACM, vol. 16, pp. 355-361, 1973, “A Computer Generated Aid for Cluster Analysis”.
  9. T. Tran-Luu, PhD dissertation, Univ. of Maryland, College Park, 1996, “Mathematical Concepts and Novel Heuristic Methods for Data Clustering and Visualization”.
  10. J.C. Bezdek and R. Hathaway, Proc. Int’l Joint Conf. Neural Networks (IJCNN ’02), pp. 2225-2230, 2002, “VAT: A Tool for Visual Assessment of (Cluster) Tendency”.
  11. J. Huband, J.C. Bezdek, and R. Hathaway, Pattern Recognition, vol. 38, no. 11, pp. 1875-1886, 2005, “bigVAT: Visual Assessment of Cluster Tendency for Large Data Sets”.
  12. Liang Wang, Christopher Leckie, Kotagiri Ramamohanarao, and James Bezdek, Fellow, IEEE-MARCH 2009, Automatically Determining the Number of Clusters in Unlabeled Data Sets.
Index Terms

Computer Science
Information Sciences

Keywords

Clustering Cluster Tendency Reordered Dissimilarity Image VAT C-Means Clustering