Finding the Number of Clusters in Unlabeled Datasets using Extended Dark Block Extraction

Srinivasulu Asadi; Dr Ch D V Subba Rao; V Saikrishna

Research Article

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 7 - Number 3

Year of Publication: 2010

Authors: Srinivasulu Asadi, Dr Ch D V Subba Rao, V Saikrishna

DOI: 10.5120/1148-1503

Srinivasulu Asadi, Dr Ch D V Subba Rao, V Saikrishna. Finding the Number of Clusters in Unlabeled Datasets using Extended Dark Block Extraction. International Journal of Computer Applications. 7, 3 (September 2010), 1-4. DOI=10.5120/1148-1503

@article{10.5120/1148-1503,
  author = { Srinivasulu Asadi and Dr Ch D V Subba Rao and V Saikrishna },
  title = { Finding the Number of Clusters in Unlabeled Datasets using Extended Dark Block Extraction },
  journal = { International Journal of Computer Applications },
  issue_date = { September 2010 },
  volume = { 7 },
  number = { 3 },
  month = { September },
  year = { 2010 },
  issn = { 0975-8887 },
  pages = { 1-4 },
  numpages = {9},
  url = { https://ijcaonline.org/archives/volume7/number3/1148-1503/ },
  doi = { 10.5120/1148-1503 },
  publisher = {Foundation of Computer Science (FCS), NY, USA},
  address = {New York, USA}
}

%0 Journal Article
%1 2024-02-06T19:55:55.924102+05:30
%A Srinivasulu Asadi
%A Dr Ch D V Subba Rao
%A V Saikrishna
%T Finding the Number of Clusters in Unlabeled Datasets using Extended Dark Block Extraction
%J International Journal of Computer Applications
%@ 0975-8887
%V 7
%N 3
%P 1-4
%D 2010
%I Foundation of Computer Science (FCS), NY, USA

Abstract

Cluster analysis is the problem of partitioning a set of objects O = {o1, …, on} into c self-similar subsets based on available data. In general, clustering unlabeled data poses three major problems: 1) assessing cluster tendency, i.e., how many clusters to seek; 2) partitioning the data into c meaningful groups; and 3) validating the c clusters that are discovered. We address the first problem, i.e., determining the number of clusters c prior to clustering. Many clustering algorithms require the number of clusters as an input parameter, so the quality of the resulting clusters depends largely on this value. Most existing methods for choosing c are post-clustering measures of cluster validity, i.e., they attempt to choose the best partition from a set of alternative partitions.
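To make the post-clustering approach mentioned in the abstract concrete, the following is a minimal sketch of choosing c by comparing alternative partitions. It is not the Extended Dark Block Extraction method of the paper. The clustering algorithm (plain k-means with a deterministic farthest-first seeding), the validity index (mean silhouette width), the function names, and the toy data are all assumptions made for illustration.

```python
import math


def farthest_first_centers(points, c):
    """Deterministic seeding (an assumption of this sketch): start at
    points[0], then repeatedly add the point farthest from the chosen centers."""
    centers = [points[0]]
    while len(centers) < c:
        centers.append(max(points, key=lambda p: min(math.dist(p, q) for q in centers)))
    return centers


def kmeans(points, c, iters=100):
    """Plain Lloyd's algorithm; returns one cluster label per point."""
    centers = farthest_first_centers(points, c)
    labels = None
    for _ in range(iters):
        # Assignment step: each point goes to its nearest center.
        new_labels = [min(range(c), key=lambda j: math.dist(p, centers[j])) for p in points]
        if new_labels == labels:
            break  # converged: no label changed
        labels = new_labels
        # Update step: move each center to the mean of its members.
        for j in range(c):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the old center if a cluster went empty
                centers[j] = tuple(sum(x) / len(members) for x in zip(*members))
    return labels


def mean_silhouette(points, labels):
    """Average silhouette width; higher means tighter, better-separated clusters.
    Assumes at least two non-empty clusters and no duplicate points."""
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    total = 0.0
    for p, lab in zip(points, labels):
        own = clusters[lab]
        if len(own) == 1:
            continue  # a singleton cluster contributes 0 by convention
        a = sum(math.dist(p, q) for q in own) / (len(own) - 1)
        b = min(
            sum(math.dist(p, q) for q in other) / len(other)
            for other_lab, other in clusters.items()
            if other_lab != lab
        )
        total += (b - a) / max(a, b)
    return total / len(points)


def choose_c(points, candidates):
    """Cluster once per candidate c and keep the c with the best validity score."""
    scores = {c: mean_silhouette(points, kmeans(points, c)) for c in candidates}
    return max(scores, key=scores.get), scores


# Toy data (hypothetical): three well-separated groups of four points each.
points = [
    (0, 0), (0, 1), (1, 0), (1, 1),
    (10, 0), (10, 1), (11, 0), (11, 1),
    (5, 9), (5, 10), (6, 9), (6, 10),
]
best_c, scores = choose_c(points, range(2, 6))
print(best_c, scores)
```

On this toy data the selection picks c = 3. Note that the data must be clustered once for every candidate value before c can be chosen, which is the cost that estimating c prior to clustering aims to avoid.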

Index Terms

Computer Science

Information Sciences

Keywords

Clustering, Cluster Tendency, Reordered Dissimilarity Image, VAT, C-Means Clustering