Call for Paper - January 2023 Edition
IJCA solicits original research papers for the January 2023 Edition. Last date of manuscript submission is December 20, 2022. Read More

Error Evaluation on K- Means and Hierarchical Clustering with Effect of Distance Functions for Iris Dataset

International Journal of Computer Applications
© 2014 by IJCA Journal
Volume 86 - Number 16
Year of Publication: 2014
Harish Kumar Sagar
Varsha Sharma

Harish Kumar Sagar and Varsha Sharma. Article: Error Evaluation on K- Means and Hierarchical Clustering with Effect of Distance Functions for Iris Dataset. International Journal of Computer Applications 86(16):1-5, January 2014. Full text available. BibTeX

	author = {Harish Kumar Sagar and Varsha Sharma},
	title = {Article: Error Evaluation on K- Means and Hierarchical Clustering with Effect of Distance Functions for Iris Dataset},
	journal = {International Journal of Computer Applications},
	year = {2014},
	volume = {86},
	number = {16},
	pages = {1-5},
	month = {January},
	note = {Full text available}


In Data clustering (a sub field of Data mining), k-means and hierarchical based clustering algorithms are popular due to its excellent performance in clustering of large data sets. This paper presents two different comparative studies which includes various Data Clustering algorithms for analyzing best one with minimum clustering error. The foremost objective of this paper is to divide the data objects into k number of different clusters with homogeneity and the each cluster should be heterogeneous to each other. However, these both algorithms (K-Mean and Hierarchical) are not free with the errors. In this paper, firstly various distance has been considered for these two algorithms for comparing and analyzing the best distance methods to solve the existing problems. .


  • Pham, D. T. , S. S. Dimov, and C. D. Nguyen. "Selection of K in K-means clustering. " Proceedings of the Institution of Mechanical Engineers, Part C: Journal of Mechanical Engineering Science 219. 1 (2005): 103-119
  • Awasthi, Rekha, Anil K. Tiwari, and Seema Pathak. "Empirical Evaluation on K Means Clustering with Effect of Distance Functions for Bank Dataset. " IJITR1. 3 (2013): 233-235.
  • Density-based clustering algorithms DBSCAN and SNN by Adriano Moreira, Maribel Y. Santos and Sofia Carneiro.
  • Kanungo, Tapas, et al. "An efficient k-means clustering algorithm: Analysis and implementation. " Pattern Analysis and Machine Intelligence, IEEE Transactions on 24. 7 (2002): 881-892.
  • J. L. Bentley, Multidimensional Binary Search Trees Used for Associative Searching.
  • Esteves, Rui Maximo, Rui Pais, and Chunming Rong. "K-means clustering in the cloud--a mahout test. " In Advanced Information Networking and Applications (WAINA), 2011 IEEE Workshops of International Conference on, pp. 514-519. IEEE, 2011.
  • F. Caoa et. al. , "An initialization method for the k-Means algorithm using neighborhood model", Computers and Mathematics with Applications, vol. 58, pp. 474 – 483, 2009.
  • Han, Jiawei, Kamber, Micheline. (2000) Data Mining: Concepts and Techniques. Morgan Kaufmann.
  • Euclidean Distance in http://people. revoledu. com /kardi/tutorial/Similarity/EuclideanDistance. html.
  • Euclidean distance in http://en. wikipedia. org/wiki/ Euclidean_distance#One-dimensional_distance