CFP last date
20 May 2024
Reseach Article

A Novel Design Specification Distance (DSD) based K-Mean Clustering Performance Evaluation on Engineering Materials' Database

by Doreswamy, Hemanth. K. S
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 55 - Number 15
Year of Publication: 2012
Authors: Doreswamy, Hemanth. K. S
10.5120/8832-3043

Doreswamy, Hemanth. K. S . A Novel Design Specification Distance (DSD) based K-Mean Clustering Performance Evaluation on Engineering Materials' Database. International Journal of Computer Applications. 55, 15 ( October 2012), 26-33. DOI=10.5120/8832-3043

@article{ 10.5120/8832-3043,
author = { Doreswamy, Hemanth. K. S },
title = { A Novel Design Specification Distance (DSD) based K-Mean Clustering Performance Evaluation on Engineering Materials' Database },
journal = { International Journal of Computer Applications },
issue_date = { October 2012 },
volume = { 55 },
number = { 15 },
month = { October },
year = { 2012 },
issn = { 0975-8887 },
pages = { 26-33 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume55/number15/8832-3043/ },
doi = { 10.5120/8832-3043 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:57:21.026035+05:30
%A Doreswamy
%A Hemanth. K. S
%T A Novel Design Specification Distance (DSD) based K-Mean Clustering Performance Evaluation on Engineering Materials' Database
%J International Journal of Computer Applications
%@ 0975-8887
%V 55
%N 15
%P 26-33
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Organizing data into semantically more meaningful is one of the fundamental modes of understanding and learning. Cluster analysis is a formal study of methods for understanding and algorithm for learning. K-mean clustering algorithm is one of the most fundamental and simple clustering algorithms. When there is no prior knowledge about the distribution of data sets, K-mean is the first choice for clustering with an initial number of clusters. In this paper a novel distance metric called Design Specification (DS) distance measure function is integrated with K-mean clustering algorithm to improve cluster accuracy. The K-means algorithm with proposed distance measure maximizes the cluster accuracy to 99. 98%at P = 1. 525, which is determined through the iterative procedure. The performance of Design Specification (DS) distance measure function with K - mean algorithm is compared with the performances of other standard distance functions such as Euclidian, squared Euclidean, City Block, and Chebshew similarity measures deployed with K-mean algorithm. The proposed method is evaluated on the engineering materials database. The experiments on cluster analysis and the outlier profiling show that these is an excellent improvement in the performance of the proposed method.

References
  1. Anil Kumar Patidar, JitendraAgrawal and Nishchol Mishra (2012) "Analysis of Different Similarity Measure Functions and their Impacts on Shared Nearest Neighbor Clustering Approach", International Journal of Computer Applications, Vol. 40. , No. 16, February 2012. pp. 1-5.
  2. Ankita Vimal, Satyanarayana R Valluri, Kamalakar Karlapalem (2008) "An Experiment with Distance Measures for Clustering" International Conference on Management of Data COMAD 2008,pp. 241-244.
  3. D T Pham and A A?fy (2007) "Clustering Techniques And Their Applications In Engineering", Proceedings of the Institution of Mechanical Engineers, Part C: Journal of Mechanical Engineering Science 221: 1445, Vol. 221. pp. 1445-1459.
  4. Doreswamy (2010) "Similarity Measuring Approach Based Engineering Materials Selection", International Journal of Computational Intelligence Systems (IJCIS). Vol. 3, pp. 115-122.
  5. Doreswamy, Manohar G, Hemanth K S (2012) "Object-Oriented Database Model For Effective Mining Of Advanced Engineering Materials Data Sets" Proceedings in Computer Science & Information technology (CS&IT) CCSEA-2012. Vol. 2, Num. 2 pp. 129-137 (ISBN : 978-1-921987-03-8) 2012
  6. Eduardo R. Hruschka, Ricardo J. G. B. Campello, Alex A. Freitas and André C. P. L. F. de Carvalho (2009) "A Survey of Evolutionary Algorithms for Clustering" IEEE Transaction on Systems,Man and Cybernetics, PartC: Applications and Reviews. ,Vol. 30. ,Issue. 2. ,pp. 133-155.
  7. Engineering materials datasets available at http://www. matweb. com.
  8. Guadalupe J. Torres, Ram B. Basnet, Andrew H. Sung, SrinivasMukkamala, and Bernardete M. Ribeiro (2008) "A Similarity Measure for Clustering and its Applications",World Academy of Science, Engineering and Technology Vol. 41. ,pp. 489-495.
  9. Irene Ntoutsi PhD Thesis "Similarity Issues in Data Mining –Methodologies and Techniques" 2008.
  10. Jain A K, (2010) "Data Clustering; 50 years beyond K-means. Patterns Recognition Letters, Vol 31 Issue 8, June, 2010 pp. 651-666.
  11. Jiawei Han, MichelineKamber and Jian Pie (2012) " Data Mining: Concepts and Techniques", Margan Kaufmann Publishers, 3rd edition.
  12. Kuang-Chiung Chang, Cheng Wen, Ming-FengYehandRen-GueyLee (2005) "A Comparison of Similarity Measures For Clustering of Qrs Complexes" Biomedical Engineering Applications, Basis & Communications Vol. 17 No. 6. ,December 2005. , pp. 324-331.
  13. M. C. Naldi, R. J. G. B. Campello, E. R. Hruschka, A. C. P. L. F. Carvalho (2011) "Efficiency issues of evolutionary k-means", Applied Soft Computing Vol. 11. ,pp. 1938-1952.
  14. Manish Verma, MaulySrivastava, NehaChack, Atul Kumar Diswar, Nidhi Gupta (2012) "A Comparative Study of Various Clustering Algorithms in Data Mining", International Journal of Engineering Research and Applications (IJERA)Vol. 2, Issue 3, May-Jun 2012, pp. 1379-1384.
  15. T. Velmurugan and T. Santhanam (2011)" A survey of Partition based clustering algorithms in Data Mining: An Experimental Approach", Information Technology Journal Vol. 10, Issue. 3, pp. 478-484.
  16. Todsanai Chumwatana, KokWai Wong and Hong Xie (2009) "Non-segmented Document Clustering Using Self-Organizing Map and Frequent Max Substring Technique" Lecture Notes in Computer Science, 2009, Volume 5864,pp. 691-698.
  17. W. D. Callister Jr. (2000) "Materials Science and Engineering". 5th ed. , John Wiley and Sons, New York, NY, USA.
Index Terms

Computer Science
Information Sciences

Keywords

K- means clustering engineering materials dataset Knowledge discovery system and novel Design Specification (DS) distance measure