CFP last date
20 May 2024
Reseach Article

Efficient Quality Assessment Technique with Integrated Cluster Validation and Decision Trees

by S.Prakash Kumar, Dr.K.S.Ramaswami
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 21 - Number 9
Year of Publication: 2011
Authors: S.Prakash Kumar, Dr.K.S.Ramaswami
10.5120/2539-3474

S.Prakash Kumar, Dr.K.S.Ramaswami . Efficient Quality Assessment Technique with Integrated Cluster Validation and Decision Trees. International Journal of Computer Applications. 21, 9 ( May 2011), 30-36. DOI=10.5120/2539-3474

@article{ 10.5120/2539-3474,
author = { S.Prakash Kumar, Dr.K.S.Ramaswami },
title = { Efficient Quality Assessment Technique with Integrated Cluster Validation and Decision Trees },
journal = { International Journal of Computer Applications },
issue_date = { May 2011 },
volume = { 21 },
number = { 9 },
month = { May },
year = { 2011 },
issn = { 0975-8887 },
pages = { 30-36 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume21/number9/2539-3474/ },
doi = { 10.5120/2539-3474 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:08:03.309614+05:30
%A S.Prakash Kumar
%A Dr.K.S.Ramaswami
%T Efficient Quality Assessment Technique with Integrated Cluster Validation and Decision Trees
%J International Journal of Computer Applications
%@ 0975-8887
%V 21
%N 9
%P 30-36
%D 2011
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Clustering becomes a key technique in analyzing quality assessment in most of the recent research works. The partitioned clustering techniques used in previous work utilize attributes of objects to form cluster. The cluster numbers were initialized, which reduces cluster quality in terms of cluster object aggregation and appropriation. The work presented an efficient quality assessment technique comprising of two parts i.e., fuzzy k-means cluster validation scheme and decision tree model. The Fuzzy k-means cluster validation scheme improves recall and precision measure of automatically labeling cluster objects. The decision tree model evaluates labeled cluster object and decides on the appropriation of attributes to its cluster validity index. The cluster quality index is measured in terms of number of clusters, number of objects in each cluster, cluster object cohesiveness, precision and recall values. Cluster validates focus on quality metrics of the institution data set features experimented with real and synthetic data sets. The results of quality indexed fuzzy k-means shows better cluster validation compared to that of traditional k-family algorithm. The experimental results of cluster validation scheme and decision tree confirm the reliability of quality validity index which performs better than other traditional k-family clusters.

References
  1. Delavari N, Beikzadeh M. R, Shirazi M. R. A., “A New Model for Using Data Mining in Educational System”, 5th International Conference on Information Technology based Education and Training: ITEHT ’04, Istanbul, Turkey, 31st May-2nd Jun 2004.
  2. Delavari N, Beikzadeh M. R, “A New Analysis Model for Data Mining Processes in Educational Systems”, MMU International Symposium on Information and Communications Technologies 2004 in conjuction with the 5th National Conference on Telecommunication Technology 2004, Putrajaya, Malaysia, 7th- 8th October 2004.
  3. Chapman P, Clinton J, Kerber R, Khabaza T, Reinartz T, Shearer C, Wirth R, CRISP-DM 1.0: Step-by-step data mining guide, 2000
  4. Two Crows Corporation, “Introduction to Data Mining and Knowledge Discovery”, TwoCrows Corporation, Third Edition, U.S.A, 1999.
  5. Han J, Kamber M, ”Data Mining: Concepts and Techniques”, Simon Fraser University, Morgan Kaufmann publishers, ISBN 1-55860-489-8. 2001.
  6. Chen M .S, Han J, Yu P. S, "Data Mining: An Overview from a Database Perspective". IEEE Transaction on Knowledge and Data Engineering, 1996.
  7. Han J, "How can Data Mining Help Bio-Data Analysis". BIOKDD02: Workshop on data mining in Bioinformatics, 2002.
  8. Feldman R, "Mining the Biomedical Literature using Semantic Analysis and Neural Language Processing Techniques, a link analysis approaches". ClearForest Corporation, New York, 2003.
  9. Edelstein H, "Building Profitable Customer Relationships with Data Mining", Two Crows Corporation, SPSS white paper-executive briefing, 2000.
  10. Chang W. H. T, Lee Y. H, " Telecommunications Data Mining for Target Marketing," Journal of Computers, Vol. 12, No. 4, December 2000, pp.60-74.
  11. Mobasher B, Jain N, Han E, Srivastava J, "Web Mining: Pattern Discovery from World Wide Web Transactions", Technical Report TR96-050, Department of Computer Science, University of Minnesota, 1996.
  12. The Y. W, Mustaffa K. M, Zaitun A. B, Lee, "Data Mining In Computer Auditing". Informing Science. Cork, Ireland June 19-21, 2002.
  13. Baylis P, "Better Health Care with Data Mining", SPSS White Paper, UK, 1999.
  14. Brossette S. E, Sprague A. P, Hardin J. M, Waites K. B, Jones W. T, Moser S.A, "Association Rules and Data Mining in Hospital Infection Control and Public Health Surveillance", Journal of the American Medical Informatics Association (JAMIA), vol. 5: 1998, pp.373-381.
  15. Luan J, "Data mining and Knowledge Management, A System Analysis for Establishing a Tiered Knowledge Management Model (TKMM)", Proceedings of Air Forum, Toronto, Canada. 2001.
  16. Rajesh N. Dave. "Validating fuzzy partitions obtained through c-shells clustering", Pattern Recognition Letters, Vol .17, pp613-623, 1996
  17. J. C. Dunn. "Well separated clusters and optimal fuzzy partitions", J. Cybern. Vol.4, pp. 95- 104, 1974
  18. Milligan, G.W. and Cooper, M.C. (1985), "An Examination of Procedures for Determining the Number of Clusters in a Data Set", Psychometrika, 50, 159-179.
  19. Milligan G. W., Soon S.C., Sokol L. M. “The effect of cluster size, dimensionality and the number of clusters on recovery of true cluster structure”. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 5, pp. 40-47, 1983
  20. S. Theodoridis, K. Koutroubas. Pattern recognition, Academic Press, 1999
  21. Xunali Lisa Xie, Genardo Beni. "A Validity measure for Fuzzy Clustering", IEEE Transactions on Pattern Analysis and machine Intelligence, Vol13, No4, August 1991.
Index Terms

Computer Science
Information Sciences

Keywords

Cluster Validation Fuzzy K-Means Quality Assessment