Call for Paper - May 2023 Edition
IJCA solicits original research papers for the May 2023 Edition. Last date of manuscript submission is April 20, 2023. Read More

An Efficient Clustering Algorithm for Outlier Detection

Print
PDF
International Journal of Computer Applications
© 2011 by IJCA Journal
Number 1 - Article 1
Year of Publication: 2011
Authors:
S.Vijayarani
S.Nithya
10.5120/3916-5514

S.Vijayarani and S.Nithya. Article:An Efficient Clustering Algorithm for Outlier Detection. International Journal of Computer Applications 32(7):22-27, October 2011. Full text available. BibTeX

@article{key:article,
	author = {S.Vijayarani and S.Nithya},
	title = {Article:An Efficient Clustering Algorithm for Outlier Detection},
	journal = {International Journal of Computer Applications},
	year = {2011},
	volume = {32},
	number = {7},
	pages = {22-27},
	month = {October},
	note = {Full text available}
}

Abstract

With the help of data mining, an important and valuable knowledge is extracted from the large massive collection of data. There are several techniques and algorithms are used for extracting the hidden patterns from the large data sets and finding the relationships between them. Clustering is one of the important techniques in data mining. Clustering algorithms are used for grouping the data items based on their similarity. Outlier Detection is a very important research problem in data mining. Clustering algorithms are used for detecting the outliers efficiently. In this research paper, we focused on outlier detection in health data sets such as Pima Indians Diabetes data set and Breast Cancer Wisconsin data set using partitioning clustering algorithms. The algorithms used in this research work are PAM, CLARA AND CLARANS and a new clustering algorithm ECLARANS is proposed for detecting outliers. In order to find the best clustering algorithm for outlier detection several performance measures are used. The experimental results show that the outlier detection accuracy is very good in the proposed ECLARANS clustering algorithm compared to the existing algorithms.

Reference

  • Arun K Pujari: Data Mining Techniques, Universities Press (India) Private Limited 2001.
  • Ajay Challagalla,S.S.Shivaji Dhiraj ,D.V.L.N Somayajulu,Toms Shaji Mathew,Saurav Tiwari,Syed Sharique Ahmad “ Privacy Preserving Outlier Detection Using Hierarchical Clustering Methods,2010 34th Annual IEEE Computer Software and Applications Conference Workshops.
  • Al-Zoubi, M. (2009) An Effective Clustering-Based Approach for Outlier Detection, European Journal of Scientific Research.
  • Jiang, S. And An, Q. (2008) Clustering Based Outlier Detection Method, Fifth International Conference on Fuzzy Systems and Knowledge Discovery.
  • John Peter.S., Department of computer science and research center St.Xavier’s College, Palayamkottai, An Efficient Algorithm for Local Outlier Detection Using Minimum Spanning Tree, International Journal of Research and Reviews in Computer Science (IJRRCS), March 2011.
  • Loureiro, A., Torgo, L. And Soares, C. (2004) Outlier Detection using Clustering Methods: A Data Cleaning Application, in Proceedings of KDNet Symposium on Knowledge-Based Systems for the public Sector. Bonn, Germany.
  • Murugavel. P. et al, Improved Hybrid Clustering And Distance-Based Technique for Outlier Removal, International Journal on Computer Science and Engineering (IJCSE), 1 JAN 2011
  • Ng, R. and Han, J. (1994) Efficient and Effective Clustering Methods for Spatial Data Mining,” Proc. 20th Conf.
  • Ng, R. and Han, J. (2002) CLARANS: A Method for Clustering Objects for Spatial Data Mining, IEEE Transactions on Knowledge and Data Engineering.
  • Outlier Detection Algorithms in Data Mining Systems. I. Petrovskiy, Department of Computational Mathematics and Cybernetics, Moscow State University, Vorob’evy gory, Moscow, 119992 Russia.e-mail: This e-mail address is being protected from spambots. You need JavaScript enabled to view it .suReceived February 19, 2003.
  • OUTLIER DETECTION, Irad Ben-Gal, Department of Industrial Engineering,Tel-Aviv University,Ramat-Aviv, Tel-Aviv 69978, Israel., This e-mail address is being protected from spambots. You need JavaScript enabled to view it .
  • Velmurugan, T. and Santhanam, T. (2011) A survey of partition based clustering algorithms in data mining: An experimental approach, Inform. Technol. J.,