CFP last date
22 April 2024
Reseach Article

Comparison of Outlier Detection Methods in Diabetes Data

by V. Mahalakshmi, M. Govindarajan
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 155 - Number 10
Year of Publication: 2016
Authors: V. Mahalakshmi, M. Govindarajan
10.5120/ijca2016912451

V. Mahalakshmi, M. Govindarajan . Comparison of Outlier Detection Methods in Diabetes Data. International Journal of Computer Applications. 155, 10 ( Dec 2016), 28-32. DOI=10.5120/ijca2016912451

@article{ 10.5120/ijca2016912451,
author = { V. Mahalakshmi, M. Govindarajan },
title = { Comparison of Outlier Detection Methods in Diabetes Data },
journal = { International Journal of Computer Applications },
issue_date = { Dec 2016 },
volume = { 155 },
number = { 10 },
month = { Dec },
year = { 2016 },
issn = { 0975-8887 },
pages = { 28-32 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume155/number10/26642-2016912451/ },
doi = { 10.5120/ijca2016912451 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T00:00:55.434706+05:30
%A V. Mahalakshmi
%A M. Govindarajan
%T Comparison of Outlier Detection Methods in Diabetes Data
%J International Journal of Computer Applications
%@ 0975-8887
%V 155
%N 10
%P 28-32
%D 2016
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Outlier is defined as an observation that deviates extensively from other observations. The identification of outliers can lead to the discovery of useful and meaningful knowledge. Outlier detection has been widely studied in the past decades. Most refined methods in data mining address this issue to some extent, but not fully, and can be improved by addressing the problem more directly. The detection of outliers can lead to the invention of unpredicted facts in areas such as credit card fraud detection, calling card fraud detection, discovering criminal behaviors, discovering network intrusions, etc. This paper mainly discusses and compares approach of different outlier detection from data mining perspective, which can be grouped into distance-based approach and density-based approach.

References
  1. Surekha V Peshatwar & Snehlata Dongre, “Outlier Detection Over Data Stream Using Cluster Based Approach And Distance Based Approach”, International Conference on Electrical Engineering and Computer Science (ICEECS-2012), Trivandrum, May 12th, 2012.
  2. Pranjali Kasture, Jayant Gadge, “Cluster based Outlier Detection”, International Journal of Computer Applications (0975 – 8887) Vol.58(10), November 2012.
  3. Garima Singh, Vijay Kumar, “An Efficient Clustering and Distance Based Approach for Outlier Detection”, International Journal of Computer Trends and Technology (IJCTT), Vol.4(7), July 2013.
  4. H. Dutta, C. Giannella, K.D. Borne, and H. Kargupta, “Distributed Top-K Outlier Detection from Astronomy Catalogs Using the DEMAC System,” Proc. SIAM International Conference in Data Mining (SDM), 2007.
  5. C. Aggarwal, J. Han, J. Wang, P.S. Yu, “A framework for projected clustering of high dimensional data streams”, in Proceedings of the 30th VLDB Conference, Toronto, Canada, 2004, pp. 852-863.
  6. Angiulli, F., and Pizzuti, C., “Fast outlier detection in high dimensional spaces”, Proceedings of the 6th European Conference on Principles of Data Mining and Knowledge Discovery, PKDD 002, Springer-Verlag, London, UK, 2002, pp. 15–26.
  7. Chandola, V., Banerjee, A., and Kumar, V., “Anomaly detection: a survey”, ACM Comput. Surv. (CSUR), Vol. 41(3), 2009, pp. 1–58.
  8. Chawla, S., and Gionis, A., “k-means–: A unified approach to clustering and outlier detection”, SDM, SIAM, 2013, pp. 189–197.
  9. Acuna, E., and Rodriguez, C. A., “A meta analysis study of outlier detection methods in classification”, Technical paper, Department of Mathematics, University of Puerto Rico at Mayaguez, 2004, pp. 1-25.
Index Terms

Computer Science
Information Sciences

Keywords

Outlier detection Distance-based approach Density-based approach