Call for Paper - January 2023 Edition
IJCA solicits original research papers for the January 2023 Edition. Last date of manuscript submission is December 20, 2022. Read More

Comparison of Outlier Detection Methods in Diabetes Data

Print
PDF
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Year of Publication: 2016
Authors:
V. Mahalakshmi, M. Govindarajan
10.5120/ijca2016912451

V Mahalakshmi and M Govindarajan. Comparison of Outlier Detection Methods in Diabetes Data. International Journal of Computer Applications 155(10):28-32, December 2016. BibTeX

@article{10.5120/ijca2016912451,
	author = {V. Mahalakshmi and M. Govindarajan},
	title = {Comparison of Outlier Detection Methods in Diabetes Data},
	journal = {International Journal of Computer Applications},
	issue_date = {December 2016},
	volume = {155},
	number = {10},
	month = {Dec},
	year = {2016},
	issn = {0975-8887},
	pages = {28-32},
	numpages = {5},
	url = {http://www.ijcaonline.org/archives/volume155/number10/26642-2016912451},
	doi = {10.5120/ijca2016912451},
	publisher = {Foundation of Computer Science (FCS), NY, USA},
	address = {New York, USA}
}

Abstract

Outlier is defined as an observation that deviates extensively from other observations. The identification of outliers can lead to the discovery of useful and meaningful knowledge. Outlier detection has been widely studied in the past decades. Most refined methods in data mining address this issue to some extent, but not fully, and can be improved by addressing the problem more directly. The detection of outliers can lead to the invention of unpredicted facts in areas such as credit card fraud detection, calling card fraud detection, discovering criminal behaviors, discovering network intrusions, etc. This paper mainly discusses and compares approach of different outlier detection from data mining perspective, which can be grouped into distance-based approach and density-based approach.

References

  1. Surekha V Peshatwar & Snehlata Dongre, “Outlier Detection Over Data Stream Using Cluster Based Approach And Distance Based Approach”, International Conference on Electrical Engineering and Computer Science (ICEECS-2012), Trivandrum, May 12th, 2012.
  2. Pranjali Kasture, Jayant Gadge, “Cluster based Outlier Detection”, International Journal of Computer Applications (0975 – 8887) Vol.58(10), November 2012.
  3. Garima Singh, Vijay Kumar, “An Efficient Clustering and Distance Based Approach for Outlier Detection”, International Journal of Computer Trends and Technology (IJCTT), Vol.4(7), July 2013.
  4. H. Dutta, C. Giannella, K.D. Borne, and H. Kargupta, “Distributed Top-K Outlier Detection from Astronomy Catalogs Using the DEMAC System,” Proc. SIAM International Conference in Data Mining (SDM), 2007.
  5. C. Aggarwal, J. Han, J. Wang, P.S. Yu, “A framework for projected clustering of high dimensional data streams”, in Proceedings of the 30th VLDB Conference, Toronto, Canada, 2004, pp. 852-863.
  6. Angiulli, F., and Pizzuti, C., “Fast outlier detection in high dimensional spaces”, Proceedings of the 6th European Conference on Principles of Data Mining and Knowledge Discovery, PKDD 002, Springer-Verlag, London, UK, 2002, pp. 15–26.
  7. Chandola, V., Banerjee, A., and Kumar, V., “Anomaly detection: a survey”, ACM Comput. Surv. (CSUR), Vol. 41(3), 2009, pp. 1–58.
  8. Chawla, S., and Gionis, A., “k-means–: A unified approach to clustering and outlier detection”, SDM, SIAM, 2013, pp. 189–197.
  9. Acuna, E., and Rodriguez, C. A., “A meta analysis study of outlier detection methods in classification”, Technical paper, Department of Mathematics, University of Puerto Rico at Mayaguez, 2004, pp. 1-25.

Keywords

Outlier detection, Distance-based approach, Density-based approach