CFP last date
20 May 2024
Reseach Article

Reduce Noise in K-Mean Clustering using DBSCAN Algorithm

by Manjur Ahammad, Faija Juhin, Dewan Md. Farid
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 184 - Number 9
Year of Publication: 2022
Authors: Manjur Ahammad, Faija Juhin, Dewan Md. Farid
10.5120/ijca2022922064

Manjur Ahammad, Faija Juhin, Dewan Md. Farid . Reduce Noise in K-Mean Clustering using DBSCAN Algorithm. International Journal of Computer Applications. 184, 9 ( Apr 2022), 21-25. DOI=10.5120/ijca2022922064

@article{ 10.5120/ijca2022922064,
author = { Manjur Ahammad, Faija Juhin, Dewan Md. Farid },
title = { Reduce Noise in K-Mean Clustering using DBSCAN Algorithm },
journal = { International Journal of Computer Applications },
issue_date = { Apr 2022 },
volume = { 184 },
number = { 9 },
month = { Apr },
year = { 2022 },
issn = { 0975-8887 },
pages = { 21-25 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume184/number9/32357-2022922064/ },
doi = { 10.5120/ijca2022922064 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T01:21:02.931476+05:30
%A Manjur Ahammad
%A Faija Juhin
%A Dewan Md. Farid
%T Reduce Noise in K-Mean Clustering using DBSCAN Algorithm
%J International Journal of Computer Applications
%@ 0975-8887
%V 184
%N 9
%P 21-25
%D 2022
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The growth of data mining procedure is increasing day by day. We can extract useful insight from data. For mining data different techniques and tools have been introduced every day. By gaining knowledge from those insight of the data many research paper is being written. Based on the behavior, pattern and the characteristics data are being clustered into different groups. For clustering these massive amount of data we use different types of algorithms and techniques. The most common types of algorithms that are used in clustering are partitioning, hierarchical, grid-based and model-based algorithms. To handle these data another, type of algorithms are K-means clustering, density-based algorithm, similarity-based algorithms etc. the agenda off these algorithms are different. Some performs well for nominal data, some for categorical or ordinal data, contrariwise some can remove duplicate or noisy data and some can’t do so. In this paper a method has been showed that how can we cluster a dataset and remove the noisiness of that particular dataset at the same time.

References
  1. L.-J. Zhang, J. Zhang, and H. Cai, Services computing. Springer, 2007.
  2. F. Curbera et al. Unraveling the Web Services: An Introduction to SOAP, WSDL, and UDDI. IEEE Internet Computing, Mar/ Apr issue 2002.
  3. D.A. Menasce, “QoS issues in web services,” IEEE Internet Compute., vol. 6, pp. 72–75, 2002.
  4. Tao Yu, Yue Zhang, and Kwei-Jay Iin, “Efficient Algorithms for Web Services Selection with End-to-End QoS Constraints”, ACM Transactions on the Web, Vol. 1, No. 1, Article 6, Publication date: May 2007.
  5. Kaufman, L. and Rousseeuw, P. (1990). Finding Groups in Data—An Introduction to Cluster Analysis. Wiley Series in Probability and Mathematical Statistics. NewYork: JohnWiley& Sons, Inc.
  6. Fujikawa, Y. and Ho, T. (2002). Cluster-based algorithms for dealing with missing values.In Cheng, M.-S., Yu, P. S., and Liu, B., editors, Advances in Knowledge Discovery and Data Mining, Proceedings of the 6th Pacific-Asia Conference, PAKDD 2002, Taipei,Taiwan, volume 2336 of Lecture Notes in Computer Science, pages 549–554. New York:Springer.
  7. S. Y. Hwang, H. Wang, J. Tang, and J. Srivastava, “A probabilistic approach to modeling and estimating the QoS of web-servicesbased workflows,” Info. Sci., vol. 177, pp. 5484–5503, 2007.
  8. J. Z. Huang, M. K. Ng, H. Rong, and Z. Li, “Automated variable weighting in k-means type clustering,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 27, no. 5, pp. 657–668, May 2005.
  9. A. K. Jain, “Data clustering: 50 years beyond k-means,” Pattern Recognition Letters, vol. 31, no. 8, pp. 651–666, June 2010.
  10. A. Avizienis, J.-C. Laprie, B. Randell, and C. Landwehr, ‘‘Basic Concepts and Taxonomy of Dependable and Secure Computing,’’ IEEE Trans. Dependable Secure Comput., vol. 1, no. 1, pp. 11 /33, Jan.-Mar. 2004.
  11. Marin Silic, GoranDelac, Ivo Krka and SinisaSrbljic, “Scalable and Accurate Prediction of Availability of Atomic Web Services”, IEEE transaction on service computing, 2014.
  12. L. Shao, J. Zhang, Y. Wei, J. Zhao, B. Xie, and H. Mei, “Personalized QoS prediction for Web services via collaborative filtering, ”in Proc. 5th International Conference on Web Services (ICWS 2007), 2007, pp. 439446.
  13. T.Miranda Lakshmi, R.JosephineSahana, V.PrasannaVenkatesan. Review on Density-Based Clustering Algorithms for Big Data p.15, p.17.
Index Terms

Computer Science
Information Sciences

Keywords

Big Data K-Means Clustering DBSCAN OPTICS