CFP last date
20 May 2024
Reseach Article

Multi-Density based Incremental Clustering

by Lanka Pradeep, A.m. Sowjanya
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 116 - Number 17
Year of Publication: 2015
Authors: Lanka Pradeep, A.m. Sowjanya
10.5120/20426-2742

Lanka Pradeep, A.m. Sowjanya . Multi-Density based Incremental Clustering. International Journal of Computer Applications. 116, 17 ( April 2015), 6-9. DOI=10.5120/20426-2742

@article{ 10.5120/20426-2742,
author = { Lanka Pradeep, A.m. Sowjanya },
title = { Multi-Density based Incremental Clustering },
journal = { International Journal of Computer Applications },
issue_date = { April 2015 },
volume = { 116 },
number = { 17 },
month = { April },
year = { 2015 },
issn = { 0975-8887 },
pages = { 6-9 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume116/number17/20426-2742/ },
doi = { 10.5120/20426-2742 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:57:21.921190+05:30
%A Lanka Pradeep
%A A.m. Sowjanya
%T Multi-Density based Incremental Clustering
%J International Journal of Computer Applications
%@ 0975-8887
%V 116
%N 17
%P 6-9
%D 2015
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Clustering is the task of grouping a set of objects in such a way that objects in the same group are more similar to each other than to those in other groups . It is a common technique for statistical data analysis, used in many fields, including machine learning, pattern recognition, image analysis, information retrieval, and bioinformatics. A major difficulty in design of modern clustering algorithms is that, new datasets are dynamically added to the existing large database and it is not efficient to perform data clustering on the entire database every time a new dataset is added to the database. The new data added dynamically to the existing database is called incremental data. DBSCAN is widely used density based clustering algorithm. However it is known that DBSCAN fails to identify clusters of different densities. This paper presents a simple and efficient algorithm that identifies clusters of different densities and arbitrary shapes with automatic Eps estimation. Eps is estimated by using distance curve and difference of slopes and DBSCAN is applied on the data for each estimated Eps, resulting in multi-density clusters. Then by making use of formed clusters, incrementally updated data is clustered.

References
  1. M Ester, H-P. Kriegel. J. Sander, and X, Xu. 1996. "A density-based algorithm for discovering clusters in large spatial databases". KDD'96.
  2. Sushmita Mitra, Jay Nandy, KDDClus: A simple method for multi-density clustering, in: Proc. 2011 International Workshop on Soft Computing Applications and Knowledge Discovery (SCAKD'2011), Moscow, 2011, 72-76.
  3. Chien-Yu Chen, Shien-Ching Hwang, and Yen-Jen Oyang,"An Incremental Hierarchical Data Clustering Algorithm Based on Gravity Theory", Proceedings of the 6th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, Pages: 237 - 250, 2002.
  4. Dan Simovici, Namita Singla, "Metric Incremental Clustering of Nominal Data", ICDM, pp: 523-526, 2004.
  5. M. Charikar, C. Chekur, T. Feder, and R. Motwani, Incremental clustering and dynamic information retrieval,in Proc. of the 29th Annual ACM Symposium on Theory of Computing, 1997, pp. 626–635. .
  6. Jiawei Han, Micheline Kamber, "Data Mining Concepts and Techniques", Harcourt India Private Limited, 2001.
  7. Seokkyung Chung and Dennis McLeod, "Dynamic Pattern Mining: An Incremental Data Clustering Approach",Journal on Data Semantics, Vol. 2, pp. 85-112, 2005.
  8. Pang-Ning Tan, Michael Steinbach, Vipin Kumar, "Introducing to Data Mining", Pearson Education Asia LTD, 2006.
  9. Jason D. Peterson, "Clustering overview", http://www. cs. ndsu. nodak. edu/~jasonpet/CSCI779/Clustering. pdf.
  10. Wikipedia "cluster analysis", http://en. wikipedia. org/wiki/Cluster_analysis.
  11. M. H. Marghny, Rasha M. Abd El-Aziz and Ahmed I. Taloba, "An Effective Evolutionary Clustering Algorithm: Hepatitis C Case Study", Computer Science Department, Egypt, International Journal of Computer Applications, vol. 34, No. 6, pp. 0975-8887, 2011.
  12. Sowjanya, A. M. and M. Shashi, 2010. Cluster featurebased incremental clustering approach (CFICA) for numerical data. IJCSNS Int. J. Comp. Sci. NetworkSecurity, 10.
  13. R. Ding, Q. Wang, Y. Dang, Q. Fu, H. Zhang, and D. Zhang. Yading: Fast clustering of large-scale time series data. In VLDB, 2015.
Index Terms

Computer Science
Information Sciences

Keywords

Data clustering Incremental clustering Multi-density clustering Data mining.