Multi-Density based Incremental Clustering

Lanka Pradeep; A.m. Sowjanya

Call for Paper

July Edition

IJCA solicits high quality original research papers for the upcoming July edition of the journal. The last date of research paper submission is 20 June 2025

Submit your paper

Know more

The week's pick

Designing Multi-Tenant E-Learning Systems in the Cloud: A Process-Oriented Approach for Higher Education

Sameh Azouzi Sonia Ayachi Ghannouchi

Random Articles

Data Mining using Modified GFMM Neural Network

April

2015

Monitoring System using GSM

May

2015

ON Tiling Patterns Involving Islamic Stars with an Odd Number of Vertices

March

2013

Design and Implementation of Scalable, Fully Distributed Web Crawler for a Web Search Engine

February

2011

Reseach Article

Multi-Density based Incremental Clustering

by Lanka Pradeep, A.m. Sowjanya

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 116 - Number 17

Year of Publication: 2015

Authors: Lanka Pradeep, A.m. Sowjanya

10.5120/20426-2742

Lanka Pradeep, A.m. Sowjanya . Multi-Density based Incremental Clustering. International Journal of Computer Applications. 116, 17 ( April 2015), 6-9. DOI=10.5120/20426-2742

@article{ 10.5120/20426-2742,

author = { Lanka Pradeep, A.m. Sowjanya },

title = { Multi-Density based Incremental Clustering },

journal = { International Journal of Computer Applications },

issue_date = { April 2015 },

volume = { 116 },

number = { 17 },

month = { April },

year = { 2015 },

issn = { 0975-8887 },

pages = { 6-9 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume116/number17/20426-2742/ },

doi = { 10.5120/20426-2742 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T22:57:21.921190+05:30

%A Lanka Pradeep

%A A.m. Sowjanya

%T Multi-Density based Incremental Clustering

%J International Journal of Computer Applications

%@ 0975-8887

%V 116

%N 17

%P 6-9

%D 2015

%I Foundation of Computer Science (FCS), NY, USA

Abstract

Clustering is the task of grouping a set of objects in such a way that objects in the same group are more similar to each other than to those in other groups . It is a common technique for statistical data analysis, used in many fields, including machine learning, pattern recognition, image analysis, information retrieval, and bioinformatics. A major difficulty in design of modern clustering algorithms is that, new datasets are dynamically added to the existing large database and it is not efficient to perform data clustering on the entire database every time a new dataset is added to the database. The new data added dynamically to the existing database is called incremental data. DBSCAN is widely used density based clustering algorithm. However it is known that DBSCAN fails to identify clusters of different densities. This paper presents a simple and efficient algorithm that identifies clusters of different densities and arbitrary shapes with automatic Eps estimation. Eps is estimated by using distance curve and difference of slopes and DBSCAN is applied on the data for each estimated Eps, resulting in multi-density clusters. Then by making use of formed clusters, incrementally updated data is clustered.

References

M Ester, H-P. Kriegel. J. Sander, and X, Xu. 1996. "A density-based algorithm for discovering clusters in large spatial databases". KDD'96.
Sushmita Mitra, Jay Nandy, KDDClus: A simple method for multi-density clustering, in: Proc. 2011 International Workshop on Soft Computing Applications and Knowledge Discovery (SCAKD'2011), Moscow, 2011, 72-76.
Chien-Yu Chen, Shien-Ching Hwang, and Yen-Jen Oyang,"An Incremental Hierarchical Data Clustering Algorithm Based on Gravity Theory", Proceedings of the 6th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, Pages: 237 - 250, 2002.
Dan Simovici, Namita Singla, "Metric Incremental Clustering of Nominal Data", ICDM, pp: 523-526, 2004.
M. Charikar, C. Chekur, T. Feder, and R. Motwani, Incremental clustering and dynamic information retrieval,in Proc. of the 29th Annual ACM Symposium on Theory of Computing, 1997, pp. 626–635. .
Jiawei Han, Micheline Kamber, "Data Mining Concepts and Techniques", Harcourt India Private Limited, 2001.
Seokkyung Chung and Dennis McLeod, "Dynamic Pattern Mining: An Incremental Data Clustering Approach",Journal on Data Semantics, Vol. 2, pp. 85-112, 2005.
Pang-Ning Tan, Michael Steinbach, Vipin Kumar, "Introducing to Data Mining", Pearson Education Asia LTD, 2006.
Jason D. Peterson, "Clustering overview", http://www. cs. ndsu. nodak. edu/~jasonpet/CSCI779/Clustering. pdf.
Wikipedia "cluster analysis", http://en. wikipedia. org/wiki/Cluster_analysis.
M. H. Marghny, Rasha M. Abd El-Aziz and Ahmed I. Taloba, "An Effective Evolutionary Clustering Algorithm: Hepatitis C Case Study", Computer Science Department, Egypt, International Journal of Computer Applications, vol. 34, No. 6, pp. 0975-8887, 2011.
Sowjanya, A. M. and M. Shashi, 2010. Cluster featurebased incremental clustering approach (CFICA) for numerical data. IJCSNS Int. J. Comp. Sci. NetworkSecurity, 10.
R. Ding, Q. Wang, Y. Dang, Q. Fu, H. Zhang, and D. Zhang. Yading: Fast clustering of large-scale time series data. In VLDB, 2015.

Index Terms

Computer Science

Information Sciences

Keywords

Data clustering Incremental clustering Multi-density clustering Data mining.