Case Study: Enhanced Clustering Technique on Sequential Data Streams using Optics and Chameleon

K. SanthiSree; V. Vineela; Y. Ambica; Ch. Anitha

Call for Paper

July Edition

IJCA solicits high quality original research papers for the upcoming July edition of the journal. The last date of research paper submission is 22 June 2026

Submit your paper

Know more

The week's pick

CAD-Genesis: An Open-Source AI-Powered Add-in for Natural Language-Driven Parametric CAD Modeling and Cross-Platform Integration in SolidWorks and Fusion 360

Anil Mandloi Prakhi Mandloi

Random Articles

FPGA Implementation of Intelligent Climate Control for Greenhouse

February

2010

Facial Features Extraction by Relative Geometrical Position

July

2014

A Robust Audio Steganographic Scheme in Time Domain (RASSTD)

October

2013

Migration Effects on BBO Evolution in Optimizing Fifteen Element Yagi-Uda Antenna Design

April

2013

Reseach Article

Case Study: Enhanced Clustering Technique on Sequential Data Streams using Optics and Chameleon

by K. SanthiSree, V. Vineela, Y. Ambica, Ch. Anitha

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 176 - Number 20

Year of Publication: 2020

Authors: K. SanthiSree, V. Vineela, Y. Ambica, Ch. Anitha

10.5120/ijca2020920151

K. SanthiSree, V. Vineela, Y. Ambica, Ch. Anitha . Case Study: Enhanced Clustering Technique on Sequential Data Streams using Optics and Chameleon. International Journal of Computer Applications. 176, 20 ( May 2020), 1-5. DOI=10.5120/ijca2020920151

@article{ 10.5120/ijca2020920151,

author = { K. SanthiSree, V. Vineela, Y. Ambica, Ch. Anitha },

title = { Case Study: Enhanced Clustering Technique on Sequential Data Streams using Optics and Chameleon },

journal = { International Journal of Computer Applications },

issue_date = { May 2020 },

volume = { 176 },

number = { 20 },

month = { May },

year = { 2020 },

issn = { 0975-8887 },

pages = { 1-5 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume176/number20/31312-2020920151/ },

doi = { 10.5120/ijca2020920151 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-07T00:43:00.390716+05:30

%A K. SanthiSree

%A V. Vineela

%A Y. Ambica

%A Ch. Anitha

%T Case Study: Enhanced Clustering Technique on Sequential Data Streams using Optics and Chameleon

%J International Journal of Computer Applications

%@ 0975-8887

%V 176

%N 20

%P 1-5

%D 2020

%I Foundation of Computer Science (FCS), NY, USA

Abstract

Huge data is getting accumulated every second in the real world .Clustering on web usage data is useful to identify what users are exactly looking for on the world wide web, like user traversals, users behavior and their characteristics, which helps for Web personalization. Clustering web sessions is to group them based on similarity and consists of minimizing the Intra-cluster similarity and maximizing the Inter-group similarity. In the past there exist multiple similarity measures like Euclidean, Jaccard ,Cosine , Manhattan, Minkowski, and many to measure similarity between web patterns. In this paper, we enhanced Chameleon Clustering Algorithm(CCA) based on CHAMELEON. Experiments are performed on MSNBC.COM website (free online news channel), on sequential data streams in the context of clustering in the domain of Web usage mining. Clustering in data mining is a discovery process that groups a set of data such that the intra-cluster similarity is maximized and the inter-cluster similarity is minimized. Existing clustering algorithms, such as K-means, PAM, CLARANS, DBSCAN, CURE, and ROCK are designed to find clusters that fit some static models Specially, we present a detailed comparison of OPTICS and CHAMELEON and the results illustrate that CHAMELEON is much more suitable for clustering the dynamic datasets. The Inter-cluster and Intra-cluster distances are computed using Average Levenshtein Distance (ALD) to demonstrate the usefulness of the proposed approach in the context of web usage mining. This new enhanced (CHAMELEON algorithm)has good results when compared with existing OPTICS clustering technique , and provided good time requirements of the newly developed algorithms.

References

Aggarwal.C, Han.J, Wang.J, Yu.P.S, “A Framework for Projected Clustering of High Dimensional Data Streams”, 2004,pp.(852-863)Int. Conf. on Very Large Data Bases, Toronto, Canada.
Aoying.Z, Shuigeng.Z, “Approaches for scaling DBSCAN algorithm to large spatial database”, 2000,pp.(509–526),Journal of Computer Science and Technology, 15(6).
Chen Song-Yu, O'Grady2,O'Hare, Wei Wang, “A Clustering Algorithm Incorporating Density and Direction”, IAWTAC ,IEEE 2008.Deepak P, Shourya Roy IBM India Research Lab, OPTICS on Text Data: Experiments and Test Results.
Cooley.R,Mobasher, B,Srivastava.J, “Web mining: Information and pattern discovery on the world wide web”, 9th IEEE Int. Conf. Tools AI.
K.santhiSree, R.Kranthi Kumar, International Journal of computer publications:Case Study : Comparative Analysis: On Clustering of Sequential Data Streams USING Optics and ICA,2016,(34-37),135(2), (0975 – 8887).
Guha.S, Mishra.N, Motwani.R, Callaghan.l,“ Clustering data streams”. In Proceedings of Computer Science. IEEE,,November,2000, pp(1391-1399), 16(10).
K.Santhisree, Dr A.Damodaram, “SSM-DBSCAN and SSM-OPTICS : Incorporating a new similarity measure for Density based Clustering of Web usage data”. 2011,,International Journal on Computer Science and Engineering (IJCSE),.3(9),PP.(3170-3184)September India.
K.Santhisree,”SSM-DENCLUE : Enhanced Approach for Clustering of Sequentialdata” Experiments and Test cases, June 2014.,International Journal of Computer Applications,96(6),pp.(7-14),Published by Foundation of Computer Science, New York, USA.
https://www-users.cs.umn.edu/~hanxx023/dmclass/chameleon.pdf

Index Terms

Computer Science

Information Sciences

Keywords

Sequence Mining Clustering Density Based Clustering(optics). Data Mining Clustering similarity measures Web Personalization.