OPTICS on Sequential Data: Experiments and Test Results

International Journal of Computer Applications
© 2010 by IJCA Journal
Number 5 - Article 1
Year of Publication: 2010
Dr. A. Damodaram

The Web holds enormous, varied, and information-rich data for data mining research. Clustering web usage data is useful for discovering interesting patterns in user traversals, behaviour, and usage characteristics. Moreover, users access web pages in the order in which they are interested, so incorporating the sequential nature of their usage is crucial for clustering web transactions. In this paper we apply the OPTICS ("Ordering Points To Identify the Clustering Structure") algorithm to find density-based clusters in web usage data from the MSNBC.COM website, a free news site with many different categories of news. The clusters are generated by the OPTICS algorithm, and the average inter-cluster and intra-cluster distances are calculated. The results are compared across different similarity measures: Euclidean, Jaccard, projected Euclidean, cosine, and fuzzy similarity. Finally, we show the behaviour of the clusters produced by the OPTICS algorithm on sequential data in the web usage domain. We performed a variety of experiments in the context of density-based clustering, quantify our results with explanations, and list our conclusions.
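As a rough illustration of the evaluation described above, the following sketch computes three of the named measures (Jaccard, cosine, Euclidean) and the average intra-cluster and inter-cluster distances. It is a minimal sketch, not the paper's implementation: the function names, the toy sessions, and the hand-assigned clusters are all assumptions made for illustration, and the clusters here are not produced by OPTICS. Note that the Jaccard measure shown treats a session as a set of page categories and therefore ignores visit order.

```python
from itertools import combinations
from math import sqrt


def jaccard_distance(a, b):
    """1 - |A ∩ B| / |A ∪ B| over the sets of visited categories (order ignored)."""
    sa, sb = set(a), set(b)
    union = sa | sb
    if not union:
        return 0.0  # two empty sessions are treated as identical
    return 1.0 - len(sa & sb) / len(union)


def cosine_distance(u, v):
    """1 - cosine similarity of two equal-length numeric vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = sqrt(sum(x * x for x in u))
    norm_v = sqrt(sum(y * y for y in v))
    if norm_u == 0 or norm_v == 0:
        return 1.0  # assumption: a zero vector is maximally dissimilar
    return 1.0 - dot / (norm_u * norm_v)


def euclidean_distance(u, v):
    """Straight-line distance between two equal-length numeric vectors."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(u, v)))


def average_intra_cluster(clusters, dist):
    """Mean distance over all pairs of sessions that share a cluster."""
    pairs = [dist(a, b) for c in clusters for a, b in combinations(c, 2)]
    return sum(pairs) / len(pairs) if pairs else 0.0


def average_inter_cluster(clusters, dist):
    """Mean distance over all pairs of sessions drawn from different clusters."""
    pairs = [
        dist(a, b)
        for c1, c2 in combinations(clusters, 2)
        for a in c1
        for b in c2
    ]
    return sum(pairs) / len(pairs) if pairs else 0.0


if __name__ == "__main__":
    # Hypothetical sessions: each tuple is the ordered list of news
    # categories one user visited. Cluster membership is assigned by hand.
    toy_clusters = [
        [("frontpage", "news"), ("frontpage", "news", "tech")],
        [("sports", "weather"), ("sports",)],
    ]
    print("intra:", average_intra_cluster(toy_clusters, jaccard_distance))
    print("inter:", average_inter_cluster(toy_clusters, jaccard_distance))
```

A clustering is usually considered better when the average intra-cluster distance is small relative to the average inter-cluster distance; swapping the `dist` argument lets the same two averages be compared across measures.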


