CFP last date
20 May 2024
Reseach Article

Recursive Prefix Suffix Pattern Detection Approach for Mining Sequential Patterns

by Dr. P.Padmaja, P.Naga Jyothi, M.Bhargava
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 29 - Number 3
Year of Publication: 2011
Authors: Dr. P.Padmaja, P.Naga Jyothi, M.Bhargava
10.5120/3541-4848

Dr. P.Padmaja, P.Naga Jyothi, M.Bhargava . Recursive Prefix Suffix Pattern Detection Approach for Mining Sequential Patterns. International Journal of Computer Applications. 29, 3 ( September 2011), 50-53. DOI=10.5120/3541-4848

@article{ 10.5120/3541-4848,
author = { Dr. P.Padmaja, P.Naga Jyothi, M.Bhargava },
title = { Recursive Prefix Suffix Pattern Detection Approach for Mining Sequential Patterns },
journal = { International Journal of Computer Applications },
issue_date = { September 2011 },
volume = { 29 },
number = { 3 },
month = { September },
year = { 2011 },
issn = { 0975-8887 },
pages = { 50-53 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume29/number3/3541-4848/ },
doi = { 10.5120/3541-4848 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:14:52.117704+05:30
%A Dr. P.Padmaja
%A P.Naga Jyothi
%A M.Bhargava
%T Recursive Prefix Suffix Pattern Detection Approach for Mining Sequential Patterns
%J International Journal of Computer Applications
%@ 0975-8887
%V 29
%N 3
%P 50-53
%D 2011
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The need for real-time data mining has long been recognized in various application domains. Some of the applications like customer shopping sequences, medical treatments, natural disasters, telephone calling patterns, Weblog click streams, DNA sequences and gene structures require sequential pattern mining techniques. These techniques find the complete set of frequent subsequences for the given set of sequences and support threshold. Traditional pattern growth based approaches for sequential pattern mining derive length (n+1) pattern based on projected databases of length n-patterns recursively. As result lot of recursions occur which may lead to certain complexities. Thus, in order to reduce the number of iterations, an efficient bidirectional sequential pattern mining approach namely Recursive Prefix Suffix Pattern detection, RPSP algorithm is proposed. The RPSP algorithm first finds all Frequent Itemsets (FI’s) according to the given minimum support and transforms the database such that each transaction is replaced by all the FI’s it contains and then finds the patterns. The pattern further is detected based on ith projected databases, and constructs suffix and prefix databases based on the apriori property. RPSP will increase the number of frequent patterns by reducing the minimum support and vice versa. Recursion is terminated when the detected FI set of prefix or suffix projected database of parent database is null. All the patterns that correspond to a particular ith projected database of transformed database are formed into a set, which is disjoint from all other sets. The union of all the disjoint subsets is the resultant set of frequent patterns. The proposed algorithm was tested on the hypothetical data and results obtained were found satisfactory. Thus, RPSP algorithm can be applicable to many real world sequential data sets.

References
  1. R.Agarwal and R.Srikanth, “Mining Sequential Patterns” ICDE’95, Pg 3-14,1995.
  2. R. Agrawal, and R. Srikant, Fast algorithms for mining association rules, Proc. of 20th Intl. Conf. on VLDB, pp. 487-499, 1994.
  3. G. Grahne and J. Zhu., "Efficiently Using Prefix-trees in Mining Frequent Itemsets," Prof.FIMI'03 Workshop on Frequent Itemset Mining Implementations, 2003.
  4. J. Chen, T. Cook. Mining Contiguous Sequential Patterns from Web Logs. In Proc. Of WWW2007 Poster session, May 8-12, 2007, Banff, Alberta, Canada.
  5. C. Antunes and A. L. Oliveira, “Sequential Pattern Mining Algorithms: Trade-offs between Speed and Memory,” Proc. 2nd Intl. Workshop on Mining Graphs, Trees and Sequences, 2004.
  6. M.Y. Lin and S.Y. Lee, “Fast Discovery of Sequential Patterns through Memory Indexing and Database Partitioning,” J. of Information Science and Engineering, vol. 21,2005.
  7. M. Zaki, “Spade: An Efficient Algorithm for Mining Frequent Sequences,” Machine Learning, vol. 40, pp.31-60, 2001.
  8. Jinlin Chen, “An UpDown Directed Acyclic Graph Approach for Sequential Pattern Mining”, IEEE 2009, vol-ll, Pg 1-16.
  9. D.Y. Chiu, Y.H. Wu and A.L.P. Chen, “An Efficient Algorithm for Mining Frequent Sequences by a New Strategy without Support Counting,” Proc. ICDE 2004, pp. 375, 2004.
  10. J. Pei, J. Han, B. Mortazavi-Asl, H. Pinto, Q. Chen, U. Dayal, and M.C. Hsu,“PrefixSpan: Mining Sequential Patterns Efficiently by Prefix-Projected Pattern Growth,” Proc. 2001 Int’l Conf. Data Eng. (ICDE ’01), pp. 215-224, 2001.
  11. Z. Zhang, Y. Wang, and M. Kitsuregawa, “EffectiveSequential Pattern Mining Algorithms for Dense Database,” Proc. Japanese National Data Engineering WorkShop (DEWS'06). 2006.
  12. Chen, T. Cook. Mining Contiguous Sequential Patterns from Web Logs. In Proc. of WWW2007 Poster session, May 8-12,2007, Banff, Alberta, Canada.
Index Terms

Computer Science
Information Sciences

Keywords

Sequential patterns Frequent itemsets Prefix and Suffix databases Projected database Transformed database