CFP last date
20 May 2024
Reseach Article

Distributed Sequential Pattern Mining: A Survey and Future Scope

by Suhasini Itkar, Uday Kulkarni
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 94 - Number 18
Year of Publication: 2014
Authors: Suhasini Itkar, Uday Kulkarni
10.5120/16461-6187

Suhasini Itkar, Uday Kulkarni . Distributed Sequential Pattern Mining: A Survey and Future Scope. International Journal of Computer Applications. 94, 18 ( May 2014), 28-35. DOI=10.5120/16461-6187

@article{ 10.5120/16461-6187,
author = { Suhasini Itkar, Uday Kulkarni },
title = { Distributed Sequential Pattern Mining: A Survey and Future Scope },
journal = { International Journal of Computer Applications },
issue_date = { May 2014 },
volume = { 94 },
number = { 18 },
month = { May },
year = { 2014 },
issn = { 0975-8887 },
pages = { 28-35 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume94/number18/16461-6187/ },
doi = { 10.5120/16461-6187 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:18:01.950956+05:30
%A Suhasini Itkar
%A Uday Kulkarni
%T Distributed Sequential Pattern Mining: A Survey and Future Scope
%J International Journal of Computer Applications
%@ 0975-8887
%V 94
%N 18
%P 28-35
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Distributed sequential pattern mining is the data mining method to discover sequential patterns from large sequential database on distributed environment. It is used in many wide applications including web mining, customer shopping record, biomedical analysis, scientific research, etc. A large research has been done on sequential pattern mining on various distributed environments like Grid, Hadoop, Cluster, Cloud, etc. Different types of sequential pattern mining can be performed are sequential patterns, maximal sequential patterns, closed sequences, constraint based and time interval based sequential patterns. This paper presents a systematic review on work done for sequential pattern mining and advanced sequential pattern mining on distributed environment. This paper finally presents future research directions related to sequential pattern mining in distributed environment.

References
  1. Agrawal R, Imielinski T, Swami A 1993. "Mining association rules between sets of items in large databases", in Proceedings of ACM-SIGMOD, Washington.
  2. R. Agrawal, R. Srikant, 1995. "Mining Sequential Patterns", in Proceedings of the International Conference on Data Engineering (ICDE), Taipei, Taiwan.
  3. Jiawei Han, Hong Cheng, Dong Xin, Xifeng Yan, 2007. "Frequent pattern mining: current status and future directions". Data Mining Knowledge Discovery.
  4. Han J. , Dong G. , Mortazavi-Asl B. , Chen Q. , Dayal U. , Hsu M. -C. , 2000. "Freespan: Frequent pattern-projected sequential pattern mining", in Proceeding of International Conference on Knowledge Discovery and Data Mining (KDD'2000).
  5. Jian Pei, Jiawei Han, Behzad Mortazavi-Asl, Jianyong Wang, Helen Pinto, Qiming Chen, Umeshwar Dayal, Mei-Chun, 2004. "Mining Sequential Patterns by Pattern-Growth : The PrefixSpan Approach". IEEE Transaction on Knowledge and Data Engineering.
  6. Liu Pei-yu, Gong Wei, Jia Xian, 2011. "An Improved PrefixSpan Algorithm Research for Sequential Pattern". In Proceedings of IT in Medicine and Education (ITME).
  7. Jinlin Chen, 2010. "An UpDown Directed Acyclic Graph Approach for Sequential Pattern Mining". IEEE Transaction on Knowledge and Data Engineering.
  8. Kamber M, Han J, Chiang JY, 1997. "Metarule-guided mining of multi-dimensional association rules using data cubes", in Proceedings of International conference on knowledge discovery and data mining, Newport Beach.
  9. Pasquier N, Bastide Y, Taouil R, Lakhal L, 1999. "Discovering frequent closed itemsets for association rules", in Proceedings of Seventh International conference on database theory (ICDT'99), Jerusalem, Israel.
  10. Pei J, Han J, Mao R, 2000. "CLOSET: an efficient algorithm for mining frequent closed itemsets', in Proceedings of ACM-SI0MOD international workshop data mining and knowledge discovery (DMKD'00), Dallas.
  11. Bayardo RJ, 1998. "Efficiently mining long patterns from databases", in Proceedings of ACM-SIGMOD international conference on management of data (SIGMOD'98), Seattle, WA.
  12. Grahne G, Lakshmanan L, Wang X, 2000. "Efficient mining of constrained correlated sets", in Proceedings of the International conference on data engineering (ICDE'00), San Diego, CA.
  13. Bonchi F, Lucchese C, 2004. "On closed constrained frequent pattern mining", in Proceedings of International conference on data mining (ICDM'04), Brighton, UK.
  14. Liu J, Paulsen S, Sun X, Wang W, Nobel A, Prins J, 2006. "Mining approximate frequent itemsets in the presence of noise: algorithm and analysis", in Proceedings of SIAM international conference on data mining (SDM'06), Bethesda, MD.
  15. Yan, X. , Han, J. , and Afshar, R. , 2003. "CloSpan: Mining closed sequential patterns in large datasets". Third SIAM International Conference on Data Mining (SDM), San Fransico, CA.
  16. Chun-Sheng Wanga, Ying-Ho Liub, Kuo-Chung Chuc. , "Closed inter-sequence pattern mining ". The Journal of Systems and Software, volume 86, 2013.
  17. Jian Pei, Jiawei Han, Wei Wang, "Constraint-based sequential pattern mining: the pattern growth methods", The Journal Intelligent Information System, Vol. 28, No. 2, pp. 133 –160, 2007.
  18. Di Wu, Xiaoxue Wang, Ting Zuo, Tieli Sun, Fengqin Yang , 2010. "A Sequential Pattern Mining algorithm with time constraints based on vertical format". 2nd International Conference on Information Science and Engineering (ICISE), vol. no. 4, pp. 3479-3482.
  19. Burdick D, Calimlim M, Gehrke J, 2001. "MAFIA: a maximal frequent itemset algorithm for transactional databases", in Proceedings of International conference on data engineering (ICDE'01), Heidelberg, Germany, pp 443–452.
  20. Luo C, Chung S. , 2005. "Efficient mining of maximal sequential patterns using multiple samples", in Proceedings of SIAM international conference on data mining (SDM'05), Newport Beach, CA, pp 415–426.
  21. Guralnik V, Karypis G. , 2004. "Parallel tree-projection-based sequence mining algorithms". Parallel Computing, pp 443-472.
  22. Cong S, Han J, Padua D. , 2005. "Parallel mining of closed sequential patterns", in Proceeding of Eleventh ACM SIGKDD International Conference on Knowledge Discovery in Data Mining, Chicago, USA, pp. 562-567.
  23. Chih-Hung Wu, Yu-Chieh Lo, 2006. "Mining Sequential Patterns on a Grid-Computing Environment". IEEE International Conference on Systems, Man, and Cybernetics, Taipei, Taiwan.
  24. Chih-Hung Wu, Chih-Chin Lai, Yu-Chieh Lo, 2012. "An empirical study on mining sequential patterns in a grid computing environment". Expert Systems with Applications, pp. 5748–5757.
  25. Shaochun Wu, Genfeng Wu, Shenjie Jin, 2004. "Pre-Clustering based Sequential Pattern Mining". Fourth International Conference on Computer and Information Technology, pp. 1008-1013.
  26. Chun-Chieh Chen, Chi-Yao Tseng,Ming-Syan Chen, 2013. "Highly Scalable Sequential Pattern Mining Based on MapReduce Model on the Cloud". IEEE International Congress on Big Data.
  27. J. Ayres, J. Gehrke, T. Yu, and J. Flannick, 2002. "Sequential Pattern Mining Using a Bitmap Representation", in Proceedings of International Conference on knowledge Discovery and Data Mining, pp. 429-435.
  28. Wei Yong-qing, Liu Dong, Duan Lin-shan, 2012. "Distributed PrefixSpan Algorithm Based on MapReduce". International Symposium on Information Technology in Medicine and Education.
  29. Kong-Fa-Hu, Chang-Hai Zhang, Ling Chen, 2007. "A Scalable Method of Mining Approximate Multidimensional Sequential Patterns on Distributed Systems", in Proceedings of Sixth International Conference on Machine Learning and Cybernetics, Hong Kong, pp. 19-22.
  30. Xueqiang Wang, Jing Wang, Tengjiao Wang, Hongyan Li, Dongqing Yang, 2010. "Parallel Sequential Pattern Mining by Transaction Decomposition". Seventh International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2010).
  31. Xingquan Zhu, Bin Li, Xindong Wu, Dan He, Chengqi Zhang, 2011. "CLAP: Collaborative pattern mining for distributed information systems". Decision Support Systems 52, pp. 40-51.
  32. Sanjay D. Bhanderi, Sanjay Garg, 2012. "Parallel Frequent Set Mining Using Inverted Matrix Approach". Engineering (NUiCONE), Nirma University International conference.
  33. Changhai Zhang, Kongfa Hu, Zhuxi Chen, Ling Chen, Yisheng Dong, 2007. "ApproxMGMSP: A Scalable Method of Mining Approximate Multidimensional Sequential Patterns on Distributed System". Fourth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2007).
  34. Soon M. Chung, Congnan Luo, 2004. "Distributed Mining of Maximal Frequent Itemsets from Databases on a Cluster of Workstations". IEEE International Symposium on Cluster Computing and the Grid.
  35. Yu Hirate, Hayato Yamana, 2006. "Sequential Pattern Mining with Time Intervals". W. K. Ng, M. Kitsuregawa, J. Li(Eds) : PAKDD, LNAI, PP. 775-779.
  36. Fabian Morchen, 2007. "Unsupervised pattern mining from symbolic temporal data". ACM SIGKDD Explorations Newsletter, Volume 9 issue 1, Pages 41-55.
  37. Yu Hirate, Hayato Yamana, 2009. "Profiling Node Conditions of Distributed System with Sequential Pattern Mining". Software Technologies for Future Dependable Distributed Systems.
  38. Bettahally N. Keshavamurthy, Durga Toshniwal, Bhavani K. Eshwar, "Hiding co-occurring prioritized sensitive patterns over distributed progressive sequential data streams". Journal of Network and Computer Application, pp. 1116–1129,2007.
  39. IBM Quest Data Mining Project. Quest synthetic data generation code, http://www. cs. loyola. edu/~cgiannel/ assoc_gen. html.
  40. J. W. Huang, S. C. Lin, and M. S. Chen, 2010. "DPSP: Distributed Progressive Sequential Pattern Mining on the Cloud". Advances in Knowledge Discovery and Data Mining.
Index Terms

Computer Science
Information Sciences

Keywords

Distributed Sequential Pattern Mining Maximal Patterns Constraint based Patterns Distributed environment.