Research Article

Continuous Prediction of Closed Frequent Itemsets from High speed Distributed Data Streams using Parallel Mining on Manifold Windows with Varying Size

by V. Sidda Reddy, T. V. Rao, A. Govardhan
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 101 - Number 2
Year of Publication: 2014
Authors: V. Sidda Reddy, T. V. Rao, A. Govardhan
10.5120/17662-8479

V. Sidda Reddy, T. V. Rao, A. Govardhan. Continuous Prediction of Closed Frequent Itemsets from High speed Distributed Data Streams using Parallel Mining on Manifold Windows with Varying Size. International Journal of Computer Applications. 101, 2 (September 2014), 34-40. DOI=10.5120/17662-8479

@article{ 10.5120/17662-8479,
author = { V. Sidda Reddy and T. V. Rao and A. Govardhan },
title = { Continuous Prediction of Closed Frequent Itemsets from High speed Distributed Data Streams using Parallel Mining on Manifold Windows with Varying Size },
journal = { International Journal of Computer Applications },
issue_date = { September 2014 },
volume = { 101 },
number = { 2 },
month = { September },
year = { 2014 },
issn = { 0975-8887 },
pages = { 34-40 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume101/number2/17662-8479/ },
doi = { 10.5120/17662-8479 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:30:40.508667+05:30
%A V. Sidda Reddy
%A T. V. Rao
%A A. Govardhan
%T Continuous Prediction of Closed Frequent Itemsets from High speed Distributed Data Streams using Parallel Mining on Manifold Windows with Varying Size
%J International Journal of Computer Applications
%@ 0975-8887
%V 101
%N 2
%P 34-40
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Continuous prediction of closed frequent itemsets from high speed distributed data streams is an active research area, because the time needed to mine consistent itemsets from the current records conflicts with the high rate at which records arrive in the stream. Motivated by our earlier models, we devise a novel closed frequent itemset mining model for high speed distributed data streams. The model is referred to as Parallel Closed Frequent Itemsets Mining (PCFIM) over High Speed Distributed Data Streams by Manifold Varying Size Windows (MVSW). Experimental results indicate that the proposed PCFIM is scalable and robust on high speed data streams and outperforms existing benchmark models.
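
The abstract does not describe how PCFIM works, so the sketch below is not the authors' algorithm. It only illustrates the general notion the abstract relies on: an itemset is closed and frequent in a window when it meets the support threshold and no proper superset has the same support. The function names, the fixed window size, and the brute-force enumeration are all assumptions made for illustration; a single fixed-size sliding window stands in for the paper's manifold varying size windows, and no parallelism is attempted.

```python
from collections import deque
from itertools import combinations


def closed_frequent_itemsets(transactions, min_support):
    """Brute-force closed frequent itemsets of one small window (illustrative only)."""
    counts = {}
    for transaction in transactions:
        items = sorted(set(transaction))
        # Count every non-empty subset of the transaction (exponential; toy sizes only).
        for k in range(1, len(items) + 1):
            for combo in combinations(items, k):
                key = frozenset(combo)
                counts[key] = counts.get(key, 0) + 1

    frequent = {s: c for s, c in counts.items() if c >= min_support}

    closed = {}
    for itemset, support in frequent.items():
        # Closed: no proper superset occurs with exactly the same support.
        has_equal_superset = any(
            itemset < other and support == other_support
            for other, other_support in frequent.items()
        )
        if not has_equal_superset:
            closed[itemset] = support
    return closed


def mine_stream(stream, window_size, min_support):
    """Yield the closed frequent itemsets of each full sliding window."""
    window = deque(maxlen=window_size)  # oldest transaction is evicted automatically
    for transaction in stream:
        window.append(transaction)
        if len(window) == window_size:
            yield closed_frequent_itemsets(window, min_support)


if __name__ == "__main__":
    stream = [["a", "b"], ["a", "b", "c"], ["a", "c"], ["b", "c"]]
    for result in mine_stream(stream, window_size=3, min_support=2):
        print({tuple(sorted(s)): c for s, c in result.items()})
```

Recomputing every window from scratch, as this sketch does, is exactly the cost that stream-oriented methods try to avoid, which is why incremental and parallel approaches are studied for high speed streams.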

Index Terms

Computer Science
Information Sciences

Keywords

Data Streams, Distributed Data Stream, Closed Frequent Itemsets Mining, Sliding Window, Varying Window