CFP last date
20 September 2024
Reseach Article

Privacy Preserving in Data Mining using FP Growth Algorithm on Hybrid Partitioned Dataset

by Harpreet Kaur, Shaveta Angurala
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 147 - Number 3
Year of Publication: 2016
Authors: Harpreet Kaur, Shaveta Angurala
10.5120/ijca2016911021

Harpreet Kaur, Shaveta Angurala . Privacy Preserving in Data Mining using FP Growth Algorithm on Hybrid Partitioned Dataset. International Journal of Computer Applications. 147, 3 ( Aug 2016), 6-9. DOI=10.5120/ijca2016911021

@article{ 10.5120/ijca2016911021,
author = { Harpreet Kaur, Shaveta Angurala },
title = { Privacy Preserving in Data Mining using FP Growth Algorithm on Hybrid Partitioned Dataset },
journal = { International Journal of Computer Applications },
issue_date = { Aug 2016 },
volume = { 147 },
number = { 3 },
month = { Aug },
year = { 2016 },
issn = { 0975-8887 },
pages = { 6-9 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume147/number3/25631-2016911021/ },
doi = { 10.5120/ijca2016911021 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:50:53.515285+05:30
%A Harpreet Kaur
%A Shaveta Angurala
%T Privacy Preserving in Data Mining using FP Growth Algorithm on Hybrid Partitioned Dataset
%J International Journal of Computer Applications
%@ 0975-8887
%V 147
%N 3
%P 6-9
%D 2016
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Data mining is used in various business domains to extract important information from the large data repositories. In this paper, Horizontal and Vertical data distribution is combined to provide privacy to the data. FP Growth algorithm on hybrid partitioned dataset is used to decrease the execution time for generation of rules. The experiments are carried out on the two datasets namely adult and credit dataset and results are predicted on the basis of Apriori and FP Growth algorithm. The experimental results show that the FP Growth algorithm is better in performance than Apriori algorithm in terms of execution time because FP Growth algorithm takes less time to generate rules.

References
  1. K. Liu, H. Kargupta and J. Ryan. Random projection-based multiplicative data perturbation for privacy preserving distributed data mining. IEEE Trans. Knowledge and Data Engg, 18(1):92-106, January 2006.
  2. M. Kantarcioglu and C. Clifton. Privacy-preserving distributed mining of association rules on horizontally partitioned data. In The ACM SIGMOD Workshop on Research Issues on Data Mining and Knowledge Discovery (DMKD’02), pages 24-31, June 2 2002.
  3. Benjamin C. M. Fung , Ke Wang , Rui Chen , Philip S. Yu, Privacy preserving data publishing: A survey of recent developments, ACM Computing Surveys (CSUR), vol. 42, no. 4, pp. 1-53, 2010.
  4. Yin, Yong, Ikou Kaku, Jiafu Tang, and JianMing Zhu. ”Privacy preserving Data Mining,” In Data Mining, pp. 101-119. Springer London, 2011.
  5. Anusuya M ,Sudharani K ,Ganthimathi M ,Sumathi G, “Frequent Itemset Mining Using PFP-Growth via Transaction Splitting”, International Journal of Innovative Research in Computer and Communication Engineering, An ISO 3297: 2007 Certified Organization, Vol. 4, Issue 2, February 2016
  6. Jaideep Vaidya, Senior Member, IEEE, Basit Shafiq, Member, IEEE, Wei Fan, Member, IEEE, Danish Mehmood, and David Lorenzi, “A Random Decision Tree Framework for Privacy Preserving Data Mining”, IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, VOL. 11, NO. 5, SEPTEMBER/OCTOBER 2014
  7. Vikas G. Ashok, K. Navuluri A. Alhafdhi R. Mukkamala, “Dataless Data Mining: Association Rules-based Distributed Privacy-preserving Data Mining”, 2015 12th International Conference on Information Technology - New Generations
  8. Patil Suraj K, Gadage Shrinivas, “Privacy Preserving Two Party Distributed Association Rule Mining by FP Growth on Horizontally Partitioned Data”, International Journal of Innovative Research in Computer and Communication Engineering, (An ISO 3297: 2007 Certified Organization), Vol. 3, Issue 6, June 2015
  9. Asha Khatri, Swati Kabra, Shamsher Singh and Durgesh Kumar Mishra, ”Architecture for Preserving Privacy During Data Mining by Hybridization of Partitioning on Medical Data”, 2010 Fourth Asia International Conference on Mathematical/Analytical Modelling and Computer Simulation
  10. M. Saravanan, A. M. Thoufeeq, S. Akshaya & V.L. Jayasre Manchari, “Exploring New Privacy Approaches in a Scalable Classification Framework”, Data Science and Advanced Analytics (DSAA), 2014 International Conference
  11. Majid Bashir Malik , M. Asger Ghazi and Rashid Ali, “Privacy Preserving Data Mining Techniques: Current Scenario and Future Prospects”, 2012 Third International Conference on Computer and Communication Technology
  12. Jaideep Vaidya, Chris Clifton, “Privacy Preserving Association Rule Mining in Vertically Partitioned Data”, Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining in 2002
  13. Chris Clifton, “Privacy preserving distributed data mining” In ACM SIGKDD Explorations, November 9, 2001.
  14. Gang Kou, Yi Peng1, Yong Shi2, and Zhengxin Chen, “Data mining of medical data using data separation-based technique” Data Science Journal, volume 6, supplement, 30 July 2007, pp S429-S434.
  15. Vassilios S. Verykios, Elisa Bertino, Igor Nai Fovino, Loredana Parasiliti Provenza, Yucel Saygin, Yannis Theodoridis “State-of-the-art in Privacy Preserving Data Mining” In the proceeding of SIGMOD Record, Vol. 33, No. 1, March 2004, pp 50-57.
  16. DANIEL HUNYADI, “Performance comparison of Apriori and FP-Growth algorithms in generating association rules”, Proceedings of the European Computing Conference, Department of Computer Science”Lucian Blaga” University of Sibiu, Romania.
  17. Abdullah Saad Almalaise Alghamdi, “Efficient Implementation of FP Growth Algorithm-Data Mining on Medical Data”, IJCSNS International Journal of Computer Science and Network Security, VOL.11 No.12, December 2011
Index Terms

Computer Science
Information Sciences

Keywords

Apriori algorithm Association rule mining FP Growth algorithm Hybrid Partitioning Privacy preserving data mining