Research Article

OCCT: A One-Class Clustering Tree for Implementing One-to-Many and Many-to-Many Data Linkage

by Manali Pare Guha, Anju Singh, Divaker Singh
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 137 - Number 3
Year of Publication: 2016
10.5120/ijca2016908522

Manali Pare Guha, Anju Singh, Divaker Singh. OCCT: A One-Class Clustering Tree for Implementing One-to-Many and Many-to-Many Data Linkage. International Journal of Computer Applications. 137, 3 (March 2016), 7-10. DOI=10.5120/ijca2016908522

@article{ 10.5120/ijca2016908522,
author = { Manali Pare Guha and Anju Singh and Divaker Singh },
title = { OCCT: A One-Class Clustering Tree for Implementing One-to-Many and Many-to-Many Data Linkage },
journal = { International Journal of Computer Applications },
issue_date = { March 2016 },
volume = { 137 },
number = { 3 },
month = { March },
year = { 2016 },
issn = { 0975-8887 },
pages = { 7-10 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume137/number3/24253-2016908522/ },
doi = { 10.5120/ijca2016908522 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:37:20.167490+05:30
%A Manali Pare Guha
%A Anju Singh
%A Divaker Singh
%T OCCT: A One-Class Clustering Tree for Implementing One-to-Many and Many-to-Many Data Linkage
%J International Journal of Computer Applications
%@ 0975-8887
%V 137
%N 3
%P 7-10
%D 2016
%I Foundation of Computer Science (FCS), NY, USA
Abstract

One-to-many and many-to-many data linkage are essential tasks in data mining. This work implements the One-Class Clustering Tree (OCCT) for one-to-many and many-to-many data linkage in order to identify matching entities across different data sources. Data linkage is the task of linking data between two different databases. In one-to-many data linkage, an entity from the first data set is associated with a group of matching entities from the other data set. In many-to-many data linkage, entities of the same type but of different nature are arranged using the MapReduce method. OCCT was evaluated using data sets from three different domains: recommender systems, data leakage prevention, and fraud detection. In the data leakage prevention domain, the goal is to detect abnormal access. In the recommender system domain, the method is used to match new users of the system with items. In the fraud detection domain, the goal is to identify legitimate transactions performed by users.
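
As a rough illustration of what one-to-many linkage means (this is not the OCCT algorithm itself, which induces a clustering tree), the following sketch links each record in a first data set to the group of records in a second data set that share a linking attribute. The record fields, the `city` attribute, and the `link_one_to_many` helper are all assumptions made for this example.

```python
from collections import defaultdict

def link_one_to_many(table_a, table_b, key):
    """Link each record in table_a to all records in table_b sharing `key`."""
    # Index table_b by the linking attribute so each lookup is a dict access.
    index = defaultdict(list)
    for rec in table_b:
        index[rec[key]].append(rec)
    # Each entity in table_a maps to a (possibly empty) group of matches.
    return {rec["id"]: index.get(rec[key], []) for rec in table_a}

# Hypothetical toy data: users in one source, items in another.
users = [{"id": "u1", "city": "Bhopal"}, {"id": "u2", "city": "Indore"}]
items = [
    {"id": "i1", "city": "Bhopal"},
    {"id": "i2", "city": "Bhopal"},
    {"id": "i3", "city": "Pune"},
]

links = link_one_to_many(users, items, "city")
for user_id, matches in links.items():
    print(user_id, [m["id"] for m in matches])
```

Here one entity (`u1`) is linked to a group of two records, while `u2` has no match. An exact-match join like this only works when both sources share a common attribute, which is the restriction that a learned linkage model aims to relax.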

Index Terms

Computer Science
Information Sciences

Keywords

Clustering, classification, data matching, decision tree induction, MapReduce, data linkage, matching