CFP last date
20 August 2024
Call for Paper
September Edition
IJCA solicits high quality original research papers for the upcoming September edition of the journal. The last date of research paper submission is 20 August 2024

Submit your paper
Know more
Reseach Article

Web Pattern Mining using ECLAT

by Poonam P. Doshi, Emmanuel M.
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 179 - Number 8
Year of Publication: 2017
Authors: Poonam P. Doshi, Emmanuel M.
10.5120/ijca2017916009

Poonam P. Doshi, Emmanuel M. . Web Pattern Mining using ECLAT. International Journal of Computer Applications. 179, 8 ( Dec 2017), 9-14. DOI=10.5120/ijca2017916009

@article{ 10.5120/ijca2017916009,
author = { Poonam P. Doshi, Emmanuel M. },
title = { Web Pattern Mining using ECLAT },
journal = { International Journal of Computer Applications },
issue_date = { Dec 2017 },
volume = { 179 },
number = { 8 },
month = { Dec },
year = { 2017 },
issn = { 0975-8887 },
pages = { 9-14 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume179/number8/28755-2017916009/ },
doi = { 10.5120/ijca2017916009 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T00:54:47.505317+05:30
%A Poonam P. Doshi
%A Emmanuel M.
%T Web Pattern Mining using ECLAT
%J International Journal of Computer Applications
%@ 0975-8887
%V 179
%N 8
%P 9-14
%D 2017
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The use of internet has been increasing day by day. The users can find their resources with the help of different hyperlinks. These usages of Internet have led to the invention of web crawlers. The search engine which helps the user to explore the web is known as Web Crawler. In web crawlers the crawled data can be used to find missing links, community detection in complex networks. The concept of providing accuracy for this is forever in the vein. In this paper, web crawlers: their architecture, process of semantic focused crawling technology, ontology learning, pattern matching, types and various challenges being faced when search engines use the web crawlers, have been reviewed. The web results more relevant to the user query through keyword expansion have been retrieved by the system. This data is being use further for the efficient association rule mining using Eclat Algorithm which is weaved for the vertical transactions based scheme. This process is being powered with Shannon information gain to identify the important words for the frequent pattern mining, and the whole process is being catalyzed by the fuzzy logic classification for more mere pattern identification process.

References
  1. Poonam P. Doshi, Dr. Emmanuel M.: “Feature Extraction Techniques Using Semantic-Based Crawler for Search Engine”, in Proc. of an International Conference on computing, communication and energy systems. ICCCES – 2016, in association with IET, UK and sponsored by TEQIP _ II, 29th 30th Jan 2016.
  2. Trupti V. Udapure, Ravindra D. Kale, Rajesh C. Dharmik “Study of Web Crawler and its Different types”, OSR Journal of Computer Engineering ISSN 2278-8727 Volume 16 Issue 1 Feb 2014.
  3. H. Dong, F. Hussain, and E. Chang, O. Gervasi, D. Taniar, B. Murgante, A. Lagana, Y. Mun, and M. Gavrilova, Eds., “State of the art in semantic focused crawlers”, in Proc. ICCSA 2009, Berlin, Germany, Vol. 5593, pp. 910–924, 2009.
  4. M. Ehrig, A. Maedche, “Ontology-focused crawling of web documents”, in SAC’03: Proceedings of the 2003 ACM symposium on Applied computing, ACM Press, New York, NY, USA, pp. 1174–1178, 2003.
  5. A. Maedche, M. Ehrig, S. Handschuh, L. Stojanovic, R. Volz, “Ontology-focused crawling of documents and relational metadata”, in Proceedings of the11th International World Wide Web Conference WWW-2002, Hawaii, 2002.
  6. Prashant Dahiwale, M. M. Raghuwanshi, Latesh Malik, “Design and Implementation of Focused Web Crawler Using Genetic Algorithm: An Approach to Web Mining”, International Journal of Scientific & Engineering Research, Vol. 6, no. 6, June-2015
  7. Vijayashri Losarwar, Madhuri Joshi, “Data Preprocessing in Web Usage Mining”, International Conference on Artificial Intelligence and Embedded Systems (ICAIES'2012) July 15-16, 2012.
  8. W. Wong, W. Liu, and M. Bennamoun, “Ontology learning from text: A look back and into the future”, ACM Computer. Surveys, Vol. 44, pp.20:1–36, 2012.
  9. Dong, H., Hussain, F. K., Chang, “E.: State of the art in semantic focused crawlers Computational Science and Its Applications”, – ICCSA 2009. Springer-Verlag, Seoul, Korea (July 2009) pp. 910-924
  10. Soumen Chakrabarti, “Mining the Web Discovering knowledge from hypertext data”, Boston: Elsevier, 2012.
  11. H. T. Zheng, B. Y. Kang, and H. G. Kim, “An ontology-based approach to learnable focused crawling”, Inf. Sciences, Vol. 178, pp.4512–4522, 2008.
  12. C. Su, Y. Gao, J. Yang, and B. Luo, “An efficient adaptive focused crawler based on ontology learning”, in Proc. 5th International Conference, Hybrid Intell. System. (HIS ’05), Rio de Janeiro, Brazil, 2005, pp. 73–78.
  13. Slimani, Thabet, and Amor Lazzez. "Efficient Analysis of Pattern and Association Rule Mining Approaches", Ar Xiv preprint ar Xiv: 1402. 2892 (2014).
  14. Khurana, Dhiraj, and Satish Kumar. "Web Crawler: A Review", IJCSMS International Journal of Computer Science & Management Studies 12.01 (2012).
  15. Jain, Nidhi, and Paramjeet Rawat. "A Study of Focused Web Crawlers for Semantic Web", International Journal of Computer Science and Information Technologies 4.2 (2013): 398-402
  16. Sonali Abhane, P. D. Lambhate “Enriching Web Interesting Pattern Mining Using Vertical Transaction Process”, International Journal of Science and Research (IJSR) ISSN (Online): 2319-7064, 2016
  17. Dr. Emmanuel M., Mr. Saurabh M Khatri, Dr. Ramesh Babu D. R. “A Novel scheme for Term weighting in Text Categorization: Positive Impact factor”, IEEE International Conference on Systems, Man, and Cybernetics, 2013
  18. A. Arasu, J. Cho, H. Garcia-Molina, A. Paepcke, and S. Raghavan. “Searching the Web”. ACM Transactions on Internet Technology, 1(1), 2001
  19. Dr Rajender Nath, Khyati Chopra, “Web Crawlers: Taxonomy, Issues & Challenges”, International Journal of Advanced Research in Computer Science and Software Engineering, Volume 3, Issue 4, April 2013.
  20. Ntoulas, A., Cho, J., and Olston, C. “What’s new on the Web? The evolution of the Web from a search engine perspective”,WWW04,1-12,2004.
Index Terms

Computer Science
Information Sciences

Keywords

Web crawler Shannon information gain Association Rules Eclat Algorithm and Fuzzy.