CFP last date
20 May 2024
Reseach Article

An Efficient Algorithm for Web Page Change Detection

by Srishti Goel, Rinkle Rani Aggarwal
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 48 - Number 10
Year of Publication: 2012
Authors: Srishti Goel, Rinkle Rani Aggarwal
10.5120/7386-0173

Srishti Goel, Rinkle Rani Aggarwal . An Efficient Algorithm for Web Page Change Detection. International Journal of Computer Applications. 48, 10 ( June 2012), 28-33. DOI=10.5120/7386-0173

@article{ 10.5120/7386-0173,
author = { Srishti Goel, Rinkle Rani Aggarwal },
title = { An Efficient Algorithm for Web Page Change Detection },
journal = { International Journal of Computer Applications },
issue_date = { June 2012 },
volume = { 48 },
number = { 10 },
month = { June },
year = { 2012 },
issn = { 0975-8887 },
pages = { 28-33 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume48/number10/7386-0173/ },
doi = { 10.5120/7386-0173 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:43:44.872787+05:30
%A Srishti Goel
%A Rinkle Rani Aggarwal
%T An Efficient Algorithm for Web Page Change Detection
%J International Journal of Computer Applications
%@ 0975-8887
%V 48
%N 10
%P 28-33
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Internet is actively used for the exchange of information. People upload the web pages and updating the new web pages very frequently. There is a frequent change in the content of the web page hence it become necessary to develop an efficient system which could detect these changes efficiently and in the minimum browsing time. So as to achieve this we compare the old web page and the new web page. Changes in a web page can be detected with the use of various algorithms. Various tools and services are also available which can be used to detect these changes. In this paper a new algorithm for the structural as well as content change detection has been proposed and described. For better results tree has been designed for the corresponding web pages. The proposed change detection algorithm is based on assigning hash value to each leaf node and tag value to the non leaf nodes. Bottom up approach has been used for assignment. The level of each node has been used to find hash values and modification in a node. It has been shown with the help of suitable examples that the proposed algorithm extracts the changes very efficiently from the various web pages.

References
  1. Buneman P. , Davidson S. , Fan W. , Hara C. , and Tan W. Aug 2002"Keys for XML" Proceedings of international conference on Computer Networks, 39 (5), 473–487.
  2. Chawathe S. , Rajaraman A. , Garcia-Molina H. and Widom June 1996, "Change Detection in Hierarchically Structured Information", Proceedings of the ACM SIGMOD International Conference on Management of Data, 25(2), 493-504.
  3. Chawathe S. , Garcia-Molina H. May 1997"Meaningful Change detection in structured data", proceeding in ACM SIGMOD International conference, 26-37.
  4. Liu, L. , Pu C. , and Tang W. "WebCQ: 2000 Detecting and Delivering Information Changes on the Web". In Proceedings of International Conference on Information and Knowledge Management, pp. 512-519.
  5. Lu, B. , Hui C. B. , and Zhang Y. 2002. Personalized "Information Monitoring over the Web". in First International Conference on Information Technology and Applications(ICITA).
  6. Available at: Mind-it, http://www. netmind. com.
  7. Leonardi E. , Sourav Bhownick S. , "Detecting Content Changes on Ordered XML Documents Using Relational Databases".
  8. Yadav D. 2009"Design of A Novel Incremental Parallel web crawler" Phd thesis, Jaypee Institute of Information Technology University, 2009.
  9. H. P. Khandagale H. P. and Halkarnikar P. P. " A Novel Approach for Web Page Change Detection System" June 2010, International Journal of Computer Theory and Engineering, 2(3), 793-8201.
Index Terms

Computer Science
Information Sciences

Keywords

Web Page Change Detection Tag Of Node Tree Matching Hash Value Change Monitoring