CFP last date
20 May 2024
Reseach Article

Context based Web Indexing for Storage of Relevant Web Pages

by Nidhi Tyagi, Rahul Rishi, R. P. Agarwal
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 40 - Number 3
Year of Publication: 2012
Authors: Nidhi Tyagi, Rahul Rishi, R. P. Agarwal
10.5120/5021-7166

Nidhi Tyagi, Rahul Rishi, R. P. Agarwal . Context based Web Indexing for Storage of Relevant Web Pages. International Journal of Computer Applications. 40, 3 ( February 2012), 1-5. DOI=10.5120/5021-7166

@article{ 10.5120/5021-7166,
author = { Nidhi Tyagi, Rahul Rishi, R. P. Agarwal },
title = { Context based Web Indexing for Storage of Relevant Web Pages },
journal = { International Journal of Computer Applications },
issue_date = { February 2012 },
volume = { 40 },
number = { 3 },
month = { February },
year = { 2012 },
issn = { 0975-8887 },
pages = { 1-5 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume40/number3/5021-7166/ },
doi = { 10.5120/5021-7166 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:27:04.762842+05:30
%A Nidhi Tyagi
%A Rahul Rishi
%A R. P. Agarwal
%T Context based Web Indexing for Storage of Relevant Web Pages
%J International Journal of Computer Applications
%@ 0975-8887
%V 40
%N 3
%P 1-5
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

A focused crawler downloads web pages that are relevant to a user specified topic. The downloaded documents are indexed with a view to optimize speed and performance in finding relevant documents for a search query at the search engine side. However, the information will be more relevant if the context of the topic is also made available to the retrieval system. This paper proposes a technique for indexing the keyword extracted from the web documents along with their contexts wherein it uses a height balanced binary search (AVL) tree, for indexing purpose to enhance the performance of the retrieval system.

References
  1. Diligenti M., Coetzee F.M., Lawrence S., Giles C.L.and Gori M., “Focused Crawling using context graphs”, Proc. International Conference on Very Large Databases (VLDB ’00), pp. 527-534, 2000.
  2. Yang Yongsheng and Wang Hui, “Implementation of Focused Crawler”, COMP630D Course Project Report.
  3. A.K.Sharma, “Data Structures using C”, Pearson publication, 2011.
  4. Fabrizio Silvestri, Raffaele Perego and Salvatore Orlando ”Assigning Document Identifiers to Enhance Compressibility of Web Search Engines Indexes”. Proceedings of SAC, 2004.
  5. Oren Zamir and Oren Etzioni “Web Document Clustering: A feasibility demonstration”. Proceedings of SIGIR, 1998.
  6. Changshang Zhou, Wei Ding and Na Yang, “Double Indexing Mechanism of Search Engine based on Campus Net”, Proceedings of the 2006 IEEE Asia-Pacific Conference on Services Computing (APSCC'06), 2006.
  7. Naresh Chauhan and A. K. Sharma,” Design of an Agent Based Context Driven Focused Crawler”,BVICAM’S International Journal of Information Technology, pp 61-66, 2008.
  8. Parul Gupta and A.K.Sharma,” Context based Indexing in Search Engines using Ontology”, International Journal of Computer Applications, Volume 1 No. 14, pp 49-52, 2010.
  9. Steve Lawrence, “Context in Web Search”, IEEE Data Engineering Bulletin, 2000.
  10. Wang Jicheng, Huang Yuan, Wu Gangshan and Zhang Fuyan, “Web Mining: Knowledge Discovery on the Web”, IEEE International Conference, Tokyo, 1999.
  11. O. Zamir, O. Etzioni, O. Madanim, and R.M. Karp “Fast and Intuitive Clustering of Web Documents,” Proceeding Third International Conference Knowledge Discovery and Data Mining, pp. 287-290, Aug. 1997.
Index Terms

Computer Science
Information Sciences

Keywords

AVL tree contextual repository balance factor