Call for Paper - August 2022 Edition
IJCA solicits original research papers for the August 2022 Edition. Last date of manuscript submission is July 20, 2022. Read More

Timestamp based Recrawling Technique (TSBCT)

Print
PDF
International Journal of Computer Applications
© 2012 by IJCA Journal
Volume 45 - Number 22
Year of Publication: 2012
Authors:
Babita Ahuja
Neelu Chaudhary
10.5120/7081-9533

Babita Ahuja and Neelu Chaudhary. Article: Timestamp based Recrawling Technique (TSBCT). International Journal of Computer Applications 45(22):23-26, May 2012. Full text available. BibTeX

@article{key:article,
	author = {Babita Ahuja and Neelu Chaudhary},
	title = {Article: Timestamp based Recrawling Technique (TSBCT)},
	journal = {International Journal of Computer Applications},
	year = {2012},
	volume = {45},
	number = {22},
	pages = {23-26},
	month = {May},
	note = {Full text available}
}

Abstract

In this era of digital tsunami of information on the web, everyone is completely dependent on the WWW for information retrieval. Most of the information is hidden behind the query interface. In the query interface the user types the keyword to access the web pages. These pages are known as the Hidden web, Invisible Web or Dark Web. Such kind of web pages cannot be indexed by the Search Engines. As these are not indexed by the search engines these pages cannot be returned and displayed to the users. This paper discusses the various reasons due of which they are not indexed by the search engines and the possible solutions for these reasons.

References

  • Sriram Raghavan Hector Garcia-Molina Computer Science Department Stanford University Stanford, CA 94305, USA, "Crawling the HiddenWeb"
  • Rosy Madaan / (IJCSE) International Journal on Computer Science and Engineering Vol. 02, No. 03, 2010, 753-758, "A Framework for Incremental Hidden Web Crawler"
  • Ping Wu Ji-Rong Wen, Huan Liu, Wei-Ying Ma "Query Selection Techniques for Efficient Crawling of Structured Web Sources"
  • Jian Qiu, Feng Shao, Misha Zatsman, Jayavel Index Structures for Querying the Deep Web, Workshop on the Web and Databases (WebDB), 2003, 79-86
  • Ntoulas, A. , Zerfos, P. , Cho, J. Downloading Textual Hidden Web Content Through Keyword Queries. In Proceedings of the 5th ACM/IEEE Joint Conference on Digital Libraries (JCDL05). 2005.
  • Chang, K; He, B; Zhang, Z. (2005). Toward Large Scale Integration: Building a MetaQuerier over Databases on the Web. CIDR, pp44-55