DOI: 10.5120/7081-9533
Babita Ahuja and Neelu Chaudhary. Article: Timestamp based Recrawling Technique (TSBCT). International Journal of Computer Applications 45(22):23-26, May 2012. Full text available. BibTeX
@article{key:article,
  author = {Babita Ahuja and Neelu Chaudhary},
  title = {Article: Timestamp based Recrawling Technique (TSBCT)},
  journal = {International Journal of Computer Applications},
  year = {2012},
  volume = {45},
  number = {22},
  pages = {23--26},
  month = {May},
  note = {Full text available}
}
Abstract
In this era of a digital tsunami of information on the web, everyone depends on the WWW for information retrieval. Most of this information is hidden behind query interfaces, where the user types keywords to access the web pages. These pages are known as the Hidden Web, Invisible Web, or Dark Web. Search engines cannot index such pages, and so cannot return them or display them to users. This paper discusses the reasons why these pages are not indexed by search engines and possible solutions.
References
- Sriram Raghavan and Hector Garcia-Molina, "Crawling the Hidden Web", Computer Science Department, Stanford University, Stanford, CA 94305, USA
- Rosy Madaan, "A Framework for Incremental Hidden Web Crawler", (IJCSE) International Journal on Computer Science and Engineering, Vol. 02, No. 03, 2010, 753-758
- Ping Wu, Ji-Rong Wen, Huan Liu, Wei-Ying Ma, "Query Selection Techniques for Efficient Crawling of Structured Web Sources"
- Jian Qiu, Feng Shao, Misha Zatsman, Jayavel Index Structures for Querying the Deep Web, Workshop on the Web and Databases (WebDB), 2003, 79-86
- Ntoulas, A., Zerfos, P., Cho, J. Downloading Textual Hidden Web Content Through Keyword Queries. In Proceedings of the 5th ACM/IEEE Joint Conference on Digital Libraries (JCDL05). 2005.
- Chang, K.; He, B.; Zhang, Z. (2005). Toward Large Scale Integration: Building a MetaQuerier over Databases on the Web. CIDR, pp. 44-55.