CFP last date
20 May 2024
Reseach Article

Web Searching With Logarithmic and Probability Measure

by S. Subatra Devi, P. Sheik Abdul Khader
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 64 - Number 6
Year of Publication: 2013
Authors: S. Subatra Devi, P. Sheik Abdul Khader
10.5120/10638-5380

S. Subatra Devi, P. Sheik Abdul Khader . Web Searching With Logarithmic and Probability Measure. International Journal of Computer Applications. 64, 6 ( February 2013), 16-19. DOI=10.5120/10638-5380

@article{ 10.5120/10638-5380,
author = { S. Subatra Devi, P. Sheik Abdul Khader },
title = { Web Searching With Logarithmic and Probability Measure },
journal = { International Journal of Computer Applications },
issue_date = { February 2013 },
volume = { 64 },
number = { 6 },
month = { February },
year = { 2013 },
issn = { 0975-8887 },
pages = { 16-19 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume64/number6/10638-5380/ },
doi = { 10.5120/10638-5380 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:15:41.221288+05:30
%A S. Subatra Devi
%A P. Sheik Abdul Khader
%T Web Searching With Logarithmic and Probability Measure
%J International Journal of Computer Applications
%@ 0975-8887
%V 64
%N 6
%P 16-19
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The web is a huge and highly dynamic environment which is growing exponentially in content and developing fast in structure. No search engine can cover the whole web, but it has to focus on the most valuable pages for crawling. Many methods have been developed based on link and text analysis for retrieving the pages. In this paper, an algorithm based on link, text, logarithmic distance and probabilistic measure is presented to find the relevancy of the web pages. Here, the most relevant pages are retrieved. It has been proved experimentally that this method provides more number of relevant pages.

References
  1. G. Almpanidis, C. Kotropoulos, I. Pitas, September 2007. Combining text and link analysis for focused crawling—An application for vertical search engines. Information Systems, Vol 32, No: 6, pp: 886-908.
  2. Zhumin Chen; Jun Ma; Jingsheng Lei; Bo Yuan; Li Lian, Aug 24-27, 2007. An Improved Shark-Search Algorithm Based on Multi-information. Fourth International Conference on Fuzzy Systems and Knowledge Discovery, pp: 659 – 658.
  3. J. Jayanthi, Dr. K. S. Jayakumar, January 2011. An Integrated Page Ranking Algorithm for Personalized Web Search, International Journal of Computer Applications, Vol 12 – No. 11.
  4. Sotiris Batsakis, Euripides G. M. Petrakis, Evangelos Milios, Improving the Performance of Focused Web Crawlers, Data & Knowledge Engineering, Vol: 68, No: 10, pp: 1001-1013, October 2009.
  5. Blaž Novak, Survey of focused web crawling algorithms, in Proceedings of SIKDD, pp. 55-58, 2004.
  6. Shalin shah, Spe 2006. Implementing an Effective Web Crawler.
  7. Yang Yongsheng, Wang Hui , Implementation of Focused crawler, Journal of computers Vol. 6, No: 1, January 2011.
  8. Pant, G. , Srinivasan, P. , Menczer, F. , Crawling the Web. Web Dynamics: Adapting to Change in Content, Size, Topology and Use, edited by M. Levene and A. Poulovassilis, Springer- verlog, pp: 153-178, November 2004.
  9. Debashis Hati and Amritesh Kumar, An Approach for Identifying URLs Based on Division Score and Link Score in Focused Crawler, International Journal of Computer Applications, Vol. 2, no. 3, May 2010.
  10. A. Rungsawang, N. Angkawattanawit, Learnable topic-specific web crawler. Journal of Network and Computer Applications, Issue no:28,page no:97-114,2005
  11. Sandeep Sharma, Mr. Ravinder Kumar, Web-Crawling Approaches in Search Engines, June 2008.
  12. Michael Hersovici, Michal Jacovi, Yoelle S. Maarek, Dan Pelleg, Menachem Shtalhaim and Sigalit Ur, The shark-search algorithm. An application: tailored Web site mapping, in Proceedings of the Seventh International World Wide Web Conference on Computer Networks and ISDN Systems, Vol. 30, no. 1-7, pp. 317-326, April 1998.
  13. Brin, S. , & Page, L. The anatomy of a large-scale hyper textual web search engine. In Proceedings of the seventh international conference on World Wide Web (WWW), pp: 107–117, 1998.
  14. P. De Bra, G-J Houben, Y. Kornatzky, and R. Post, Information Retrieval in Distributed Hypertexts, in the Proceedings of RIAO'94, Intelligent Multimedia, Information Retrieval Systems and Management, New York, NY, 1994.
  15. Lili Yan, Wencai Du, Yingbin Wei, and Henian Chen, A Novel Heuristic Search Algorithm Based on Hyperlink and Relevance Strategy for Web Search, Advances in Intelligent and Soft Computing Volume 149, 2012, pp 97-102.
  16. S. Chakrabarti, M. van den Berg, and B. Dom, Focused Crawling: A New Approach for Topic-Specific Resource Discovery, In Proc. 8th WWW, 1999.
  17. M. Najork and J. L. Wiener. Breadth-first crawling yields high-quality pages, In Proceedings of the Tenth Conference on WorldWideWeb, Hong Kong, Elsevier Science, May 2001, pp. 114–118.
  18. Aggarwal C. Garawi F. Yu P. Intelligent crawling on the world wide web with arbitrary predicates, In: Proceedings of the 10th International World Wide Web Conference. Hongkong: 2001. p. 96-105.
  19. A. Rungsawang, N. Angkawattanawit, Learnable Crawling: An Efficient Approach to Topic-specific Web Resource Discovery, 2005.
  20. R. Steele, Techniques for specialized search engines, Directory of Open Access Journals, 2001.
Index Terms

Computer Science
Information Sciences

Keywords

Search engine Logarithmic distance Probabilistic measure