CFP last date
20 May 2024
Reseach Article

A Score based Web Page Ranking Algorithm

by M. Shamiul Amin, Shaily Kabir, Rasel Kabir
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 110 - Number 12
Year of Publication: 2015
Authors: M. Shamiul Amin, Shaily Kabir, Rasel Kabir
10.5120/19367-1035

M. Shamiul Amin, Shaily Kabir, Rasel Kabir . A Score based Web Page Ranking Algorithm. International Journal of Computer Applications. 110, 12 ( January 2015), 11-15. DOI=10.5120/19367-1035

@article{ 10.5120/19367-1035,
author = { M. Shamiul Amin, Shaily Kabir, Rasel Kabir },
title = { A Score based Web Page Ranking Algorithm },
journal = { International Journal of Computer Applications },
issue_date = { January 2015 },
volume = { 110 },
number = { 12 },
month = { January },
year = { 2015 },
issn = { 0975-8887 },
pages = { 11-15 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume110/number12/19367-1035/ },
doi = { 10.5120/19367-1035 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:46:09.719389+05:30
%A M. Shamiul Amin
%A Shaily Kabir
%A Rasel Kabir
%T A Score based Web Page Ranking Algorithm
%J International Journal of Computer Applications
%@ 0975-8887
%V 110
%N 12
%P 11-15
%D 2015
%I Foundation of Computer Science (FCS), NY, USA
Abstract

With the explosive growth of information in the Web, users face difficulties while finding their desired information. Search engine helps the user by retrieving useful information from this huge collection based on his/her search query and presents a list of relevant web pages as a search result. However, without proper ranking of pages in the result through the relevancy of pages to the search query, the user may need to explore the whole list for discovering the appropriate page(s), thereby involving huge search time. Although a number of ranking algorithms such as HITS, PageRank, Weighted PageRank and etc. , are developed to assist the search engine, but none of them provides page ranking with high accuracy. In this paper, we propose a score-based web page ranking algorithm involving web content mining and usage information of the pages. Our algorithm considers both syntactical and semantic matches of the search query to the pages. For a web page, syntactical score is calculated based on the total number of exact matches of the search words in the page. Besides, semantic score is measured using synonym matches of the search words. Moreover, we incorporate the usage information of the pages as page popularity in order to comprise the user interest in the ranking order. The total relevant score of each page is calculated using the summation of the syntactical and semantic scores of the page and its page popularity. Finally, the pages are ranked according to their total relevant score. Based on several performance evaluation measures, experimental results show considerable improvement in the page ranking using our algorithm as compared to other known approaches.

References
  1. Han, J. , Kamber, M. (2006). Data Mining, Southeast Asia Edition: Concepts and Techniques. Morgan kaufmann.
  2. Lappas, G. (2008). An overview of web mining in societal benefit areas. Online Information Review, 32(2), 179-195.
  3. Chakrabarti, S. , Van den Berg, M. , Dom, B. (1999). Focused crawling: a new approach to topic-specific Web resource discovery. Computer Networks, 31(11), 1623-1640.
  4. Kobayashi, M. , Takeda, K. (2000). Information retrieval on the web. ACM Computing Surveys (CSUR), 32(2), 144-173.
  5. Mayfield, J. , McNamee, P. , Piatko, C. D. (1999). The JHU/APL HAIRCUT System at TREC-8. In TREC.
  6. Kim, S. , Kwon, J. (2007). Effective context-aware recommendation on the semantic web. International Journal of Computer Science and Network Security, 7(8), 154-159.
  7. Cho, J. , Garcia-Molina, H. , Page, L. (1998). Efficient crawling through URL ordering. Computer Networks and ISDN Systems, 30(1), 161-172.
  8. Brin, S. , Page, L. (1998). The anatomy of a large-scale hypertextual Web search engine. Computer networks and ISDN systems, 30(1), 107-117.
  9. Beigi, M. , Benitez, A. B. , Chang, S. F. (1997, December). MetaSEEk: a content-based metasearch engine for images. In Photonics West'98 Electronic Imaging (pp. 118-128). International Society for Optics and Photonics.
  10. Nguyen, S. T. (2009, December). Efficient web usage mining process for sequential patterns. In Proceedings of the 11th International Conference on Information Integration and Web-based Applications & Services (pp. 465-469). ACM. Cooley, R. , Mobasher, B. , Srivastava, J. (1997, November). Web mining: Information and pattern discovery on the world wide web. In Tools with Artificial Intelligence, 1997. Proceedings. Ninth IEEE International Conference on (pp. 558-567). IEEE.
  11. Ramulu, V. S. , Kumar, C. N. S. , Reddy, K. S. A Study of Semantic Web Mining: Integrating Domain Knowledge into Web Mining. International Journal of Soft Computing and Engineering (IJSCE), 2(3).
  12. Chakravarthy, A. (2005). Mining the Semantic Web.
  13. Kleinberg, J. M. (1999). Authoritative sources in a hyperlinked environment. Journal of the ACM (JACM), 46(5), 604-632.
  14. Bharat, K. , Henzinger, M. R. (1998, August). Improved algorithms for topic distillation in a hyperlinked environment. In Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval (pp. 104-111). ACM.
  15. Nomura, S. , Oyama, S. , Hayamizu, T. , Ishida, T. (2004). Analysis and improvement of HITS algorithm for detecting Web communities. Systems and Computers in Japan, 35(13), 32-42.
  16. Page, L. , Brin, S. , Motwani, R. , Winograd, T. (1999). The PageRank citation ranking: Bringing order to the web.
  17. PageRank algorithm. http://pr. efactory. de/e-pagerank-algorithm. shtml. Accessed: 2014-04-15.
  18. Karpeles M. 2009. Modeling and optimizing hyper textual search engines, based on the research of Larry page and sergey brin. Yunfei Zhao Department of Computer Science, University of Vermon Slide from spring.
  19. Miller, G. A. , Beckwith, R. , Fellbaum, C. , Gross, D. , & Miller, K. J. (1990). Introduction to wordnet: An on-line lexical database*. International journal of lexicography, 3(4), 235-244.
  20. Powers, D. M. (2011). Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation.
Index Terms

Computer Science
Information Sciences

Keywords

HITS PageRank Score Based Page Rank (SBPR).