CFP last date
22 April 2024
Reseach Article

A New Approach to Organize the Results of Searching the Web, using a Combination of Ranking and Genetic Structure-based Clustering

by Belal Rostami, Shahriar Lotfi
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 89 - Number 6
Year of Publication: 2014
Authors: Belal Rostami, Shahriar Lotfi
10.5120/15510-4347

Belal Rostami, Shahriar Lotfi . A New Approach to Organize the Results of Searching the Web, using a Combination of Ranking and Genetic Structure-based Clustering. International Journal of Computer Applications. 89, 6 ( March 2014), 34-40. DOI=10.5120/15510-4347

@article{ 10.5120/15510-4347,
author = { Belal Rostami, Shahriar Lotfi },
title = { A New Approach to Organize the Results of Searching the Web, using a Combination of Ranking and Genetic Structure-based Clustering },
journal = { International Journal of Computer Applications },
issue_date = { March 2014 },
volume = { 89 },
number = { 6 },
month = { March },
year = { 2014 },
issn = { 0975-8887 },
pages = { 34-40 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume89/number6/15510-4347/ },
doi = { 10.5120/15510-4347 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:08:34.267137+05:30
%A Belal Rostami
%A Shahriar Lotfi
%T A New Approach to Organize the Results of Searching the Web, using a Combination of Ranking and Genetic Structure-based Clustering
%J International Journal of Computer Applications
%@ 0975-8887
%V 89
%N 6
%P 34-40
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Web mining means searching the Web for find specific information. Web mining operation should be done in a way to give the best results to the user. Two of the best methods in this area are clustering and ranking Web pages. The hereby-proposed method is a new approach which is a combination of the above-mentioned methods. In the proposed method, first, the Web graph is clustered in two phases, based on structural equivalences; next, each cluster is scored according to its value; then, ranking is done on all present pages in the clusters; and, finally, the final rank of each Web page would be the result of multiplying these two values. In the end, Web pages will be presented to the user based on their final rank. The results obtained from the comparison of the proposed algorithm (GCRM) with other methods indicate a good performance of this algorithm in finding high quality Web pages. Since quality is the main parameter in Web mining, main effort in GCRM algorithm is on increasing the quality of found pages, where, according to the results in this area, GCRM has been successful.

References
  1. Batagelj V. , Mrvar A. , Ferligoj A. and Doreian P. , "Generalized Blockmodeling with Pajek," Metodolo?ski zvezki , pp. 455-467, 2004.
  2. Batagelj V. , "Notes on blockmodeling," Social Network, Vol. 100, No. 19, pp. 143-155, 1997.
  3. Carolyn J. A. and Wasserman S. , "Building stochastic blockmodels," Social Networks, pp. 137-161, 1992.
  4. Cason T. P. , September 2012, Role Extraction in Networks, PHD Thesis, computer faculty of University catholique de Louvain.
  5. Douglas R. W. and Karl P. R. , "Graph and Semigroup Homomorphisms on Networks of Relations," Social Networks, pp. 193-234, 1983.
  6. Duhan N. and Sharma A. K. , "A Novel Approach for Organizing Web Search Results using Ranking and Clustering," International Journal of Computer Applications, Vol. 5, No. 10, pp. 8887-8896, 2010.
  7. Faust K. and Wasserman S. , "Blockmodels: Interpretation and evaluation," Social Networks, pp. 5-1, 1992.
  8. Guenoche A. , "Comparing Recent Methods in Graph Partitioning," Electronic Notes in Discrete Mathematics, Vol. 22, pp. 83–89, 2005.
  9. Grabmeier J. and Rudolph A. , "Techniques of Cluster Algorithms in Data Mining," Data Mining and Knowledge Discovery, pp. 303–360, 2002.
  10. Ishii H. , Tempo R. and Wei Bai E. , "A Web Aggregation Approach for Distributed Randomized PageRank Algorithms," IEEE Transactions on Automatic Control, pp. 1203-1232, 2012.
  11. Jain R. and Purohit G. N. , "Page Ranking Algorithms for Web Mining," International Journal of Computer Applications, Vol. 13, No. 5, pp. 8887–8891, 2011
  12. Jessop A. , "Blockmodels with Maximum Concentration," European journal of operational research, pp. 56-64, 2008.
  13. Kamvar S. , Haveliwala T. and Golub G. , "Adaptive Methods for the Computation of PageRank," Linear Algebra and its Applications, Vol. 386, No. 19, pp. 51–65, 2004.
  14. Kohmot K. , Katayama K. and Hiroyuki N. , "Performance of a Genetic Algorithm for the Graph Partitioning Problem," Mathematical and Computer Modeling, Vol. 38, pp. 1325–1332, 2003.
  15. Lorrain F. and White H. C. , "Structural Equivalence of Individuals in Social Networks," The Journal of Mathematical Sociology, pp. 49-80, 2012.
  16. Murugesan K. and Zhang J. , "Hybrid Hierarchical Clustering: an Experimental Analysis," The Journal of Mathematical Sociology, pp. 01-11, 2011.
  17. Page L. , "The PageRank Citation Ranking: Bringing Order to the Web," Technical Report, Computer Science Department, Stanford University, 2000.
  18. Schaeffer S. E. , "Graph Clustering," Computer Science Review, Vol. 1, pp. 27–64, 2007.
  19. Shaojie Q. , Tianrui L. , Hong L. and Hongmei C. , "A New Blockmodeling based Hierarchical Clustering Algorithm for Web Social Networks," Engineering Applications of Artificial Intelligence, Vol. 10, No. 16, pp. 1-9, 2012.
  20. Tormen C. , Leiserson C. , Rivest R. and Stein C. , Introduction to Algorithms, McGraw-Hill, 2001.
  21. Weining Q. and Aoying Z. , "Analyzing Popular Clustering Algorithms from Different Viewpoints," Journal of Software, pp. 1382–1392, 2002.
  22. Wu X. , Kumar V. , Ross Quinlan J. and Ghosh J. , "Top 10 Algorithms in Data Mining," Knowl Inf Syst, Vol. 10, No. 1007, pp. 1-37, 2008.
  23. Yan L. , Gui G. , Du W. and Guo Q. , "An Improved PageRank Method based on Genetic Algorithm for Web Search," Procedia Engineering, Vol. 15, No. 34, pp. 2983– 2987, 2011.
  24. Zareh Bidoki A. M. and Yazdani N. , "DistanceRank: an Intelligent Ranking Algorithm for Web Pages," Information Processing and Management, Vol. 44, No. 10, pp. 877–892, 2008.
  25. Zareh Bidoki A. M. , Oroumchian F. , Ghodsnia P. and Yazdani N. , "A3CRank: an Adaptive Ranking Method base on Connectivity, Content and Click-through Data," Information Processing & Management, Vol. 46, No. 2, pp. 159-169, 2010.
  26. Zdravko M. and Daniel L. , Data Mining The Web, Wiley, 2007.
  27. Zhang K. , October 2007, Visual Cluster Analysis in Data Mining, PHD Thesis, Department of Computing Division of Information and Communication Sciences of Macquarie University.
  28. Zhang D. and Dong Y. , "An Efficient Algorithm to Rank Web Resources," Computer Networks, Vol. 33, No. 6, pp. 449–455, 2000.
  29. Ziberna A. , "Evaluation of Direct and Indirect Blockmodeling of Regular Equivalence in Valued Networks by Simulations," Metodološki zvezki, pp. 99-134, 2009.
Index Terms

Computer Science
Information Sciences

Keywords

Web mining search engines clustering and ranking