CFP last date
20 May 2024
Call for Paper
June Edition
IJCA solicits high quality original research papers for the upcoming June edition of the journal. The last date of research paper submission is 20 May 2024

Submit your paper
Know more
Reseach Article

Web Query Processing Approaches – A Survey and Comparison

by M. Manikantan, S. Duraisamy
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 85 - Number 12
Year of Publication: 2014
Authors: M. Manikantan, S. Duraisamy
10.5120/14893-3362

M. Manikantan, S. Duraisamy . Web Query Processing Approaches – A Survey and Comparison. International Journal of Computer Applications. 85, 12 ( January 2014), 17-30. DOI=10.5120/14893-3362

@article{ 10.5120/14893-3362,
author = { M. Manikantan, S. Duraisamy },
title = { Web Query Processing Approaches – A Survey and Comparison },
journal = { International Journal of Computer Applications },
issue_date = { January 2014 },
volume = { 85 },
number = { 12 },
month = { January },
year = { 2014 },
issn = { 0975-8887 },
pages = { 17-30 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume85/number12/14893-3362/ },
doi = { 10.5120/14893-3362 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:02:15.461556+05:30
%A M. Manikantan
%A S. Duraisamy
%T Web Query Processing Approaches – A Survey and Comparison
%J International Journal of Computer Applications
%@ 0975-8887
%V 85
%N 12
%P 17-30
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

World Wide Web, in short www or simply web, is interconnection of hypertext documents through internet and accessed with the help of web browser. The web search is enabled by navigating hyperlinks in a webpage or through search engines or by web programming. The search queries are classified mainly in four types as Informational queries, Navigational queries, Transactional queries and Connectivity queries. We can classify the evolutionary development of web query processing from database query processing and SQL optimizations as Learning and Adaptive query processing, Web query through HTML and web search taxonomies, Web search query and search engines, Web query languages and its models, Semantic web and Ontologies, Web query optimizations on distributed web as well as on semantic web and Use of context-based techniques in web query processing. In this survey we are discussing on each of these topics and including how synonyms adding approach and Linguistic based approach are used in web query processing. There are three stages of query processing in general namely, Statistics generation, query optimization and query execution. Further, queries are optimized using performance and correctness measures namely Precision, Recall, Fall-out, F-measure, Average precision, R-Precision, Mean average precision, discounted cumulative gain and some more measures. Some of this surveyed paper discusses these details and others concentrate on their research work in different contexts. Our further work will be on the query using synonym based classifier or statistical classifiers, such as Naive Bayes (NB) and Support Vector Machines (SVMs). Other future work will be how to use unlabeled query logs to help with query classification and also on solution to adapt the changes of the queries and categories. We propose to use web query modeling using soft-computing techniques.

References
  1. Yanlei Diao et. al, "Toward Learning Based Web Query Processing", Proceedings of the 26th International Conference on Very Large Databases, Cairo, Egypt, 2000.
  2. Zachary G. Ives," Adaptive Query Processing for Internet Applications", 2001.
  3. Andrei Broder, "A taxonomy of web search", SIGIR Forum 3 Fall 2002, Vol. 36, No. 2.
  4. MOURAD OUZZANI et. al, Query Processing and Optimization on the Web, Distributed and Parallel Databases, 15, 187–218, 2004.
  5. Yen-Yu Chen et. al, Efficient Query Processing in Geographic Web Search Engines, SIGMOD 2006, June 27–29, 2006, Chicago, Illinois, USA.
  6. Wensheng Wu, AnHai Doan et al, WebIQ: Learning from the Web to Match Deep-Web Query Interfaces, 2006.
  7. Minji Wu et al, Corroborating Answers from Multiple Web Sources, Proceedings of the 10th International Workshop on Web and Databases (WebDB 2007), June 15, 2007, Beijing, China
  8. Jian Huang, Exploring Web Scale Language Models for Search Query Processing, WWW 2010, April 26–30, 2010, Raleigh, North Carolina, USA.
  9. Shuai Ding et al, Batch Query Processing for Web Search Engines, WSDM'11, February 9–12, 2011, Hong Kong, China.
  10. Enver Kayaaslan et al, Energy-Price-Driven Query Processing in Multi-center Web Search Engines, SIGIR'11, July 24–28, 2011, Beijing, China.
  11. A. Deshpande, Z. Ives and V. Raman, Adaptive query processing, Foundations and Trends in Databases, Vol. 1, No. 1 (2007) 1–140.
  12. "W3C Semantic Web Activity". World Wide Web Consortium (W3C). November 7, 2011. Retrieved November 26, 2011.
  13. James Cheng, Efficient Query Processing on Graph Databases, ACM Transactions on Database Systems, Vol. V, No. N, September 2008, Pages 1–44.
  14. James Bailey, Web and Semantic Web Query Languages: A Survey.
  15. Stefan Riezler, Statistical Machine Translation for Query Expansion in Answer Retrieval, Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pages 464–471, Prague, Czech Republic, June 2007.
  16. Mohd Kamir Yusof, Implementing of XML and Intelligent Algorithm for Improving Web Query Processing in Heterogeneous Database Access, International Journal of Database Theory and Application Vol. 4, No. 2, June, 2011.
  17. Naphtali Rishe, Semantic Relations: The key to integrating and query processing in heterogeneous databases, Research funded by NASA and NSF.
  18. Mohd Kamir Yusof, Designing an Architecture for Improving Web Query Processing in Heterogeneous Databases Access, WIMS'11, May 25-27, 2011 Sogndal, Norway.
  19. J. Bhogal, A review of ontology based query expansion, Information Processing and Management 43 (2007) 866–886.
  20. Min Song, Integration of association rules and ontologies for semantic query expansion, Data & Knowledge Engineering 63 (2007) 63–75.
  21. Jordi Conesa, Improving web-query processing through semantic knowledge, Data & Knowledge Engineering 66 (2008) 18–34.
  22. Haifeng Jiang, Improving parallelism of federated query processing, Data & Knowledge Engineering 64 (2008) 511–533.
  23. Jihyun Lee, An intelligent query processing for distributed ontologies, The Journal of Systems and Software, June 2009.
  24. Neera Batra, Three tier cache based query optimization model in distributed database, International Journal of Engineering Science and Technology Vol. 2(7), 2010, 3206-3212.
  25. Mohd Kamir Yusof , Ontology and Semantic Web Approaches for Heterogeneous Database Access, International Journal of Database Theory and Application Vol. 4, No. 4, December, 2011.
  26. Eduard C. Dragut, A Hierarchical Approach to Model Web Query Interfaces for Web Source Integration, VLDB '09, August 24-28, 2009, Lyon, France.
  27. Andrew Burton-Jones, A Heuristic-Based Methodology for Semantic Augmentation of User Queries on the Web, ER 2003, LNCS 2813, pp. 476–489, 2003.
  28. Veda C. Storey, CONQUER: A Methodology for Context-Aware Query Processing on the World Wide Web, Information Systems Research, Vol. 19, No. 1, March 2008, pp. 3–25.
  29. Steve Lawrence, Context in Web Search, IEEE Data Engineering Bulletin, Volume 23, Number 3, pp. 25–32, 2000.
  30. Eric J. Glover, Improving Category Specific Web Search by Learning Query Modifications, IEEE, 2001.
  31. Tomohiro TAKAGI, Query Expansion Using Conceptual Fuzzy Sets For Search Engine, lEEE International Fuzzy Systems Conference, 2001.
  32. M. Balabanovic. An Adaptive Web Page Recommendation Service. In Proc. of 1st International Conference on Autonomous Agents, pp. 378-385, 1997.
  33. F. Menczer, R. Belew. Adaptive Retrieval Agents: Internalizing Local Context and Scaling up to the Web. Technical Report CS98-579, University of California, San Diego, 1998. Available: http://www. cse. ucsd. edu/~rik/papers/arachnid/arachnd-mlj. ps.
  34. M. Pazzani, J. Muramatsu, D. Billsus. Syskill & Webert: Identifying Interesting Web Sites. In Proc. of AAAI-96, pp. 54-61, 1996.
  35. T. Joachims, D. Freitag, T. Mitchell. WebWatcher: A Tour Guide for the World Wide Web. In Proc. of the 1997 International Joint Conference on Artificial Intelligence, pp. 770-775, Aug. 1997.
  36. V. Barnett and T. Lewis. Outliers in Statistical Data. John Wiley & Sons, 1994.
  37. A. N. Wilschut and P. M. G. Apers, "Dataflow query execution in a parallel main-memory environment," in PDIS '91: Proceedings of the First International Conference on Parallel and Distributed Information Systems, Fontainebleu Hilton Resort, Miami Beach, FL, pp. 68–77, IEEE Computer Society, 1991.
  38. G. Graefe, "Query evaluation techniques for large databases," ACM Comput. Surv, vol. 25, no. 2, pp. 73–169, 1993.
  39. D. Kossmann, "The state of the art in distributed query processing," ACM Comput. Surv, vol. 32, no. 4, pp. 422–469, 2000.
  40. S. Chaudhuri, "An overview of query optimization in relational systems," in PODS '98: Proceedings of the seventeenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems, (New York, NY, USA), pp. 34–43, ACM Press, 1998.
  41. Y. E. Ioannidis, "Query optimization," ACM Computing Surveys, vol. 28, no. 1, pp. 121–123, 1996.
  42. S. Babu and P. Bizarro, "Adaptive query processing in the looking glass," in CIDR '05: Second Biennial Conference on Innovative Data Systems Research, pp. 238–249, Asilomar, CA, 2005.
  43. J. Platt. Fast training of support vector machines using sequential minimal optimization. In B. Scholkopf, C. Burges, and A. Smola, editors, Advances in kernel methods - support vector learning. MIT Press, 1998.
  44. J. Jackson. Pop Goes the Interstitial. eMarketeer, 7 June 2001, Available at http://www. emarketer. com/analysis/eadvertising/20010607_ead. html
  45. P. Turney, Mining the Web for Synonyms: PMI-IR Versus LSA on TOEFL, Proceedings of the Twelth European Conference on Machine Learning (ECML-2001), Freiburg, Germany. September 3–7, 2001. pp. 491–502. NRC 44893.
  46. Fellbaum, C. (ed. ): WordNet: An Electronic Lexical Database. Cambridge, Massachusetts: MIT Press (1998). For more information: http://www. cogsci. princeton. edu/~wn/.
  47. Haase, K. : Interlingual BRICO. IBM Systems Journal, 39 (2000) 589-596. For more information: http://www. framerd. org/brico/.
  48. Vossen, P. (ed. ): EuroWordNet: A Multilingual Database with Lexical Semantic Networks. Dordrecht, Netherlands: Kluwer (1998). See: http://www. hum. uva. nl/~ewn/.
  49. Eduardo Mena, OBSERVER-An approach for query processing, Distributed and Parallel Databases, 8, 223-271, 2000.
  50. Church, K. W. , Hanks, P. : Word Association Norms, Mutual Information and Lexicography. In: Proceedings of the 27th Annual Conference of the Association of Computational Linguistics, (1989) 76-83.
  51. Church, K. W. , Gale, W. , Hanks, P. , Hindle, D. : Using Statistics in Lexical Analysis. In: Uri Zernik (ed. ), Lexical Acquisition: Exploiting On-Line Resources to Build a Lexicon. New Jersey: Lawrence Erlbaum (1991) 115-164.
  52. Test of English as a Foreign Language (TOEFL), Educational Testing Service, Princeton, New Jersey, http://www. ets. org/.
  53. Tatsuki, D. : Basic 2000 Words - Synonym Match 1. In: Interactive JavaScript Quizzes forESLStudents,http://www. aitech. ac. jp/~iteslj/quizzes/js/dt/mc-2000-01syn. html(1998).
  54. Landauer, T. K. , Dumais, S. T. : A Solution to Plato's Problem: The Latent Semantic Analysis Theory of the Acquisition, Induction, and Representation of Knowledge. Psychological Review, 104 (1997) 211-240.
  55. Turney, P. D. : Learning Algorithms for Keyphrase Extraction. Information Retrieval, 2 (2000) 303-336.
  56. Sebastian Ryszard Kruk, JeromeDL - Adding Semantic Web Technologies to Digital Libraries, JeromeDL - e-Library with Semantics: http://www. jeromedl. org/
  57. Jason Mchuge, Query optimizations in XML, Rome Laboratories, Stanford University.
  58. Utkarsh Srivastava, Query Optimization over Web Services, VLDB '06, September 12-15, 2006, Seoul, Korea.
  59. King, Jonathan Jay, Query Optimization by Semantic Reasoning, Doctoral Thesis, STANFORD UNIV CA DEPT OF COMPUTER SCIENCE, 1981.
  60. Bin He, Statistical Schema Matching across Web Query Interfaces, SIGMOD 2003, June 912, 2003, San Diego, CA.
  61. Haoran Wang Supervised class-specific dictionary learning for sparse modeling in action recognition, National Laboratory of Pattern Recognition, Institute of Automation, CAS, 2012, Beijing, China.
  62. Figure 1 referred from TempVars in Web Query social. msdn. microsoft. com
  63. Figure 2 referred from WebQuery: Searching and Visualizing the Web Through Connectivity www. ra. ethz. ch
  64. Figure 3 referred from Oracle ATG Web Commerce Query Processing Overview docs. oracle. com
Index Terms

Computer Science
Information Sciences

Keywords

Web query- Semantic Web - Ontology - query classification - query optimizations