CFP last date
20 May 2024
Reseach Article

XML based Keyword Search

by G.seethalakshmi, J. Swathi
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 107 - Number 15
Year of Publication: 2014
Authors: G.seethalakshmi, J. Swathi

G.seethalakshmi, J. Swathi . XML based Keyword Search. International Journal of Computer Applications. 107, 15 ( December 2014), 1-3. DOI=10.5120/18824-0241

@article{ 10.5120/18824-0241,
author = { G.seethalakshmi, J. Swathi },
title = { XML based Keyword Search },
journal = { International Journal of Computer Applications },
issue_date = { December 2014 },
volume = { 107 },
number = { 15 },
month = { December },
year = { 2014 },
issn = { 0975-8887 },
pages = { 1-3 },
numpages = {9},
url = { },
doi = { 10.5120/18824-0241 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
%0 Journal Article
%1 2024-02-06T22:41:06.078481+05:30
%A G.seethalakshmi
%A J. Swathi
%T XML based Keyword Search
%J International Journal of Computer Applications
%@ 0975-8887
%V 107
%N 15
%P 1-3
%D 2014
%I Foundation of Computer Science (FCS), NY, USA

The success of information retrieval style keyword search on the web leads to the emergence of XML based keyword search. The text database and XML database differences leads to three new challenges: 1) The users search intention is to be identified, i. e. , the XML node types that user wants to search for and search via is identified. 2) The similarities in tag name, tag value and the structure of tags are identified. 3) New scoring function is needed to estimate the output of the search results (XML document) relevance to the given query. However, these challenges cannot be addressed by the existing system, which results in low quality results in terms of query relevance. In this paper, an IR-style approach is proposed which basically utilizes the statistics of underlying XML data to address these challenges. First, specific guidelines that a search engine should meet in both search intention identification and relevance oriented ranking for search results is proposed. Then, based on these guidelines, a novel XML TF*IDF ranking strategy to rank the individual matches of all possible search intentions is proposed.

  1. Ziyang Liu Yi Chen, "Reasoning and Identifying Relevant Matches for XML Keyword Search", Proceedings of the VLDB Endowment Volume 1 Issue 1, August 2008.
  2. Y. Xu and Y. Papakonstantinou. Efficient Keyword Search for Smallest LCAs in XML Databases. In SIGMOD, 2005.
  3. L. Guo, F. Shao, C. Botev, and J. Shanmugasundaram. XRANK: Ranked Keyword Search over XML Documents. In SIGMOD, 2003.
  4. Z. Liu and Y. Chen, "Identifying Meaningful Return Information for XML Keyword Search," Proc. ACM SIGMOD Conf. , 2007.
  5. Cohen. S, Mamou. J, Kanza. Y, and Sagiv. Y, 'XSEarch: A semantic search engine for XML', In Proc. Of VLDB Conference, pp. 45–56. 2003
  6. Yunyao Li, Cong Yu, H. V. Jagadish, "Schema-Free XQuery" Proceedings of the 30th VLDB Conference, Toronto, Canada, 2004
  7. D. Chamberlin. XQuery: An XML query language. IBM System Journal, 41:597–615, 2003.
  8. V. Hristidis, Y. Papakonstantinou, and A. Balmin, "Keyword proximity search on XML graphs," in ICDE, 2003, pp. 367–378.
  9. Norbert Fuhr, Kai Großjohann, "XIRQL: A Query Language for Information Retrieval in XML Documents SIGIR'01, September 9–12, New Orleans, Louisiana, USA
  10. Rosie Jones, Benjamin Rey and Omid Madani, Wiley Greiner, "Generating Query Substitutions" WWW 2006, May 23–26, 2006, Edinburgh, Scotland ACM 1-59593-323-9/06/0005.
  11. Albrecht Schmidt, Martin Kersten, Menzo Windhouwer "Querying XML Documents Made Easy: Nearest Concept Queries" proceedings of 17th International conference on data engineering, 2001.
Index Terms

Computer Science
Information Sciences


XML Search ranking.