Research Article

Efficient Querying of Structure and Contents for XML Documents

by Atul D. Raut, M. Atique
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 45 - Number 6
Year of Publication: 2012
Authors: Atul D. Raut, M. Atique
10.5120/6786-9093

Atul D. Raut, M. Atique. Efficient Querying of Structure and Contents for XML Documents. International Journal of Computer Applications. 45, 6 (May 2012), 30-37. DOI=10.5120/6786-9093

@article{ 10.5120/6786-9093,
author = { Atul D. Raut and M. Atique },
title = { Efficient Querying of Structure and Contents for XML Documents },
journal = { International Journal of Computer Applications },
issue_date = { May 2012 },
volume = { 45 },
number = { 6 },
month = { May },
year = { 2012 },
issn = { 0975-8887 },
pages = { 30-37 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume45/number6/6786-9093/ },
doi = { 10.5120/6786-9093 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:36:54.456678+05:30
%A Atul D. Raut
%A M. Atique
%T Efficient Querying of Structure and Contents for XML Documents
%J International Journal of Computer Applications
%@ 0975-8887
%V 45
%N 6
%P 30-37
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

XML is recognized as a standard for data storage and exchange in web applications because it is self-describing, extensible, and stored as plain text. Despite these features, XML has an inherent limitation: verbosity. Given the strong presence of XML in database technology and its inherent verbosity, there is a growing need for compact XML storage that can be used for efficient indexing and querying. The proposed technique creates a structure index, which is a compact summary of the XML document, and a data index, which groups and stores the contents of all similar paths in one place. Based on this compact storage, a novel query algorithm is proposed that answers XPath queries efficiently. This approach substantially reduces the storage requirement for XML while supporting efficient processing of XPath queries. An implementation of the technique and a comparison with other techniques support this claim.
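
The abstract does not specify the index layouts or the query algorithm, so the following is only a minimal sketch of the general idea it describes, not the authors' method. It assumes the structure index can be approximated by a summary of distinct root-to-node tag paths, and the data index by a map from each path to the text values found there. The names `build_indexes`, `query`, and `SAMPLE` are hypothetical, and only simple absolute paths (no predicates, wildcards, or `//`) are handled.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict


def build_indexes(xml_text):
    """Build two illustrative indexes from an XML string.

    structure_index: distinct tag path -> number of elements with that path
                     (a path summary of the document).
    data_index:      distinct tag path -> text values stored at that path,
                     in document order.
    """
    root = ET.fromstring(xml_text)
    structure_index = defaultdict(int)
    data_index = defaultdict(list)

    def visit(node, parent_path):
        path = parent_path + "/" + node.tag
        structure_index[path] += 1
        text = (node.text or "").strip()
        if text:
            # Values sharing a path are grouped under one key.
            data_index[path].append(text)
        for child in node:
            visit(child, path)

    visit(root, "")
    return dict(structure_index), dict(data_index)


def query(data_index, path):
    """Answer a simple absolute path query by a single dictionary lookup."""
    return data_index.get(path, [])


# Hypothetical sample document, used only for illustration.
SAMPLE = """<lib>
  <book><title>XML Basics</title><year>2010</year></book>
  <book><title>Query Processing</title><year>2012</year></book>
</lib>"""

structure_index, data_index = build_indexes(SAMPLE)
print(structure_index)
print(query(data_index, "/lib/book/title"))
```

In this sketch each distinct path is stored once regardless of how many elements share it, which is one way repeated tag names stop contributing to storage size, and a path query becomes a lookup rather than a traversal of the document.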

Index Terms

Computer Science
Information Sciences

Keywords

Compact Storage, Structure Index, Content Index