Call for Paper - May 2023 Edition
IJCA solicits original research papers for the May 2023 Edition. Last date of manuscript submission is April 20, 2023. Read More

XML Parsing on Multicore Processors and Data Representation in . NET Tree Control

International Journal of Computer Applications
© 2014 by IJCA Journal
Volume 99 - Number 18
Year of Publication: 2014
Navreet Kaur
Harwinder Singh Sohal

Navreet Kaur and Harwinder Singh Sohal. Article: XML Parsing on Multicore Processors and Data Representation in .NET Tree Control. International Journal of Computer Applications 99(18):29-35, August 2014. Full text available. BibTeX

	author = {Navreet Kaur and Harwinder Singh Sohal},
	title = {Article: XML Parsing on Multicore Processors and Data Representation in .NET Tree Control},
	journal = {International Journal of Computer Applications},
	year = {2014},
	volume = {99},
	number = {18},
	pages = {29-35},
	month = {August},
	note = {Full text available}


The purpose of this research is to optimize the parsing process of the XML files. There are several ways to parse the XML files. But to comply with the advanced multicore CPUs and their fast performance the XML parsing logics need to be refined and optimized with parallel processing approach. The parallel XML parsing is a step towards this approach. It makes the reading of XML data faster because parser runs on more engines to extract the data. There are several advantages of parallel XML parsing like fast execution, high throughput, time saving, proper CPU utilization and load balancing. To perform the parsing processes simultaneously, the XML files need to be split in small uniform portions. Now it will execute the parsing logic on multiple threads on each CPU's core to parse the each portion of XML file without interfering with each others. In other words, an each segment will be an input to the parser running on different threads on different CPU cores. To enhance the system performance the multicore processors based devices have been introduced. Such system's processing is much faster than conventional sequential processing systems especially when it does repetitive calculations on vast amounts of data. This technique becomes more important when a candidate system or development application is model based application which operates on the XML files. This approach plays a significant role to enhance the application's capability to process large amount of data, improve application performances by providing quick results and eventually expeditious the application processing and dependent operations.


  • Abdul Nizar M. and P. Sreenivasa Kumar (2009) "Ordered Backward XPath Axis Processing against XML Streams" XSym '09 Proceedings of the 6th International XML Database Symposium on Database and XML Technologies Pages 1 - 16.
  • Barbosa D(2002) ToXgene: a template-based data generator for XML, In : Proceedings of ACM Management of Data (SIGMOD), pp. 616.
  • Fernando F et. al (2009) 2LP Double-lazy XML Parser in Journal of Information Systems, pp. 145-163.
  • Gao Z. ( 2007) A High Performance Schema-Specific XML Parser, IEEE Intl. Conf. on e-Science and Grid Computing, pp. 245-252.
  • Gong Li (2010) XML Processing by Tree-Branch symbiosis algorithm, 2nd International Conference on Future Computer and Communication, Volume1.
  • Georgieva A and Georgiev B (2012) Parallel Processing Model for XML Parsing in Journal of Communication and Computer, 1258-1262.
  • G. R. Bamnote(2013) An Empirical Study: XML Parsing using Various Data Structures, International Journal of Computer Science and Applications, Vol. 6, No. 2.
  • James R. Otto et. al (2001) Extensible Markup Language and Knowledge Management in Journal of Knowledge Management, 5(3), pp. 278-284, MCB University Press.
  • Jie Tang et. al(2013) Acceleration of XML Parsing through Prefetching, IEEE TRANSACTIONS ON COMPUTERS, VOL. 62, NO. 8.
  • Kwon, J et. al (2005) FiST: the scalable XML document _ltering by sequencing twig patterns, In: Proceedings of the 31st international conference on Very Large Databases (VLDB), pp. 217 – 228
  • Le Liu et al. (2008) "Parallel Structural Join Algorithm on Shared-memory Multi-core Systems".
  • Li Lu W. and Gannon, D. (2008) ParaXML: A Parallel XML Processing Model on Multicore CPU, Techincal Report.
  • Li Xiaosong (2009) Key Elements Tracing Method for Parallel XML Parsing in Multi-coreSystem, in International Conference on Parallel and Distributed Computing, ApplicationsandTechnologies, IEEE.
  • Martin Krulis and Jakub Yaghob(2010) "Efficient Implementation of XPath Processor on Multi-Core CPUs" J. Pokorn_y, V. Sn_a_sel, K. Richta (Eds. ): Dateso 2010, pp. 60{71, ISBN 978-80-7378-116-3.
  • Nicola M. and J. John(2003) XML Parsing: A Threat to Database Performance, Proc. 12th Int'l Conf. Information and Knowledge Management (CIKM 03), ACM Press, pp. 175-178.
  • Peter Ogden et al. (2013) "Scalable XML Query Processing using Parallel Pushdown Transducers" Proceedings of the VLDB Endowment, Vol. 6, No. 14.
  • Rongxin c. et. al (2002) A Parallel Solution to XML Query Application in Computer Engineering College, Jimei University.
  • S. Chen et al. (2006) Twig2Stack: Bottom-up processing of generalized-tree-pattern queries over XML documents In VLDB, pages 283–294.
  • Su Cheng Haw and G. S. V. Radha Krishna Rao( 2007) A Comparative Study and Benchmarking on XML Parsers, Advanced Communication Technology, The 9th International Conference (Volume:1 ) ISSN :1738-9445 , pp. 321 – 32.
  • Seung Min Kim and Suk Yoo(2009) DOM Tree Browsing of a Very Large XML Document: Design and Implementation in Journal of Systems and Software, 82(11), pp. 1843-1858.
  • Tong T. et al. (2006) Rules about XML in XML, Expert Systems with Applications, Vol. 30, No. 2, pp. 397-411.
  • V. M. Deshmukh and G. R. Bamnote(2012) Design And Development Of An Efficient XML Parsing Algorithm, International Journal of Applied Science and Advance Technology , Vol. 1, No. 1, pp. 5-8.
  • Y. Pan et al. (2007) Parallel XML Parsing Using Meta-DFAs,Proc. 3rd IEEE Int'l Conf. e-Science and Grid Computing (e-Science 07), IEEE CS Press, pp. 237-244.
  • Zacharia Fadika (2009) Parallel and Distributed Approach for Processing Large-Scale XML Datasets in Computer Science Department, Binghamton University P. O. Box 6000, Binghamton, NY 13902-6000, USA