Research Article

A New Query Engine using Novel Three Dimensional Index for Xml Documents

by Atul D. Raut, M. Atique
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 87 - Number 10
Year of Publication: 2014
Authors: Atul D. Raut, M. Atique
10.5120/15244-3786

Atul D. Raut, M. Atique. A New Query Engine using Novel Three Dimensional Index for Xml Documents. International Journal of Computer Applications. 87, 10 (February 2014), 20-27. DOI=10.5120/15244-3786

@article{ 10.5120/15244-3786,
author = { Atul D. Raut and M. Atique },
title = { A New Query Engine using Novel Three Dimensional Index for Xml Documents },
journal = { International Journal of Computer Applications },
issue_date = { February 2014 },
volume = { 87 },
number = { 10 },
month = { February },
year = { 2014 },
issn = { 0975-8887 },
pages = { 20-27 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume87/number10/15244-3786/ },
doi = { 10.5120/15244-3786 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:05:34.100259+05:30
%A Atul D. Raut
%A M. Atique
%T A New Query Engine using Novel Three Dimensional Index for Xml Documents
%J International Journal of Computer Applications
%@ 0975-8887
%V 87
%N 10
%P 20-27
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

XML has gained prominence as a data storage and exchange format for web applications, owing to features such as self-description, extensibility and non-proprietary text-based storage. Despite these features, XML has an inherent limitation: verbosity. This size problem should be handled so that good compression is achieved while the compressed data remains directly queryable, i.e. it should not require decompression at query time. The proposed technique creates a new query engine based on a novel three-dimensional index consisting of a structure index, an attribute index and a content index. The structure index consists of all unique root-to-leaf paths of the XML document. The content index stores the contents path-wise, i.e. all the contents belonging to one path class are stored in one file, and the attribute index is created in a manner similar to the content index. Based on this three-dimensional compact storage, a new query engine is proposed that can answer XPath queries very efficiently. This approach substantially reduces the storage requirement for XML while supporting efficient processing of XPath queries.
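
The three indexes described in the abstract can be illustrated with a minimal Python sketch. It is an illustration under stated assumptions, not the authors' implementation: the function names `build_indexes` and `query`, the in-memory dictionaries (standing in for the per-path files), and the sample document are all hypothetical, no compression is applied, and only simple absolute paths without predicates are handled.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict


def build_indexes(xml_text):
    """Build structure, content and attribute indexes for one document.

    Assumption: each index is a plain dict; the paper stores one file
    per path class instead.
    """
    root = ET.fromstring(xml_text)
    structure = {}                  # unique root-to-leaf path -> path id
    content = defaultdict(list)     # path id -> text values, document order
    attributes = defaultdict(list)  # "path/@name" -> attribute values

    def visit(elem, prefix):
        path = f"{prefix}/{elem.tag}"
        for name, value in elem.attrib.items():
            attributes[f"{path}/@{name}"].append(value)
        children = list(elem)
        if not children:
            # Leaf element: register its path once, then store its text
            # under that path's id.
            path_id = structure.setdefault(path, len(structure))
            content[path_id].append((elem.text or "").strip())
        for child in children:
            visit(child, path)

    visit(root, "")
    return structure, dict(content), dict(attributes)


def query(indexes, xpath):
    """Answer a simple absolute path query from the indexes alone."""
    structure, content, attributes = indexes
    if "/@" in xpath:
        return attributes.get(xpath, [])
    path_id = structure.get(xpath)
    return content.get(path_id, []) if path_id is not None else []


SAMPLE = """<library>
  <book id="b1"><title>XML Basics</title><year>2010</year></book>
  <book id="b2"><title>Query Engines</title><year>2012</year></book>
</library>"""

indexes = build_indexes(SAMPLE)
print(sorted(indexes[0]))
print(query(indexes, "/library/book/title"))
print(query(indexes, "/library/book/@id"))
```

In this sketch each repeated tag path is stored only once in the structure index, and a query is answered by a lookup into the content or attribute index without re-reading the original document, which mirrors the idea of querying the compact storage directly.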

Index Terms

Computer Science
Information Sciences

Keywords

Structure index, content index, attribute index.