CFP last date
20 December 2024
Reseach Article

Comparison of Efficient Parallel Index Algorithms used for RDF Data Store

by Rajani Sharma, Rajender Kumar Trivedi
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 107 - Number 17
Year of Publication: 2014
Authors: Rajani Sharma, Rajender Kumar Trivedi
10.5120/18842-0229

Rajani Sharma, Rajender Kumar Trivedi . Comparison of Efficient Parallel Index Algorithms used for RDF Data Store. International Journal of Computer Applications. 107, 17 ( December 2014), 12-16. DOI=10.5120/18842-0229

@article{ 10.5120/18842-0229,
author = { Rajani Sharma, Rajender Kumar Trivedi },
title = { Comparison of Efficient Parallel Index Algorithms used for RDF Data Store },
journal = { International Journal of Computer Applications },
issue_date = { December 2014 },
volume = { 107 },
number = { 17 },
month = { December },
year = { 2014 },
issn = { 0975-8887 },
pages = { 12-16 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume107/number17/18842-0229/ },
doi = { 10.5120/18842-0229 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:41:18.397389+05:30
%A Rajani Sharma
%A Rajender Kumar Trivedi
%T Comparison of Efficient Parallel Index Algorithms used for RDF Data Store
%J International Journal of Computer Applications
%@ 0975-8887
%V 107
%N 17
%P 12-16
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

GPU acceleration of compute-intensive applications has emerged as a new research frontier with phenomenal success- rates. Such applications are characterized by large data-sets being processed by singular functional units (FUs) often described as SIMD (Single Instruction Multiple Data) computing. Moreover, with the proliferation of internet and its easy access on myriad devices, has resulted in huge amount of data generation. Initially, such data was considered disconnected and not related. But with the advent of semantic web, data has been found to be highly co- related and relevant. Organizing such huge amount of data and subsequently processing requires parallel processing framework that is both distributed and scalable. Graphical processing units (GPUs) are being actively probed in the domain of Big Data analysis, machine learning, and augmented reality since such applications are characterized by massive data spanned and generated over distributed network. GPUs provide a parallel programming framework using CUDA (Compute Unified Device Architecture) that can be utilized to efficiently collate and make inferences on these massive data-sets. Further, GPU multicores are available at commodity rates thus providing an option for cheap and low-power alternatives. The exponential growth of semantic web and the resultant generation of large-scale RDF (Resource Description Framework) triples pose new challenges in the domain of RDF-storage and retrieval. RDF data consist of triples which need to be efficiently indexed. Following are some of the many challenges related to efficient indexing of RDF triples: • As RDF-triples extensively contain recursive redundancies, self-joins so formed are inefficient. • Self-joins also lead to large scale null values. This paper presents the research initiatives of conducting literature survey of contemporary indices including those under active research, which matches the goals as outlined in above sections. Comparing the efficiency of different variety of indices that have been suggested for large data-sets (Map reduce, B+ tree Hashed Index, 3-level-cascade hash index, braided B+ tree index, etc. )

References
  1. M. K. Nguyen, C. Basca and A. Bernstein, Speeding up on-disk RDF index lookups using B+Hash Trees, 2012, IOS Press, Zurich, Switzerland
  2. C. Weiss, P. Karras and A. Bernstein Hexastore:Sextuple Indexing for Semantic Web Data Management, 2008, VLDB Endowment, Auckland, New Zealand
  3. T. Neumann and G. Weikum, RDF3X: a RISCstyle Engine for RDF, 2008, VLDB Endowment, Auckland, New Zealand
  4. A. Matono, A Storing Scheme and A Merge Join Algorithm for RDF Query Processing, Jaban
  5. H. Stuckenschmidt, R. Vdovjak, J. Broekstra, Index Structures and Algorithms for Querying Distributed RDF Repositories. 2004, New York, USA
  6. Daniel J. Abadi • Adam Marcus • Samuel R. Madden • Kate Hollenbach, SW-Store: a vertically partitioned DBMS for Semantic Web data management. 2009, Springer-Verlag.
  7. P. Bakkum and K. Skadron, Accelerating SQL Database Operations on a GPU with CUDA. 2010, Pittsburg, PA, USA
  8. M. A. Bornea, J. Dolby, A. Kementsietsidis, K. Srinivas, P. Dantressangle, O. Udrea, B. Bhattacharjee, Building an Efficient RDF Store Over a Relational Database. 2013, New York, USA.
  9. R. Angles and C. Gutierrez, Querying RDF Data from a Graph Database Perspective.
  10. M. A. Martínez-Prieto, Mario Arias, and Javier D. Fernández. Exchange and Consumption of Huge RDF Data
  11. Y. Luo, F. Picalausa, G. H. L. Fletcher, J. Hidders, and S. Vansummeren, Storing and Indexing Massive RDF Data Sets
  12. R. Blanco, P. Mika, and S. Vigna, Effective and Efficient Entity Search in RDF data
  13. K. Li. Cost Analysis of Joins in RDF Query Processing Using the TripleT Index. 2008. Master Thesis. Faculty of the Graduate School of Emory University.
  14. A. MATONO, T. AMAGASA, M. YOSHIKAWA, and S. UEMURA, An Indexing Scheme for RDF and RDF Schema based on Suffix Arrays
  15. E. I. Chong , S. Das, G. Eadon and J. Srinivasan, An Efficient SQL-based RDF Querying SchemeProceedings of the 31st VLDB Conference, Trondheim, Norway, 2005
  16. T. Tran and G. Ladwig, Structure Index for RDF Data. Workshop on Semantic Data Management (SemData@VLDB) 2010,September 17,2010, Singapore.
  17. YounHee Kim; ByungGon Kim; HaeChull Lim, "The index organizations for RDF and RDF schema," Advanced Communication Technology, 2006. ICACT 2006. The 8th International Conference , vol. 3, no. , pp. 4 pp. ,1874, 20-22 Feb. 2006 doi: 10. 1109/ICACT. 2006. 206357 keywords: {database indexing; metadata;query processing;semantic Web;RDFschema;graph models;index organizations;indexing techniques;keyword index;keyword-based query;metadata;ontology;path-based query;semantic Web;Data m
Index Terms

Computer Science
Information Sciences

Keywords

RDF Semantic Web Graphical Processing Unit (GPU) Compute Unified Device Architecture (CUDA) Hashed B+ tree indexing.