International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 107 - Number 17 |
Year of Publication: 2014 |
Authors: Rajani Sharma, Rajender Kumar Trivedi |
10.5120/18842-0229 |
Rajani Sharma, Rajender Kumar Trivedi . Comparison of Efficient Parallel Index Algorithms used for RDF Data Store. International Journal of Computer Applications. 107, 17 ( December 2014), 12-16. DOI=10.5120/18842-0229
GPU acceleration of compute-intensive applications has emerged as a new research frontier with phenomenal success- rates. Such applications are characterized by large data-sets being processed by singular functional units (FUs) often described as SIMD (Single Instruction Multiple Data) computing. Moreover, with the proliferation of internet and its easy access on myriad devices, has resulted in huge amount of data generation. Initially, such data was considered disconnected and not related. But with the advent of semantic web, data has been found to be highly co- related and relevant. Organizing such huge amount of data and subsequently processing requires parallel processing framework that is both distributed and scalable. Graphical processing units (GPUs) are being actively probed in the domain of Big Data analysis, machine learning, and augmented reality since such applications are characterized by massive data spanned and generated over distributed network. GPUs provide a parallel programming framework using CUDA (Compute Unified Device Architecture) that can be utilized to efficiently collate and make inferences on these massive data-sets. Further, GPU multicores are available at commodity rates thus providing an option for cheap and low-power alternatives. The exponential growth of semantic web and the resultant generation of large-scale RDF (Resource Description Framework) triples pose new challenges in the domain of RDF-storage and retrieval. RDF data consist of triples which need to be efficiently indexed. Following are some of the many challenges related to efficient indexing of RDF triples: • As RDF-triples extensively contain recursive redundancies, self-joins so formed are inefficient. • Self-joins also lead to large scale null values. This paper presents the research initiatives of conducting literature survey of contemporary indices including those under active research, which matches the goals as outlined in above sections. Comparing the efficiency of different variety of indices that have been suggested for large data-sets (Map reduce, B+ tree Hashed Index, 3-level-cascade hash index, braided B+ tree index, etc. )