Distributed Join Query Processing for Big RDF Data

The expansion of the services of the Semantic Web and the evolution of cloud computing technologies have significantly enhanced the capability of preserving and publishing information in standard open web formats, such that data can be both human-readable and machine-processable. This situation meet...

Full description

Bibliographic Details
Main Authors: Elzein, Nahla Mohammed, Mazlina, Abdul Majid, Fakherldin, Mohammed, Hashem, Ibrahim Abaker Targio
Format: Article
Language:English
Published: American Scientific Publisher 2018
Subjects:
Online Access:http://umpir.ump.edu.my/id/eprint/20172/
http://umpir.ump.edu.my/id/eprint/20172/
http://umpir.ump.edu.my/id/eprint/20172/
http://umpir.ump.edu.my/id/eprint/20172/1/Distributed%20Join%20Query%20Processing%20for%20Big%20RDF%20Data.pdf
Description
Summary:The expansion of the services of the Semantic Web and the evolution of cloud computing technologies have significantly enhanced the capability of preserving and publishing information in standard open web formats, such that data can be both human-readable and machine-processable. This situation meets the challenge in the current big data era to effectively store, retrieve, and analyze resource description framework (RDF) data in swarms. Moreover, efficient data storage and retrieval that can scale to large amounts of possibly schema-less data have proven to be quite difficult to achieve, specifically, RDF data storage with complex and large graph patterns for representing semantic data, and SPARQL query languages. In this paper, we provide comprehensive discussion about the proposed algorithms of Join.Query processing of RDF data by considering MapReduce Framework in a distributed environment. Moreover, we introduced a framework for RDF query processing and the benchmark that is used for the performance evaluation. Finally, we offer an evaluation discussion on distributed join query processing for big RDF data.