Big data analysis solutions using mapReduce framework

Recently, data that generated from variety of sources with massive volumes, high rates, and different data structure, data with these characteristics is called Big Data. Big Data processing and analyzing is a challenge for the current systems because they were designed without Big Data requirements...

Full description

Bibliographic Details
Main Authors: Elagib, Sara B., Najeeb, Athaur Rahman, Hassan Abdalla Hashim, Aisha, Olanrewaju, Rashidah Funke
Format: Conference or Workshop Item
Language:English
English
Published: 2014
Subjects:
Online Access:http://irep.iium.edu.my/41638/
http://irep.iium.edu.my/41638/
http://irep.iium.edu.my/41638/1/41638.pdf
http://irep.iium.edu.my/41638/4/41638_Big%20data%20analysis%20solutions%20using%20map_Scopus.pdf
Description
Summary:Recently, data that generated from variety of sources with massive volumes, high rates, and different data structure, data with these characteristics is called Big Data. Big Data processing and analyzing is a challenge for the current systems because they were designed without Big Data requirements in mind and most of them were built on centralized architecture, which is not suitable for Big Data processing because it results on high processing cost and low processing performance and quality. MapReduce framework was built as a parallel distributed programming model to process such large-scale datasets effectively and efficiently. This paper presents six successful Big Data software analysis solutions implemented on MapReduce framework, describing their datasets structures and how they were implemented, so that it can guide and help other researchers in their own Big Data solutions.