International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 116 - Number 15 |
Year of Publication: 2015 |
Authors: Hamoud Alshammari, Jeongkyu Lee, Hassan Bajwa |
10.5120/20414-2828 |
Hamoud Alshammari, Jeongkyu Lee, Hassan Bajwa . Improving Current Hadoop MapReduce Workflow and Performance. International Journal of Computer Applications. 116, 15 ( April 2015), 38-42. DOI=10.5120/20414-2828
This study proposes an improvement andimplementation of enhanced Hadoop MapReduce workflow that develop the performance of the current Hadoop MapReduce. This architecture speeds up the process of manipulating BigData by enhancing different parameters in the processing jobs. BigData needs to be divided into many datasets or blocks and distributed to many nodes within the cluster. Thus, tasks can access these blocks in parallel mode and be processed easily. However, accessing the same datasets each time the job is executed causes data overloading problem, so we developed the current MapReduce workflow to improve the performance in terms of data size that is read in the relative jobs. This work uses a bioinformatics DNA datasets to implement the solution.