CFP last date
20 January 2025
Reseach Article

Performing Big Data over Cloud on a Test-Bed

by Vishal Dubey, Saumya Gupta, Sapeksh Garg
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 120 - Number 10
Year of Publication: 2015
Authors: Vishal Dubey, Saumya Gupta, Sapeksh Garg
10.5120/21267-3872

Vishal Dubey, Saumya Gupta, Sapeksh Garg . Performing Big Data over Cloud on a Test-Bed. International Journal of Computer Applications. 120, 10 ( June 2015), 49-52. DOI=10.5120/21267-3872

@article{ 10.5120/21267-3872,
author = { Vishal Dubey, Saumya Gupta, Sapeksh Garg },
title = { Performing Big Data over Cloud on a Test-Bed },
journal = { International Journal of Computer Applications },
issue_date = { June 2015 },
volume = { 120 },
number = { 10 },
month = { June },
year = { 2015 },
issn = { 0975-8887 },
pages = { 49-52 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume120/number10/21267-3872/ },
doi = { 10.5120/21267-3872 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:05:53.948198+05:30
%A Vishal Dubey
%A Saumya Gupta
%A Sapeksh Garg
%T Performing Big Data over Cloud on a Test-Bed
%J International Journal of Computer Applications
%@ 0975-8887
%V 120
%N 10
%P 49-52
%D 2015
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Data analytics has been rapidly growing in a variety of application areas like mining business intellect for processing the huge amount of data. MapReduce programming paradigm adds itself well to these data-intensive analytics jobs, given its one of the well known ability to scale-out and force several machines to parallely process data. This paper introduces a detailed analysis of big data over cloud computing with several mapred techniques from system and application aspects. Here in this work we say that such Mapper and Reducer based analytics provide a better result over cloud platform. However, End-Users in this environment has the ability to use MapReduce applications to minimize the incurred cost, while obtaining the best performance. From the implementation point of view, we describe the key issues and challenges of big data on cloud and on local system as well. At last, the challenges come across in implementing MapReduce functions over Hadoop and the analysis of standalone, clustered and virtualized systems over our test-bed.

References
  1. "What is Apache Hadoop," http://hortonworks. com /hadoop/,2011-2014
  2. "Gartner IT Glossary," http://www. gartner. com/it-glossary/big-data/,2013.
  3. "Fern Helper, Bringing big data to the enterprise ," http://www. ibm. com/software/ in/data/ bigdata/, January 2012
  4. Cloudera. http://www. cloudera. com/.
  5. J. Dean and S. Ghemawat. Mapreduce: Simplified data processing on large clusters. In Proc. of OSDI, 2004.
  6. RedHat Enterprises virtualization workbook for student.
  7. Big Data Processing in Cloud Computing Environments College of Information Science and Technology, Dalian Maritime University, Dalian 116026, China
  8. Virtualization in Linux a Key Component for Cloud Computing
  9. "White Paper: Ten Things You Need to Know About Virtualization. " www. datacore. com
  10. B. Golden. Virtualization for Dummies, Wiley: Hoboken, New Jersey: 2008.
  11. Implementation of MapReduce Algorithm and Nutch Distributed File System in Nutch published by IJCA 2011
  12. Pipeline computing. From Wikipedia en. wikipedia. org/wiki/Pipeline_(computing
Index Terms

Computer Science
Information Sciences

Keywords

Cloud computing Big data Hadoop HDFS Map Reduce virtualization