We apologize for a recent technical issue with our email system, which temporarily affected account activations. Accounts have now been activated. Authors may proceed with paper submissions. PhDFocusTM
CFP last date
20 December 2024
Reseach Article

A Catholic Research on Big Data and Hadoop Environment

by Sushma Lakshkar, Geet Kalani, Vinod Todwal
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 130 - Number 11
Year of Publication: 2015
Authors: Sushma Lakshkar, Geet Kalani, Vinod Todwal
10.5120/ijca2015907126

Sushma Lakshkar, Geet Kalani, Vinod Todwal . A Catholic Research on Big Data and Hadoop Environment. International Journal of Computer Applications. 130, 11 ( November 2015), 19-26. DOI=10.5120/ijca2015907126

@article{ 10.5120/ijca2015907126,
author = { Sushma Lakshkar, Geet Kalani, Vinod Todwal },
title = { A Catholic Research on Big Data and Hadoop Environment },
journal = { International Journal of Computer Applications },
issue_date = { November 2015 },
volume = { 130 },
number = { 11 },
month = { November },
year = { 2015 },
issn = { 0975-8887 },
pages = { 19-26 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume130/number11/23254-2015907126/ },
doi = { 10.5120/ijca2015907126 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:25:07.036738+05:30
%A Sushma Lakshkar
%A Geet Kalani
%A Vinod Todwal
%T A Catholic Research on Big Data and Hadoop Environment
%J International Journal of Computer Applications
%@ 0975-8887
%V 130
%N 11
%P 19-26
%D 2015
%I Foundation of Computer Science (FCS), NY, USA
Abstract

In recent years, there are various sources that generates data in petabyte and terabyte, known as big data and its generated by human, machine, sensor etc. So the solution leds with big data is Apache Hadoop has attracted strong attention because of its applicability on processing for the large data sets. This paper present the review about the big data and it’s characteristics and also the types of the open source tools environment like HADOOP. The objective of paper to identify the power of the Hadoop on the big data and motivation behind the new research and outlines to address to Apache Hadoop also includes the programming paradigm that is Map-Reduce.

References
  1. IvaniltonPolato a,b,n, ReginaldoRé b, AlfredoGoldman a, FabioKon a A comprehensive rview of Hadoop research—A systematic 46 (2014)1–25
  2. Seyed Reza Pakize, A Comprehensive View of Hadoop MapReduce Scheduling Algorithms VOL. 2, NO. 9, SEPTEMBER 2014, 308–3179 (2014)
  3. Gothai E, Balasubramanie P, A Novel Approach For Partitioning In Hadoop Using Round Robin Technique, 20th May 2014. Vol. 63 No.2 © 2005 - 2014
  4. Kamble Ashwini ,Kanawade Bhavan, A Brief on MapReduce Performance Volume 1 Issue 1 (April 2014)
  5. Lu Lu, Xuanhua Shi ∗, Hai Jin, Qiuyue Wang, Daxing Yuan, Song Wu, Morpho: A decoupled MapReduce framework for elastic cloud computing , 36 (2014) 80–90
  6. Kirandeep Kaur1, Khushdeep Kaur2, An Improved Longest Approximate Time to End Algorithm using Dynamic Cloud Sim 2319-7064 .2013
  7. Lizhe Wanga,b,∗, Jie Taoc, Rajiv Ranjan d, Holger Martenc, Achim Streit c, Jingying Chene, Dan Chena,∗∗, G-Hadoop: MapReduce across distributed data centers for data-intensive computing 29 (2013) 739–750
  8. Vidyasagar S.D, A Study on “Role of Hadoop in Information Technology era” Volume : 2 | Issue : 2 | Feb 2013 • ISSN No 2277 - 8160
  9. Shaochun Wu ,Xiang Shuai, Liang Chen, Ling Ye, Bowen Yuan, A replica pre-placement strategy based on correlation analysis in cloud environment (CCIS 2013)
  10. Jilan Chen, Dan Wang and Wenbing Zhao, A Task Scheduling Algorithm for Hadoop Platform, VOL. 8, NO. 4, APRIL 2013
  11. Shreyas Kudale1, Advait Kulkarni2, Asst. Prof. Leena A. Deshpande3, Predictive Analysis Using Hadoop: A Survey, Vol. 1, Issue 8, October 2013
  12. Hortonworks, Community Driven Apache Hadoop Apache Hadoop Basics May 2013 ©
  13. S. Chandra Mouliswaran And Shyam Sathyan*, Study On Replica Management And High Availability In Hadoop Distributed File System (Hdfs), Vol 2 / Issue 2 / 2012 / 65-70
  14. Http://Www.Apache.Org
  15. Sarannia,N.Padmapriya, Survey On Big Data Processing In Geo Distributed Data Centers Vol 4, Issue 11, November 2014
  16. Madhury Mohandas & Dhanya P M ,Department Of Computer Science & Engineering, Rajagiri School Of Engineering & Technology, Cochin, Kerala, India, “Algorithm For Efficient Data Placement In Blobseer Architecture” International Journal Of Computer Science Engineeringand Information Technology Research (Ijcseitr) Issn 2249-683,Vol. 3, Issue 3, Aug 2013, 193-200
  17. Tao Gu, Chuang Zuo, Qun Liao, Yulu Yang and Tao Li “Improving MapReduce Performance by Data Prefetching in Heterogeneous or Shared Environments”, International Journal of Grid and Distributed Computing Vol.6, No.5 (2013), pp.71-82
  18. Md. Rezaul Karim1, Azam Hossain1, Md. Mamunur Rashid1, Byeong-Soo Jeong1, and Ho-Jin Choi2,” An Efficient Market Basket Analysis Technique with Improved MapReduce Framework on Hadoop: An E commerce Perspective
  19. Gothai E, 2balasubramanie P,” A Novel Approach For Partitioning In Hadoop Using Round Robin Technique”, Journal Of Theoretical And Applied Information Technology 20th May 2014. Vol. 63 No.2
  20. S. Chandra Mouliswaran And Shyam Sathyan, “Study On Replica Management And High Availability In Hadoop Distributed File System (Hdfs)”S. Chandra Mouliswaran And Shyam Sathyan. Et Al. / Journal Of Science / Vol 2 / Issue 2 / 2012 / 65-70
  21. Chia-WeiLeea, Kuang-YuHsieha, Sun-YuanHsieha,b,∗, Hung-ChangHsiaoa, “A Dynamic Data Placement Strategy for Hadoop in Heterogeneous Environments”, Big DataResearch1(2014)14–22
  22. Balaji Palanisamy, Aameek Singh, Ling Liu, Bhushan Jain,” Purlieus: Locality-aware Resource Allocation for MapReduce in a Cloud”
  23. Ivan Baev† Rajmohan Rajaraman‡ Chaitanya Swamy§,”Approximation Algorithms for Data Placement Problems
  24. R.Jemina Priyadarsini1, Dr.L.Arockiam2,” An Extensive Analysis On Task Scheduling Algorithms In Cloud Environments” (Ijetcas)”
  25. Medhat Tawfeek, Ashraf El-Sisi, Arabi Keshk and Fawzy Torkey,” Cloud Task Scheduling Based on Ant Colony Optimization” The International Arab Journal of Information Technology, Vol. 12, No. 2, March 2015.
  26. Nitesh MaheshwariRadheshyam Nanduri, Vasudeva Varma.”Dynamic Energy Ecient Data Placement and ClusterRecongurationAlgorithmforMapReduceFramework”
  27. Phokham Nonava October 2014,” HDFS Blocks Placement Strategy
  28. Ivanilton Polatoa,b,⇤, Reginaldo R´eb, Alfredo Goldmana, Fabio Kona,” A Comprehensive View of Hadoop Research - A Systematic Literature Review”journal of network and computer applications volume ,46 november 2014, page no 1-25
  29. George Porter UC San Diego La Jolla,” Decoupling Storage and Computation in Hadoop with SuperDataNodes”.
Index Terms

Computer Science
Information Sciences

Keywords

Bigdata ApacheHadoop Map-Reduce distributed systemHDFS MPI G-Hadoop Gfram 3VBigDatamodel 5 VBigData model.