Recent Innovations in Computer Science and Information Technology |
Foundation of Computer Science USA |
RICSIT2016 - Number 1 |
September 2016 |
Authors: Dilbag Singh, Chirag Goyal |
83050051-5417-49fb-a8b9-94b0fab0426b |
Dilbag Singh, Chirag Goyal . Hadoop: An Effective Framework for Big Data Analytics. Recent Innovations in Computer Science and Information Technology. RICSIT2016, 1 (September 2016), 13-16.
In this modern era, analysis of enormous amount of data is becoming a big challenge to the decision makers. Big data is the datasets in size as well as high in variety, velocity and volume. So there is a need of the mean to handle and extract valuable insights from these datasets for better precision. It is very tedious rather impossible in some cases to handle enormous data using traditional databases and techniques their being the need for massive parallel processing and scalability which is not supported by the existing methods. Hadoop supports the scalability as it provides big storage and distribute big data sets over large no of servers operating in parallel. Traditional relational database systems don't scale to process the big data. Scaling of traditional RDBMS to such big data increases cost in many folds which is not affordable. Making efforts to reduce cost, the organizations have had to down-sample data and classify the data on assumptions by deleting raw data that may be useful only for a short term. Hadoop is designed as a scale out architecture and can affordably store company's data for use in future. In the present paper the Big Data Analytics has been carried out using experimental research method. Structured Queries are executed by setting up Hadoop Cluster and RDBMS environment using secondary datasets. The response time of RDBMS with Hadoop framework will be compared.