International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 180 - Number 14
Year of Publication: 2018
Authors: Ashlesha S. Nagdive, R. M. Tugnayat
DOI: 10.5120/ijca2018916273
Ashlesha S. Nagdive, R. M. Tugnayat. A Review of Hadoop Ecosystem for BigData. International Journal of Computer Applications. 180, 14 (Jan 2018), 35-40. DOI=10.5120/ijca2018916273
This paper describes the concept of Big Data: collections of data sets so large that they cannot be processed by traditional computational techniques. Hadoop was designed to process such data and is a platform used in businesses for Big Data processing. It is an open-source, Java-based programming framework that supports the storage and processing of extremely large data sets in a distributed computing environment, running applications on clusters of commodity hardware. Hadoop supports Big Data analytics by overcoming the difficulties usually faced in handling Big Data. It breaks large computational problems down into smaller tasks, because smaller elements can be analyzed economically and quickly [1]. These parts are analyzed in parallel, and the results of the analysis are regrouped to produce the final output. Hadoop provides massive storage for various kinds of data, enormous processing power, and the ability to handle a virtually limitless number of concurrent tasks.
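As an illustration only, the decomposition described above (split a large problem into smaller tasks, analyze the parts in parallel, regroup the results) can be sketched in a few lines of Python. This is not Hadoop code and does not use any Hadoop API: the function names (`split_into_chunks`, `map_chunk`, `reduce_counts`, `word_count`) and the sample input are hypothetical, and local threads stand in for the cluster nodes that Hadoop would use.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def split_into_chunks(lines, num_chunks):
    """Divide the input into roughly equal parts (the 'smaller tasks')."""
    return [lines[i::num_chunks] for i in range(num_chunks)]


def map_chunk(chunk):
    """Analyze one part independently: count the words it contains."""
    counts = Counter()
    for line in chunk:
        counts.update(line.lower().split())
    return counts


def reduce_counts(partials):
    """Regroup the partial results into the final output."""
    total = Counter()
    for partial in partials:
        total.update(partial)
    return total


def word_count(lines, num_workers=3):
    chunks = split_into_chunks(lines, num_workers)
    # Assumption: threads stand in for cluster nodes; a real Hadoop job
    # would schedule these tasks on separate machines.
    with ThreadPoolExecutor(max_workers=num_workers) as pool:
        partials = list(pool.map(map_chunk, chunks))
    return reduce_counts(partials)


# Hypothetical sample input, one "record" per line.
sample = [
    "big data needs big clusters",
    "hadoop processes big data",
    "clusters run hadoop",
]
result = word_count(sample)
```

Each chunk is counted without knowledge of the others, which is what allows the work to be spread across machines; only the final regrouping step needs to see all partial results.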