International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 105 - Number 12 |
Year of Publication: 2014 |
Authors: N.kamalraj, A.malathi |
10.5120/18433-9798 |
N.kamalraj, A.malathi . Hadoop Operations Management for Big Data Clusters in Telecommunication Industry. International Journal of Computer Applications. 105, 12 ( November 2014), 40-44. DOI=10.5120/18433-9798
This paper describes how data mining techniques are used in Hadoop for cloud data where it is an open source implementation. Extraction of useful information from raw data is always referred by the term DM. The techniques of DM are integrated into the normal day-to-day life has become very popular. Data mining are useful to improve the efficiency for the reduction of cost in the businesses field. In the cloud computing paradigm, the applications and techniques of data mining are most wanted. The users can retrieve meaningful information from virtually integrated data warehouse and it is allowed by implementing the data mining in cloud computing for reducing the cost of storage and infrastructure. This paper aims at predicts the churn customer in telecommunication industry where the dataset is stored in cloud and implemented using data mining techniques in Hadoop. In this paper, classification is used to analyze the dataset of telecommunication industry and classify the numerical and text data and predict the churners who are likely to switch from current network, and the clustering is used to group the result of the classification from the given data set for the best prediction of numerical and text data together in Hadoop. Hadoop is an environment easy to implement the classification; clustering techniques and cloud are used to store the data set for the industry.