Innovations and Trends in Computer and Communication Engineering |
Foundation of Computer Science USA |
ITCCE - Number 4 |
December 2014 |
Authors: Harshali D.gangurde |
78f8ca75-16e6-4ed9-819b-931766397906 |
Harshali D.gangurde . Feature Selection using Clustering Approach for Big Data. Innovations and Trends in Computer and Communication Engineering. ITCCE, 4 (December 2014), 1-3.
Feature selection has been a productive field of research and development in data mining, machine learning and statistical pattern recognition, and is widely applied to many fields such as, image retrieval, genomic analysis and text categorization. Feature selection includes selecting the most useful features from the given data set. The feature selection involves removing irrelevant and redundant features form the data set. The feature selection can be efficient and effective using clustering approach. Based on the criteria of efficiency in terms of time complexity and effectiveness in terms of quality of data, useful features from the big data can be selected. Feature selection reduces the computational complexity of learning and prediction algorithms and saves on the cost of measuring non selected features. The feature selection can be done using the graph clustering approach based on theoretic graph. The most relevant features are selected from the cluster for the relevant target class. The features in every cluster are different and independent of the other.