International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 1 - Number 11 |
Year of Publication: 2010 |
Authors: M. Janaki Meena, K.R. Chandran, J. Mary Brinda |
10.5120/248-405 |
M. Janaki Meena, K.R. Chandran, J. Mary Brinda . Integrating Swarm Intelligence and Statistical Data for Feature Selection in Text Categorization. International Journal of Computer Applications. 1, 11 ( February 2010), 16-21. DOI=10.5120/248-405
Feature selection is the principal step in classification problems with attributes of high dimension. It may also be considered as a problem to determine the subset of terms in training corpus, which maximizes the classifier’s performance. Most of the machine learning algorithms has tainted performance in high dimensional feature space. In this paper, a novel feature selection method based on Ant Colony Optimization, a swarm intelligence algorithm is proposed. Ant Colony Optimization is a metaheuristic algorithm used to increase the ability of finding high quality solutions to NP-hard problems. The heuristic information required for the optimization process is obtained through a chi-square based statistical method, CHIR which results in fast convergence. Performance of the classifier with features selected by proposed method is compared to the feature selected by conventional chi-square and CHIR methods. It is found that the proposed algorithm identifies better feature set than the conventional methods.