International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 125 - Number 14 |
Year of Publication: 2015 |
Authors: Garima Jain, Shailendra Kumar Shrivastava |
10.5120/ijca2015906260 |
Garima Jain, Shailendra Kumar Shrivastava . Evaluation of Clustering around Weighted Prototype and Genetic Algorithm for Document Categorization. International Journal of Computer Applications. 125, 14 ( September 2015), 21-27. DOI=10.5120/ijca2015906260
Document clustering is very important in the field of text categorization. Genetic algorithm, which is an optimization based technique which can be applied for finding out the best cluster centres easily by computing fitness values of data points. While clustering around weighted prototype technique is especially helpful when proper pairwise similarities are available. This technique does not find global solution of the objective function. Experimental result shows that F-measure and Normalized mutual information of genetic algorithm is better than clustering around weighted prototype for 20 Newsgroup dataset. F-measure and accuracy of genetic algorithm is better than clustering around weighted prototype for the Reuter-21578 dataset.