International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 172 - Number 8 |
Year of Publication: 2017 |
Authors: Nandni Patel, Santosh Vishwakarma |
10.5120/ijca2017915199 |
Nandni Patel, Santosh Vishwakarma . A Comparative Analysis of Various Classifications in Vector Space Model with Absolute Pruning. International Journal of Computer Applications. 172, 8 ( Aug 2017), 34-38. DOI=10.5120/ijca2017915199
Text Classification is an important problem in text mining used to categorize an undefined label. In this work, various classification models have been evaluated after pre-processing of the text dataset. The pre-processing steps include tokenization, stop word removal and stemming, after which different term weight scheme have also been implemented. Various pruning techniques have also been implemented to get the maximum count of the terms. Based on this analysis, we summarized that Naïve Bayes method gives the highest accuracy while comparing with other state of the art text classifiers.