Recent Trends in Image Processing and Pattern Recognition |
Foundation of Computer Science USA |
RTIPPR - Number 2 |
2010 |
Authors: B S Harish, D S Guru, S Manjunath |
B S Harish, D S Guru, S Manjunath. Representation and Classification of Text Documents: A Brief Review. Recent Trends in Image Processing and Pattern Recognition. RTIPPR, 2 (2010), 110-119.
Text classification is an important research problem in text mining, in which documents are assigned to predefined categories using supervised knowledge. The literature offers many text representation schemes and classifiers (learning algorithms) for this task. In this paper, we present various text representation schemes and compare different classifiers used to assign text documents to predefined classes. The existing methods are compared and contrasted on qualitative parameters, namely the criteria used for classification, the algorithms adopted, and the classification time complexities.