National Seminar on Recent Trends in Data Mining |
Foundation of Computer Science USA |
RTDM2016 - Number 3 |
April 2016 |
Authors: Nausheen Dange, V.v. Bag |
bfc67528-abe7-4bb0-9ee9-212443bab193 |
Nausheen Dange, V.v. Bag . Classification of Text using Innovative Algorithm. National Seminar on Recent Trends in Data Mining. RTDM2016, 3 (April 2016), 19-20.
The exponential growth of the internet has led to a great deal of interest in developing useful and efficient tools and software to assist users in searching the Web. Document retrieval, categorization, routing and filtering can all be formulated as classification problems. However, the complexity of natural languages and the extremely high dimensionality of the feature space of documents have made this classification problem very difficult. We have different methods for text classification: the Naive Bayes classifier, the nearest neighbor classifier, SVM (Support Vector Machine), Feature Selection, Feature Extraction Algorithms, decision trees and a subspace method. Each method involved has its own advantage and disadvantage. In order to avoid these ambiguities and redundancies, some of these methods can be combined together to produce highly accurate results. In addition to this, the produced algorithm will help to enhance the performance of the overall text classification system.