International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 177 - Number 48 |
Year of Publication: 2020 |
Authors: Kiran Bolaj |
10.5120/ijca2020919950 |
Kiran Bolaj . Text Categorization System for English Text Documents using Naïve Bayes Classifier. International Journal of Computer Applications. 177, 48 ( Mar 2020), 7-10. DOI=10.5120/ijca2020919950
Information technology generated huge data on the internet. Most of this data is mainly in English language. Automatic text categorization is useful in better management and retrieval of these text documents and also makes document retrieval as simple task. Various learning techniques exist for the classification of text documents like Naïve Bayes, Support Vector Machine and Decision Trees, etc. The proposed system uses a Naïve Bayesian method. Bayesian algorithms are often used to classify data in different categories in a way that the systems can be trained and learn from human corrections.