International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 167 - Number 6
Year of Publication: 2017
Authors: Ekta Jadon, Roopesh Sharma
DOI: 10.5120/ijca2017913925
Ekta Jadon, Roopesh Sharma. Data Mining: Document Classification using Naive Bayes Classifier. International Journal of Computer Applications. 167, 6 (Jun 2017), 13-16. DOI=10.5120/ijca2017913925
In data mining, classification is the task of partitioning data into several dependent or independent regions, each of which is referred to as a class. Different kinds of classifiers are used to accomplish the classification task. Classification is more constrained when the data are text documents. The motive of the work presented in this article is to evaluate multiclass document classification and to measure the classification accuracy achieved on text documents. The Naive Bayes approach is used to deal with the problem of document classification via a deceptively simple model. It is applied in both a flat (linear) and a hierarchical manner to improve the efficiency of the classification model. It has been found that the hierarchical classification technique is more effective than flat classification. It also performs better in the case of multi-label document classification. Compared with the past, we observe a significant increase in the amount of data generated each day. Hence, with the advent of smarter technologies, data must be classified and sorted before decisions can be drawn from it. Many techniques are available for classifying documents into various categories or labels. Data mining is the process of non-trivial extraction of novel, implicit, and actionable knowledge from large data sets.
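The difference between flat and hierarchical Naive Bayes classification can be illustrated with a minimal Python sketch. This is not the authors' implementation: the class names `NaiveBayes` and `HierarchicalNaiveBayes`, the toy documents, the labels, and the two-level hierarchy in `parent_of` are all hypothetical. The sketch assumes a multinomial model with add-one (Laplace) smoothing and whitespace tokenization, neither of which is stated in the abstract.

```python
import math
from collections import Counter, defaultdict


class NaiveBayes:
    """Multinomial Naive Bayes with add-one smoothing (illustrative sketch)."""

    def fit(self, docs, labels):
        self.word_counts = defaultdict(Counter)  # label -> word -> count
        self.doc_counts = Counter(labels)        # label -> number of documents
        self.vocab = set()
        for text, label in zip(docs, labels):
            words = text.lower().split()
            self.word_counts[label].update(words)
            self.vocab.update(words)
        self.total_docs = len(docs)
        return self

    def predict(self, text):
        # Words never seen in training are ignored.
        words = [w for w in text.lower().split() if w in self.vocab]
        best_label, best_score = None, -math.inf
        for label, n_docs in self.doc_counts.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            # Log prior plus the sum of smoothed log likelihoods.
            score = math.log(n_docs / self.total_docs)
            for w in words:
                score += math.log((counts[w] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


class HierarchicalNaiveBayes:
    """Two-level scheme: pick a parent class first, then a leaf within it."""

    def fit(self, docs, labels, parent_of):
        parents = [parent_of[label] for label in labels]
        self.root = NaiveBayes().fit(docs, parents)
        self.children = {}
        for parent in set(parents):
            pairs = [(d, l) for d, l in zip(docs, labels) if parent_of[l] == parent]
            sub_docs = [d for d, _ in pairs]
            sub_labels = [l for _, l in pairs]
            self.children[parent] = NaiveBayes().fit(sub_docs, sub_labels)
        return self

    def predict(self, text):
        parent = self.root.predict(text)
        return self.children[parent].predict(text)


# Hypothetical toy corpus and hierarchy, for illustration only.
docs = [
    "the team won the football match",
    "the player scored a goal in the match",
    "the batsman hit a century in cricket",
    "the bowler took a wicket in cricket",
    "the new phone has a fast processor",
    "the laptop ships with a new processor",
]
labels = ["football", "football", "cricket", "cricket", "hardware", "hardware"]
parent_of = {"football": "sports", "cricket": "sports", "hardware": "tech"}

flat = NaiveBayes().fit(docs, labels)
hier = HierarchicalNaiveBayes().fit(docs, labels, parent_of)

print(flat.predict("a goal in the football match"))
print(hier.predict("bowler took a wicket"))
```

In the flat variant a single classifier chooses among all leaf labels at once. In the hierarchical variant each decision involves fewer candidate classes, which is one plausible reason a hierarchy can help, although a toy corpus of this size says nothing about the accuracy figures reported in the paper.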