International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 181 - Number 5 |
Year of Publication: 2018 |
Authors: Saidesh Kumar Padmala |
10.5120/ijca2018917565 |
Saidesh Kumar Padmala . Document Clustering based on the Similarity of Data with Efficient Time Consumption. International Journal of Computer Applications. 181, 5 ( Jul 2018), 40-44. DOI=10.5120/ijca2018917565
Text mining has becoming an emerging research area now-a-days which helps in extracting the useful information from large amount of natural language text documents. The necessity of grouping the documents for different applications is gaining comprehensive review of the techniques used to improve the efficient time consumption, challenges, research issues are presented. The techniques presented in the review are k-means clustering, fuzzy c means clustering, support vector machine classifiers, naive Bayes classifier, Hidden Markov Model (HMM). Furthermore, discussion of the advantages and disadvantages of each technique is contributed to a better understanding and compared with the existing techniques based on the efficiency and computational time.