International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 175 - Number 19 |
Year of Publication: 2020 |
Authors: Arefin Niam, Avijit Das, Mahruba Sharmin Chowdhury, Mohammad Abdullah Al Mumin |
10.5120/ijca2020920716 |
Arefin Niam, Avijit Das, Mahruba Sharmin Chowdhury, Mohammad Abdullah Al Mumin . A Literature Review of Bangla Document Clustering. International Journal of Computer Applications. 175, 19 ( Sep 2020), 28-35. DOI=10.5120/ijca2020920716
Document clustering is a machine learning approach to categorize documents into related groups without any definition to the documents prior to the process. It helps to categorize very large chunks of documents into similar categories for making the process of finding a particular document easier. It also helps in retrieval of the data. There has been numerous works in document clustering in other languages but the amount of work in Bangla is still not sufficient. In this paper it has been aimed to evaluate the techniques that have been adopted in clustering Bangla documents. These techniques and their effectiveness has also been compared in contrast to the contemporary methods adopted by researchers around the world on other languages and a vision is proposed on current state of development in Bangla Document Clustering.