International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 102 - Number 8
Year of Publication: 2014
Authors: Abul Kalam Md. Rajib Hasan
DOI: 10.5120/17838-8724
Abul Kalam Md. Rajib Hasan. Review of Stochastic POS Tagging Techniques used in Bengali. International Journal of Computer Applications. 102, 8 (September 2014), 35-39. DOI=10.5120/17838-8724
In this paper, we describe the stochastic methods used for part-of-speech (POS) tagging of the Bengali language and present a generalized stochastic model for Bengali POS tagging. We review the corpora and the number of tags used by the tagging methods. The study finds that as many as 45 useful tags exist in the literature, along with four useful corpora. Because Bengali is a morphologically rich language, we outline a feature list that can be used with different training algorithms. We find that a hybrid HMM model combined with a morphological analyzer works best for Bengali, with an accuracy of 96.3%.
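For readers unfamiliar with stochastic tagging, the core idea behind an HMM tagger can be sketched as follows. This is only an illustration of a plain bigram HMM decoded with the Viterbi algorithm, not the hybrid model reviewed in the paper, and it includes no morphological analyzer. The function name `viterbi`, the three-tag tag set, the transliterated words, and every probability below are made-up assumptions for the example, not values from any corpus.

```python
import math

def viterbi(words, tags, start_p, trans_p, emit_p, floor=1e-6):
    """Return the most probable tag sequence for `words` under a bigram HMM."""
    if not words:
        return []

    def lp(p):
        # Log-probability; unseen events get a small floor instead of zero.
        return math.log(p if p > 0 else floor)

    # best[i][t] = (log-prob of the best path ending in tag t at position i,
    #               tag chosen at position i - 1 on that path)
    best = [{t: (lp(start_p.get(t, 0)) + lp(emit_p[t].get(words[0], 0)), None)
             for t in tags}]
    for i in range(1, len(words)):
        column = {}
        for t in tags:
            emit = lp(emit_p[t].get(words[i], 0))
            prev, score = max(
                ((p, best[i - 1][p][0] + lp(trans_p[p].get(t, 0))) for p in tags),
                key=lambda pair: pair[1],
            )
            column[t] = (score + emit, prev)
        best.append(column)

    # Follow the back-pointers from the best final tag.
    path = [max(tags, key=lambda t: best[-1][t][0])]
    for i in range(len(words) - 1, 0, -1):
        path.append(best[i][path[-1]][1])
    path.reverse()
    return path

# Toy model: all numbers are invented for illustration only.
TAGS = ["PRON", "NOUN", "VERB"]
START_P = {"PRON": 0.6, "NOUN": 0.3, "VERB": 0.1}
TRANS_P = {
    "PRON": {"PRON": 0.1, "NOUN": 0.6, "VERB": 0.3},
    "NOUN": {"PRON": 0.1, "NOUN": 0.3, "VERB": 0.6},
    "VERB": {"PRON": 0.3, "NOUN": 0.4, "VERB": 0.3},
}
EMIT_P = {
    "PRON": {"ami": 0.9},
    "NOUN": {"bhat": 0.8, "khai": 0.05},
    "VERB": {"khai": 0.9, "bhat": 0.02},
}

print(viterbi(["ami", "bhat", "khai"], TAGS, START_P, TRANS_P, EMIT_P))
```

In a real tagger the start, transition, and emission tables would be estimated from a tagged corpus, and a morphological analyzer could be used to restrict or reweight the candidate tags for each word before decoding.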