International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 46 - Number 23 |
Year of Publication: 2012 |
Authors: Tareque Mohmud Chowdhury, M. A. Mottalib |
10.5120/7108-9811 |
Tareque Mohmud Chowdhury, M. A. Mottalib . An Encoding Scheme to Support Efficient Searching and Linguistic Sorting for Bengali Texts. International Journal of Computer Applications. 46, 23 ( May 2012), 37-40. DOI=10.5120/7108-9811
Most of the known encoding schemes for Bengali language have a common drawback. That is characters order in the encoding scheme is different than the linguistic order. As a result, sorting of Bengali texts as per encoded value does not sort them in correct linguistic order. Even if Bengali characters are encoded in linguistic order, because of special properties of Bengali conjunct character, Bengali text can not be sorted directly using only traditional sorting algorithms. In this paper we proposed an encoding scheme for Bengali script which supports sorting of texts by sorting them as per encoded value. Thus the new encoding scheme can save significant amount of processing time for sort operations over large volume of Bengali texts.