CFP last date
20 December 2024
Reseach Article

A Faster Approach to Sort Unicode Represented Bengali Words

by Aamira Shabnam, Tapashee Tabassum Urmi, Md. Saiful Islam
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 126 - Number 11
Year of Publication: 2015
Authors: Aamira Shabnam, Tapashee Tabassum Urmi, Md. Saiful Islam
10.5120/ijca2015906224

Aamira Shabnam, Tapashee Tabassum Urmi, Md. Saiful Islam . A Faster Approach to Sort Unicode Represented Bengali Words. International Journal of Computer Applications. 126, 11 ( September 2015), 29-32. DOI=10.5120/ijca2015906224

@article{ 10.5120/ijca2015906224,
author = { Aamira Shabnam, Tapashee Tabassum Urmi, Md. Saiful Islam },
title = { A Faster Approach to Sort Unicode Represented Bengali Words },
journal = { International Journal of Computer Applications },
issue_date = { September 2015 },
volume = { 126 },
number = { 11 },
month = { September },
year = { 2015 },
issn = { 0975-8887 },
pages = { 29-32 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume126/number11/22598-2015906224/ },
doi = { 10.5120/ijca2015906224 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:17:12.865790+05:30
%A Aamira Shabnam
%A Tapashee Tabassum Urmi
%A Md. Saiful Islam
%T A Faster Approach to Sort Unicode Represented Bengali Words
%J International Journal of Computer Applications
%@ 0975-8887
%V 126
%N 11
%P 29-32
%D 2015
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Sorting Bengali words, a constituent part of Bengali language processing, Bengali data manipulation and Bengali database system comes up with a lot of challenges. A simple lexicographic ordering based on the Unicode representation does not yield the correct order of Bengali words as the character order in Unicode for Bengali differs from the order suggested by Bangla Academy. Besides, the presence of modifiers, compound characters, dual representation of some characters in Unicode as well as the precedence of vowels have made the situation even more complex. Our study aims to adapt the linguistic order for Unicode represented Bengali text while achieving maximum possible time and space efficiency. In this paper, we propose an approach to sort Bengali texts using popular algorithms with a slight modification in mapping so that it follows the linguistic order of the language and takes no extra memory. Also it shows a run time comparison with the previous works done on this topic.

References
  1. Bengali language, Wkipedia: https://en.wikipedia.org/wiki/Bengali_language Retrieved Aug 01, 2015
  2. Bengali alphabet বাংলা Bangla (Bengali): http://www.omniglot.com/writing/bengali.htm Retrieved August 01,2015
  3. বাংলা একাডেমি: http://www.banglaacademy.org.bd/ Retrieved August 01,2015
  4. Shabnam, Aamira, and Debakar Shamanta Piklu. "An Easily Comprehendible Unicode based Sorting Algorithm for Bangla Words." International Journal of Computer Applications 79.5 (2013): 27-31.
  5. Amin, Md Ruhul, et al. "An Efficient Unicode based Sorting Algorithm for Bengali Words." International Journal of Computer Applications 24.7 (2011).
  6. Khan, Mafizul Haque, et al. "An Efficient and Correct Bangla Sorting Algorithm." 7th ICCIT (2004): 125. K. Elissa, “Title of paper if known,” unpublished.
  7. Islam, Shah Md Emrul, and Muhammad Masroor Ali. "An Approach to Sort Unicode Bengali Text Using Ancillary Maps." Asian Journal of Information Technology 4.10 (2005): 890-894.
  8. The Unicode Standard 4. 0, copyright 1991-2003, Unicode, Inc.
  9. Bangla Academy Bengali-English Dictionary. Bangla Academy, 1994.
  10. Mohammad, Kazi Din. "Adhunik Bangla Byakoron O Rochona." (1999).
  11. Thomas Cormen, Charles Leiserson, and Ronald Rivest: “Introduction to Algorithm”, Prentice – Hall of India Private Limited, 1999.
  12. Dietel, Paul. Java how to program. PHI, 2009
Index Terms

Computer Science
Information Sciences

Keywords

Bengali Word Sort Unicode Bengali sort Bengali Linguistic Sort Bengali Dictionary Sort Bangla Academy Sort