International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 89 - Number 11 |
Year of Publication: 2014 |
Authors: Nur Hossain Khan, Gonesh Chandra Saha, Bappa Sarker, Md. Habibur Rahman |
10.5120/15672-4416 |
Nur Hossain Khan, Gonesh Chandra Saha, Bappa Sarker, Md. Habibur Rahman . Checking the Correctness of Bangla Words using N-Gram. International Journal of Computer Applications. 89, 11 ( March 2014), 1-3. DOI=10.5120/15672-4416
N-gram model is used in many domains like spelling and syntactic verification, speech recognition, machine translation, character recognition and like others. This paper describes a system for checking the correctness of a bangle word using N-gram model. An experimental corpus containing one million word tokens was used to train the system. The corpus was a part of the BdNC01 corpus, created in the SIPL lab. of Islamic university. Collecting several sample text from different newspapers, the system was tested by 50,000 correct and another 50,000 incorrect words. The system has successfully detected the correctness of the test words at a rate of 96. 17%. This paper also describes the limitations of the system with possible solutions.