International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 39 - Number 9
Year of Publication: 2012
Authors: Deepamala. N, Ramakanth Kumar. P
DOI: 10.5120/4852-7124
Deepamala. N, Ramakanth Kumar. P. Sentence Boundary Detection in Kannada Language. International Journal of Computer Applications. 39, 9 (February 2012), 38-41. DOI=10.5120/4852-7124
Sentence Boundary Detection is a pre-processing step for any Natural Language Processing application. Various algorithms have been used to achieve Sentence Boundary Detection, or Disambiguation, in different languages. This paper proposes and tests a rule-based method for Sentence Boundary Detection in the Kannada language. Kannada, a grammatically rich Indian language, is analyzed on the basis of its semantics, and the method is tested on a 227 KB corpus. The code is written in C using wide characters, with support for Unicode. The method detected sentence boundaries with a 99.2% success rate.
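To illustrate the general shape of a rule-based detector written in C with wide characters, the following is a minimal sketch and not the authors' implementation. The abstract does not list the paper's rules, so every rule here is an assumption made for illustration: a terminator ('.', '?' or '!') counts as a boundary only when followed by whitespace or the end of the text, and a period after a very short token is treated as an abbreviation or initial. The names `find_boundaries` and `MAX_ABBREV_LEN` and the threshold value are hypothetical.

```c
#include <assert.h>
#include <stddef.h>
#include <wchar.h>

/* Assumption: a token of at most this many wide characters followed by a
 * period is an abbreviation or initial, not a sentence end. */
#define MAX_ABBREV_LEN 2

static int is_space(wchar_t c) {
    return c == L' ' || c == L'\t' || c == L'\n' || c == L'\r';
}

static int is_terminator(wchar_t c) {
    return c == L'.' || c == L'?' || c == L'!';
}

/* Number of wide characters in the token that ends just before text[i]. */
static size_t token_length_before(const wchar_t *text, size_t i) {
    size_t n = 0;
    while (i > 0 && !is_space(text[i - 1]) && !is_terminator(text[i - 1])) {
        i--;
        n++;
    }
    return n;
}

/* Stores the index of each detected boundary character in out (up to
 * max_out entries) and returns the total number of boundaries found.
 * out may be NULL if only the count is needed. */
size_t find_boundaries(const wchar_t *text, size_t *out, size_t max_out) {
    assert(text != NULL);
    size_t count = 0;
    for (size_t i = 0; text[i] != L'\0'; i++) {
        if (!is_terminator(text[i])) {
            continue;
        }
        /* Rule 1: a terminator inside a token (e.g. "3.14") is not a boundary. */
        wchar_t next = text[i + 1];
        if (next != L'\0' && !is_space(next)) {
            continue;
        }
        /* Rule 2: a period after a very short token is taken as an abbreviation. */
        if (text[i] == L'.' && token_length_before(text, i) <= MAX_ABBREV_LEN) {
            continue;
        }
        if (out != NULL && count < max_out) {
            out[count] = i;
        }
        count++;
    }
    return count;
}
```

Calling `find_boundaries(text, out, n)` on a wide string fills `out` with the positions of the accepted terminators. Lengths are counted in wide characters rather than in Kannada syllables, so a consonant with a vowel sign counts as two; a fuller rule set would need to account for that, along with a list of known abbreviations.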