International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 11 - Number 12 |
Year of Publication: 2010 |
Authors: Dinesh Kumar, Prince Rana |
10.5120/1634-2196 |
Dinesh Kumar, Prince Rana . Article:Design and Development of a Stemmer for Punjabi. International Journal of Computer Applications. 11, 12 ( December 2010), 18-23. DOI=10.5120/1634-2196
Stemming is the process of removing the affixes from inflected words, without doing complete morphological analysis. A stemming Algorithm is a procedure to reduce all words with the same stem to a common form [20]. It is useful in many areas of computational linguistics and information-retrieval work. This technique is used by the various search engines to find the best solution for a problem. The algorithm is a basic building block for the stemmer. Stemmer is basically used in information retrieval system to improve the performance .The paper present a stemmer for Punjabi, which uses a brute force algorithm. We also use a suffix stripping technique in our paper. Similar techniques can be used to make stemmer for other languages such as Hindi, Bengali and Marathi. The result of stemmer is good and it can be effective in information retrieval system. This stemmer also reduces the problem of over-stemming and under-stemming.