International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 34 - Number 10 |
Year of Publication: 2011 |
Authors: S. Parameswarappa, V.N.Narayana |
10.5120/4133-5954 |
S. Parameswarappa, V.N.Narayana . Kannada Word Sense Disambiguation for Machine Translation. International Journal of Computer Applications. 34, 10 ( November 2011), 1-8. DOI=10.5120/4133-5954
Polysemous Words can have more than one distinct meaning. Word sense disambiguation (WSD) is the ability to identify the exact meaning of such polysemous words in context in a computational manner. WSD is considered as an AI-complete problem, that is, a task whose solution is at least as hard as the most difficult problem in Artificial Intelligence. In this paper, we propose an Integrated Kannada Word Sense Disambiguation system which includes a suite of high performance Natural Language Processing (NLP) modules implemented in Perl (Program Extraction and Reporting Language) to carry out word sense disambiguation task. The corpus builder module will construct the raw Kannada corpora using web. The proposed system uses randomly selected sentences from the corpora as a test bed for disambiguation. The electronic machine readable dictionary is built by Dictionary builder module using the corpora. The Target Word Sense Disambiguation module will disambiguate the potential ambiguous target words in a sentence. The polysemous verb in a sentence is disambiguated by Verb Sense Disambiguation module. The rule based disambiguator will disambiguate all ambiguous words with different lexical category. Experiments conducted and the results obtained have been described. The efficiency of the system proved to be reliable and extendable.