International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 25 - Number 8
Year of Publication: 2011 |
Authors: Poornima C, Dhanalakshmi V, Anand Kumar M, Soman K P |
10.5120/3050-4147 |
Poornima C, Dhanalakshmi V, Anand Kumar M, Soman K P. Rule based Sentence Simplification for English to Tamil Machine Translation System. International Journal of Computer Applications. 25, 8 (July 2011), 38-42. DOI=10.5120/3050-4147
Machine translation is the process of using computer software to translate text from one natural language to another. Complex sentences are generally considered difficult for any machine translation system to handle, so simplifying the input sentence becomes necessary to improve translation quality. Many approaches to simplifying complex sentences are available. This paper proposes a rule-based technique that simplifies complex sentences based on connectives such as relative pronouns and coordinating and subordinating conjunctions. The simplified output is expressed as a list of sub-sentences, each a portion of the original sentence, and the meaning of the original sentence remains unaltered. The characters '.' and '?' are used as delimiters, and an important prerequisite is that the given sentence contains a delimiter. Initial splitting is based on the delimiters, and simplification is then based on the connectives. The method is useful as a preprocessing tool for machine translation.
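The two-stage procedure described in the abstract (split on delimiters first, then on connectives) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the connective list, the function names `split_on_delimiters`, `split_on_connectives`, and `simplify`, and the choice to reattach the original delimiter to each sub-sentence are all assumptions made for illustration. The abstract does not give the actual rule set, and a real system would need rules that, for example, restore omitted subjects after a split.

```python
import re

# Hypothetical connective list for illustration only; the paper's real
# rule set is not given in the abstract.
CONNECTIVES = ["and", "but", "or", "because", "although", "which", "who"]

# A connective counts only when surrounded by whitespace, optionally
# preceded by a comma, so words such as "sand" are not split.
CONNECTIVE_PATTERN = re.compile(
    r",?\s+(?:" + "|".join(CONNECTIVES) + r")\s+", flags=re.IGNORECASE
)


def split_on_delimiters(text):
    """Stage 1: cut the text into sentences, each ending in '.' or '?'."""
    sentences = re.findall(r"[^.?]+[.?]", text)
    if not sentences:
        # Mirrors the stated prerequisite: a delimiter must be present.
        raise ValueError("input must contain a '.' or '?' delimiter")
    return [s.strip() for s in sentences]


def split_on_connectives(sentence):
    """Stage 2: cut one delimited sentence at each connective."""
    delimiter = sentence[-1]
    body = sentence[:-1]
    pieces = CONNECTIVE_PATTERN.split(body)
    # Assumption: each sub-sentence inherits the original delimiter.
    return [p.strip() + delimiter for p in pieces if p.strip()]


def simplify(text):
    """Run both stages and return a flat list of sub-sentences."""
    result = []
    for sentence in split_on_delimiters(text):
        result.extend(split_on_connectives(sentence))
    return result


if __name__ == "__main__":
    for sub in simplify("He went home because he was tired. Did she call and did he answer?"):
        print(sub)
```

The sketch drops the connective itself and splits purely on surface patterns, so it shows only the control flow of the approach, not the linguistic rules that keep the meaning unaltered.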