International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 49 - Number 6
Year of Publication: 2012
Authors: Shweta Dubey, Tarun Dhar Diwan
DOI: 10.5120/7631-0710
Shweta Dubey, Tarun Dhar Diwan. Supporting Large English-Hindi Parallel Corpus using Word Alignment. International Journal of Computer Applications. 49, 6 (July 2012), 16-19. DOI=10.5120/7631-0710
This paper describes a methodology for interpreting parallel English-Hindi sentences using word alignment. The methodology is the foundation for building a parallel English-Hindi word dictionary after syntactic and semantic analysis of the English-Hindi source text. The proposed system is applied to English and Hindi sentences, but the methodology can also be used for other languages. Large English-Hindi parallel corpora are rarely available. The development is based on two strategies to address this problem. The first is normalization of tagged English sentences and Hindi sentences. The second is mapping English-Hindi sentences using a parallel English-Hindi word dictionary. Fortunately, word alignment is well understood, and a few alignment algorithms are freely available.
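The two strategies named above can be illustrated with a minimal sketch. It is not the authors' implementation: the `word/TAG` tagging format, the function names `normalize_tagged` and `align`, and the three-entry `TOY_DICTIONARY` are all assumptions made for illustration, since the abstract does not specify them.

```python
# Hypothetical toy dictionary; the paper's actual dictionary is not given here.
TOY_DICTIONARY = {
    "ram": {"राम"},
    "eats": {"खाता"},
    "mango": {"आम"},
}


def normalize_tagged(sentence):
    """Strip assumed 'word/TAG' style tags and lowercase each token."""
    tokens = []
    for item in sentence.split():
        # Keep everything before the last '/'; untagged tokens pass through.
        word = item.rsplit("/", 1)[0]
        tokens.append(word.lower())
    return tokens


def align(english_tokens, hindi_tokens, dictionary):
    """Return (english_index, hindi_index) pairs linked by the dictionary."""
    links = []
    for i, en in enumerate(english_tokens):
        candidates = dictionary.get(en, set())
        for j, hi in enumerate(hindi_tokens):
            if hi in candidates:
                links.append((i, j))
    return links


# Strategy 1: normalize the tagged English sentence; split the Hindi sentence.
english = normalize_tagged("Ram/NNP eats/VBZ mango/NN")
hindi = "राम आम खाता है".split()

# Strategy 2: map English words to Hindi words through the dictionary.
links = align(english, hindi, TOY_DICTIONARY)
print(links)
```

Words with no dictionary entry, such as the Hindi auxiliary in this example, are simply left unaligned. A statistical aligner would be needed to link them.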