CFP last date
20 December 2024
Reseach Article

Distance-based Reordering in English to Hindi Statistical Machine Translation

by Sudhakar Kumawat, Nitish Chandra
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 89 - Number 20
Year of Publication: 2014
Authors: Sudhakar Kumawat, Nitish Chandra
10.5120/15750-4693

Sudhakar Kumawat, Nitish Chandra . Distance-based Reordering in English to Hindi Statistical Machine Translation. International Journal of Computer Applications. 89, 20 ( March 2014), 37-40. DOI=10.5120/15750-4693

@article{ 10.5120/15750-4693,
author = { Sudhakar Kumawat, Nitish Chandra },
title = { Distance-based Reordering in English to Hindi Statistical Machine Translation },
journal = { International Journal of Computer Applications },
issue_date = { March 2014 },
volume = { 89 },
number = { 20 },
month = { March },
year = { 2014 },
issn = { 0975-8887 },
pages = { 37-40 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume89/number20/15750-4693/ },
doi = { 10.5120/15750-4693 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:09:47.397066+05:30
%A Sudhakar Kumawat
%A Nitish Chandra
%T Distance-based Reordering in English to Hindi Statistical Machine Translation
%J International Journal of Computer Applications
%@ 0975-8887
%V 89
%N 20
%P 37-40
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

This paper compares different reordering models on English to Hindi statistical machine translation system. The two Indo-European languages differ significantly in their word order preferences. While English follows SVO model, Hindi follows SOV model. Therefore both long distance and short distance reordering becomes important. The reordering models available in MOSES SMT are discussed and compared with a more novel approach called distance-based reordering. This new approach significantly improves the quality of English to Hindi translation, both in terms of BLEU score and subjective human evaluation. .

References
  1. Bharati, Akshar, Vineet Chaitanya, and Rajeev Sangal. Natural Language Processing, a Paninian Perspective. Prentice Hall of India, 1995.
  2. Bojar, Ond, Pavel Stra, and Daniel Zeman. English-Hindi translation in 21 days. In Proceedings of the 6th International Conference on Natural Language Processing (ICON-2008) NLP Tools Contest, 2008.
  3. Bushra Jawaid, Daniel Zeman. Word-Order Issues in English-to-Urdu . PBML april 2011. http://ufal. mff. cuni. cz/~jawaid/publications/art-jawaid-zeman. pdf
  4. Nakul Sharma, P Bhatia, V Singh. English to Hindi Statistical Machine Translation System. Thapar University. 2011
  5. Koehn, Philipp. Statistical Machine Translation. Cambridge University Press, Cambridge, UK, 2010.
  6. Michel Galley, Christopher D. Manning. A Simple and Effective Hierarchical Phrase Reordering Model. Proceedings of the 2008 Conference Empirical Methods in Natural Language Processing . Honolulu, October 2008.
  7. Wang Ling, Joao Grac¸a, David Martins de Matos, Isabel Trancoso, Alan Black. Discriminative Phrase-based Lexicalized Reordering Models using weighted Reordering Graphs. Carnegie Mellon University, Pittsburgh, PA, USA.
  8. Jurafsky, Daniel and James H. Martin. Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. Prentice-Hall, Upper Saddle River, NJ, 2000. ISBN 0-13-095069-6.
  9. Kneser, Reinhard and Hermann Ney. Improved backing-off for m-gram language modeling. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Los Alamitos, California, USA, 1995. IEEE Computer Society Press.
  10. Yizhao Ni, Distance phrase reordering for MOSES. Pattern Analysis and Intelligent Systems Research Group. Department of Engineering Mathematics University of Bristol
  11. Chen, Stanley F. and Joshua Goodman. An empirical study of smoothing techniques for language modeling. In Technical report TR-10-98, Computer Science Group, Harvard, MA, USA, August 1998. Harvard University. URL http://research. microsoft. com/en-us/um/people/joshuago/tr-10-98. pdf.
  12. MOSES , GIZA ++,BLEU tool http://statmt. org/.
Index Terms

Computer Science
Information Sciences

Keywords

Distance-based reordering Statistical machine translation