We apologize for a recent technical issue with our email system, which temporarily affected account activations. Accounts have now been activated. Authors may proceed with paper submissions. PhDFocusTM
CFP last date
20 December 2024
Reseach Article

Improving Phrase-based Statistical Myanmar to English Machine Translation with Morphological Analysis

by Thet Thet Zin, Khin Mar Soe, Ni Lar Thein
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 28 - Number 1
Year of Publication: 2011
Authors: Thet Thet Zin, Khin Mar Soe, Ni Lar Thein
10.5120/3354-4627

Thet Thet Zin, Khin Mar Soe, Ni Lar Thein . Improving Phrase-based Statistical Myanmar to English Machine Translation with Morphological Analysis. International Journal of Computer Applications. 28, 1 ( August 2011), 13-19. DOI=10.5120/3354-4627

@article{ 10.5120/3354-4627,
author = { Thet Thet Zin, Khin Mar Soe, Ni Lar Thein },
title = { Improving Phrase-based Statistical Myanmar to English Machine Translation with Morphological Analysis },
journal = { International Journal of Computer Applications },
issue_date = { August 2011 },
volume = { 28 },
number = { 1 },
month = { August },
year = { 2011 },
issn = { 0975-8887 },
pages = { 13-19 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume28/number1/3354-4627/ },
doi = { 10.5120/3354-4627 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:13:37.476913+05:30
%A Thet Thet Zin
%A Khin Mar Soe
%A Ni Lar Thein
%T Improving Phrase-based Statistical Myanmar to English Machine Translation with Morphological Analysis
%J International Journal of Computer Applications
%@ 0975-8887
%V 28
%N 1
%P 13-19
%D 2011
%I Foundation of Computer Science (FCS), NY, USA
Abstract

This paper presents Myanmar phrases translation model with morphology analysis. The system is based on statistical approach. In statistical machine translation, large amount of information is needed to guide the translation process. When small amount of training data is available, morphological analysis is needed especially for morphology rich language. Myanmar language is inflected language and there are very few creations and researches of corpora in Myanmar, comparing to other language such as English, French, and Czech etc. Therefore, Myanmar phrases translation model is based on syntactic structure and morphology of Myanmar language. Bayes rule is also used to reformulate the translation probability of phrase pairs. Experiment results showed that proposed system can improve translation quality by applying morphological analysis on Myanmar language.

References
  1. Brown, P.E., J.Cocke, S. A. Della Pietra, V.J. Della Pietra, F.Jelinek, J.D. Lafferty, R. L. Mercer, and P. S. Roossin, “A statistical approach to machine translation”, Computational Linguistics 16(2), 79-85, 1990.
  2. Adam L. Berger, Vincent J. Della Pietra, and Stephen A. Della Pietra.1996. A maximum entropy approach to natural language processing. Computational Linguistics 22(1), 39-72.
  3. A. P. Dempster, N. M. Laird, and D. B. Rubin. 1977. Maximum likelihood from incomplete data via the em algorithm. Journal of the Royal Statistical Society, 39, 1-38.
  4. Wang, Y.-Y. And A. Waibel, “Modeling with structures in statistical machine translation.”, In proceedings of COLING/ACL 1998, Montreal, Quebec, Canada, pp. 1357-1363.
  5. Och, F. J., C. Tillmann, and H. Ney, “Improved alignment models for statistical machine translation”, In proceedings of the Conference on Empirical Method in Natural Language Processing and Very Large Corpora, University of Maryland, College Park, MD, pp. 20-28.
  6. Franz. J. Och and Hermann Ney. Discriminative training and maximum entropy models for statistical machine translation. 2002. in proceedings of ACL, pp. 295-302.
  7. Koehn, P., “Pharaoh: a beam search decoder for phrase-based statistical machine translation models”, in proceedings of the Sixth Conference of the Association for Machine Translation in the Americas, pp. 115-124, (2004).
  8. Koehn, P., H. Hoang, A. Birch, C. Callison-Burch, M. Federico, N. Bertoldi, B. Cowan, W. Shen, C. Moran, R. Zens, C. Dyer, O. Bojar, A. Constantin, and E. Herbst, “Moses: Open source toolkit for statistical machine translation”, In proceedings of the ACL 2007 Demo and Poster Sessions, pp, 177-180, (2007).
  9. Philipp Koehn, Fran Josef Och, Daniel Marcu, “Statistical Phrase-Based Translation”, Presentation at DARPA IAO Machine Translation Workshop, July 22-23, Santa Monica, CA, (2002).
  10. Goldwater, S. and McClosky, D. 2005. “ Improving statistical MT through morphological analysis”, In HLT'05: Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing. 676-683.
  11. Thai Phuong Nguyen and Akira Shimazu, “Improving Phrase-Based SMT with Morpho-Syntactic Analysis and Transformation”, Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, pages 138-147, Cambridge, August 2006.
  12. Ashin Thumana. 2004. New Method English Grammar. Second Edition. Yangon, Myanmar.
  13. Department of the Myanmar Language Commission, Ministry of Education Yangon, 1998. Myanmar-English Dictionary.
  14. Department of the Myanamar Languge Commission, Ministry of Education, Union of Myanmar. 2005. Myanmar Grammar.
Index Terms

Computer Science
Information Sciences

Keywords

Statistical Machine Translation Morphological Analysis Syntactic Structure Bayes Rules Out-of-Vocabulary