Research Article

Issues in Hindi to English and Marathi to English Machine Transliteration of Named Entities

by M. L. Dhore, S. K. Dixit, Ruchi M. Dhore
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 51 - Number 14
Year of Publication: 2012
DOI: 10.5120/8112-1728

M. L. Dhore, S. K. Dixit, Ruchi M. Dhore. Issues in Hindi to English and Marathi to English Machine Transliteration of Named Entities. International Journal of Computer Applications. 51, 14 (August 2012), 37-44. DOI=10.5120/8112-1728

@article{ 10.5120/8112-1728,
author = { M. L. Dhore and S. K. Dixit and Ruchi M. Dhore },
title = { Issues in Hindi to English and Marathi to English Machine Transliteration of Named Entities },
journal = { International Journal of Computer Applications },
issue_date = { August 2012 },
volume = { 51 },
number = { 14 },
month = { August },
year = { 2012 },
issn = { 0975-8887 },
pages = { 37-44 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume51/number14/8112-1728/ },
doi = { 10.5120/8112-1728 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:50:24.808096+05:30
%A M. L. Dhore
%A S. K. Dixit
%A Ruchi M. Dhore
%T Issues in Hindi to English and Marathi to English Machine Transliteration of Named Entities
%J International Journal of Computer Applications
%@ 0975-8887
%V 51
%N 14
%P 37-44
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Almost all transactions in domains such as travel, shopping, insurance, entertainment, hotels, and appointments are available through Internet-based applications, and all of these applications require knowledge of English. As the number of Internet users grows day by day, there is great demand for tools and applications that support Indian languages for them. Local language support in web-based commercial applications can be provided by two techniques: Machine Translation, which translates the static labels on web forms, and Machine Transliteration, which transliterates dynamic user input from the local language into the default language, English. Transliterating names and technical terms that occur in user input is challenging across languages with different alphabets and sound inventories. This paper focuses on important issues that frequently occur in Hindi to English and Marathi to English machine transliteration of named entities.

Index Terms

Computer Science
Information Sciences

Keywords

Allophones, Language of Origin, Multiple Transliterations, Orthography, Phonology