CFP last date
20 December 2024
Reseach Article

Improving Indian Language Dependency Parsing by Combining Transition-based and Graph-based Parsers

by B.venkata Seshu Kumari, R. Rajeswara Rao
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 115 - Number 5
Year of Publication: 2015
Authors: B.venkata Seshu Kumari, R. Rajeswara Rao
10.5120/20146-2275

B.venkata Seshu Kumari, R. Rajeswara Rao . Improving Indian Language Dependency Parsing by Combining Transition-based and Graph-based Parsers. International Journal of Computer Applications. 115, 5 ( April 2015), 13-17. DOI=10.5120/20146-2275

@article{ 10.5120/20146-2275,
author = { B.venkata Seshu Kumari, R. Rajeswara Rao },
title = { Improving Indian Language Dependency Parsing by Combining Transition-based and Graph-based Parsers },
journal = { International Journal of Computer Applications },
issue_date = { April 2015 },
volume = { 115 },
number = { 5 },
month = { April },
year = { 2015 },
issn = { 0975-8887 },
pages = { 13-17 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume115/number5/20146-2275/ },
doi = { 10.5120/20146-2275 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:53:55.369393+05:30
%A B.venkata Seshu Kumari
%A R. Rajeswara Rao
%T Improving Indian Language Dependency Parsing by Combining Transition-based and Graph-based Parsers
%J International Journal of Computer Applications
%@ 0975-8887
%V 115
%N 5
%P 13-17
%D 2015
%I Foundation of Computer Science (FCS), NY, USA
Abstract

We report our dependency parsing experiments on two Indian Languages, Telugu and Hindi. We first explore two most popular dependency parsers namely, Malt parser and MST parser. Considering pros of both these parsers, we develop a hybrid approach combining the output of these two parsers in an intuitive manner. For Hindi, we report our results on test data provided in the for gold standard track of Hindi Shared Task on Parsing at workshop on Machine Translation and parsing in Indian Languages, Coling 2012. Our system secured unlabeled attachment score of 95. 2% and labelled attachment score 90. 7%. For Telugu, we report our results on test data provided in the ICON 2010 Tools Contest on Indian Languages Dependency Parsing. Our system secured unlabeled attachment score of 92. 0% and labelled attachment score 69. 5%.

References
  1. Bharat Ram Ambati, Phani Gadde, and Karan Jindal. 2009. Experiments in Indian Language Dependency Parsing. In Proceedings of the ICON09 NLP Tools Contest: Indian Language Dependency Parsing, pp 32-37.
  2. G. Attardi, S. D. Rossi, and M. Simi. 2010. Dependency Parsing of Indian Languages with DeSR. In ICON-2010 tools contest on Indian language dependency parsing. Kharagpur, India.
  3. Rafiya Begum, Samar Husain, Arun Dhwaj, Dipti Misra Sharma, Lakshmi Bai and Rajeev Sangal. 2008. Dependency annotation scheme for Indian languages. In Proceedings of IJCNLP-2008.
  4. Akshar Bharati, Vineet Chaitanya, and Rajeev Sangal. 1995. Natural Language Processing: A Paninian Perspective, Prentice-Hall of India, New Delhi, pp. 65-106.
  5. Akshar Bharati, Samar Husain, and Rajeev Sangal. 2008. A Two-Stage Constraint Based Dependency Parser for Free Word Order Languages. In Proceedings of the COLIPS International Conference on Asian Language Processing 2008 (IALP), Chiang Mai, Thailand.
  6. Akshar Bharati, Dipti Misra Sharma, Samar Husain, Lakshmi Bai, Rafiya Begum, and Rajeev Sangal. 2009. AnnCorra: TreeBanks for Indian Languages, Guidelines for Annotating Hindi TreeBank (version 2. 0). http://ltrc. iiit. ac. in/MachineTrans/research/tb/DS-guidelines/DS-guidelines-ver2-28-05-09. pdf.
  7. Sabine Buchholz and Erwin Marsi. 2006. CoNLL-X shared task on multilingual dependency parsing. In Tenth Conf. on Computational Natural Language Learning (CoNLL).
  8. A. Ghosh, P. Bhaskar, A. Das, and S. Bandyopadhyay. 2009. Dependency Parser for Bengali: the JU System at ICON 2009. In ICON09 NLP Tools Contest: Indian Language Dependency Parsing. Hyderabad, India.
  9. E. Hajicova. 1998. Prague Dependency Treebank: From Analytic to Tectogrammatical Annotation. In Proc. TSD'98.
  10. Richard Hudson. 1984. Word Grammar, Basil Blackwell, 108 Cowley Rd, Oxford, OX4 1JF, England.
  11. Samar Husain. 2009. Dependency Parsers for Indian Languages. In ICON09 NLP Tools Contest: Indian Language Dependency Parsing. Hyderabad, India.
  12. Samar Husain, Prashanth Mannem, Bharat Ambati and Phani Gadde. 2010. The ICON-2010 Tools Contest on Indian Language Dependency Parsing. In ICON-2010 Tools Contest on Indian Language Dependency Parsing. Kharagpur, India.
  13. S. R. Kesidi, P. Kosaraju, M. Vijay, and S. Husain. 2010. A Two Stage Constraint Based Hybrid Dependency Parser for Telugu. In ICON-2010 tools contest on Indian language dependency parsing. Kharagpur, India.
  14. Prudhvi Kosaraju, Sruthilaya Reddy Kesidi, Vinay Bhargav Reddy Ainavolu and Puneeth Kukkadapu. 2010. Experiments on Indian Language Dependency Parsing. In ICON-2010 tools contest on Indian language dependency parsing. Kharagpur, India.
  15. M. Marcus, B. Santorini, and M. A. Marcinkiewicz. 1993. Building a large annotated corpus of English: The Penn Treebank, Computational Linguistics
  16. Ryan McDonald, Kevin Lerman, and Fernando Pereira. 2006. Multilingual dependency analysis with a two-stage discriminative parser. In Tenth Conference on Computational Natural Language Learning (CoNLL-X), pp. 216–220.
  17. Ryan McDonald and Joakim Nivre. 2007. Characterizing the errors of data-driven dependency parsing models. In Proceedings of the Conference on Empirical Methods in Natural Language Processing and Natural Language Learning.
  18. Joakim Nivre. 2003. An efficient algorithm for projective dependency parsing. In Proceedings of the 8th International Workshop on Parsing Technologies (IWPT), pages 149–160.
  19. Joakim Nivre. 2008. Algorithms for deterministic incremental dependency parsing. Computational Linguistics, 34(4):513–553.
  20. Joakim Nivre. 2009. Parsing Indian Languages with MaltParser. In ICON09 NLP Tools Contest: Indian Language Dependency Parsing. Hyderabad, India.
  21. Joakim Nivre and Jens Nilsson. 2005. Pseudo-projective dependency parsing. In ACL '05: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, pages 99–106, Ann Arbor, Michigan.
  22. Joakim Nivre, Johan Hall, Sandra Kubler, Ryan McDonald, Jens Nilsson, Sebastian Riedel, and Deniz Yuret. 2007a. The CoNLL 2007 Shared Task on Dependency Parsing. In Proceedings of the CoNLL Shared Task Session of EMNLP-CoNLL 2007. Prague, Czech Republic, 915–932.
  23. Joakim Nivre, Johan Hall, Jens Nilsson, Atanas Chanev, Gulsen Eryigit, Sandra Kubler, Svetoslav Marinov, and Erwin Marsi. 2007b. Malt parser: A language-independent system for data-driven dependency parsing. Natural Language Engineering 13, 2 (2007), 95–135.
  24. Sebastian Riedel , Ruket Cakici and Ivan Meza-Ruiz. 2006. Multi-lingual Dependency Parsing with Incremental Integer Linear Programming. In Proceedings of the Tenth Conference on Computational Natural Language Learning (CoNLL-X).
  25. Stuart M. Shieber. 1985. Evidence against the context-freeness of natural language. In Linguistics and Philosophy, p. 8, 334–343.
  26. M. V. Yeleti, and K. Deepak. 2009. Constraint based Hindi dependency parsing. In ICON09 NLP Tools Contest: Indian Language Dependency Parsing. Hyderabad, India.
  27. D. Zeman. 2009. Maximum Spanning Malt: Hiring World's Leading Dependency Parsers to Plant Indian Trees. In ICON09 NLP Tools Contest: Indian Language Dependency Parsing. Hyderabad, India.
Index Terms

Computer Science
Information Sciences

Keywords

Dependency Parsing Telugu Hindi Malt Parser MST Parser