CFP last date
20 December 2024
Reseach Article

Using Word Sketches to Resolve Prepositional Phrase Attachment Ambiguity in Arabic

by Imtiaz Hussain Khan
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 177 - Number 16
Year of Publication: 2019
Authors: Imtiaz Hussain Khan
10.5120/ijca2019919618

Imtiaz Hussain Khan . Using Word Sketches to Resolve Prepositional Phrase Attachment Ambiguity in Arabic. International Journal of Computer Applications. 177, 16 ( Nov 2019), 51-56. DOI=10.5120/ijca2019919618

@article{ 10.5120/ijca2019919618,
author = { Imtiaz Hussain Khan },
title = { Using Word Sketches to Resolve Prepositional Phrase Attachment Ambiguity in Arabic },
journal = { International Journal of Computer Applications },
issue_date = { Nov 2019 },
volume = { 177 },
number = { 16 },
month = { Nov },
year = { 2019 },
issn = { 0975-8887 },
pages = { 51-56 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume177/number16/30987-2019919618/ },
doi = { 10.5120/ijca2019919618 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T00:46:06.452891+05:30
%A Imtiaz Hussain Khan
%T Using Word Sketches to Resolve Prepositional Phrase Attachment Ambiguity in Arabic
%J International Journal of Computer Applications
%@ 0975-8887
%V 177
%N 16
%P 51-56
%D 2019
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Resolving prepositional-phrase (PP) attachment ambiguity is a challenging task in natural language processing. Unlike English language, researchers has paid little attention to address this problem in Arabic language. In this study, we use word collocation data derived from a large Arabic corpus to predict the most likely interpretation of potentially ambiguous PP-attachment phrases. We administered an empirical study in which human participants were presented with Arabic text involving potential PP-attachment ambiguity and their task was to judge whether the PP is attached to the preceding noun (low attachment) or verb (high attachment), or it is unclear. This exercise was used to collect a small-size labelled corpus of 50 examples (= 5 prepositions x 10 phrases). Subsequently, this labeled corpus was analysed to derive rules based on words collocational frequencies obtained from sketch engine operated on arTenTen12 corpus. Finally, the derived rules were validated using human judgment on unseen examples which were not used during the rules derivation step. We achieve 83% precision and 88% recall, which suggests that words collocation data generated by sketch engine can be used to resolve PP-attachment ambiguities.

References
  1. N. Habash, "Arabic tutorial.," in The fifth international conference on Language Resources and Evaluation, LREC’06, 2006., 2006.
  2. A. Farghaly and K. Shaalan, "Arabic natural language processing: Challenges and solutions.," ACM transactions on Asian language information processing (TALIP)., 2009.
  3. N. Habash, Introduction to Arabic natural language processing., Morgan & Claypool Publishers., 2010.
  4. K. Shaalan, A. A. Monem, A. Rafea and H. Baraka, "Generating Arabic text from interlingua.," in Proceedings of the 2nd workshop on computational approaches to Arabic script-based languages, Stanford, USA, 2008.
  5. A. Rozovskaya, R. Sproat and E. Benmamoun, "Challenges in processing colloquial Arabic: The challenge of Arabic for NLP/MT.," in Proceedings of international conference at the British computer society., London, 2006.
  6. K. Shaalan, A. Rafea, H. Baraka and A. A. Monem, "Generating Arabic text from interlingua.," in Proceedings of the 2nd workshop on computational approaches to Arabic script-based languages, Stanford, USA, 2008.
  7. K. Darwish, "Building a shallow Arabic morphological analyzer in one day.," in Proceedings of the computational approaches to semitic languages, a workshop affiliated with ACL-2002., 2002.
  8. A. Willis, F. Chantree and A. De Roeck, "Automatic identification of nocuous ambiguity.," Research on language and computation., vol. 6, no. 3, pp. 355-374, 2008.
  9. I. H. Khan, K. Van Deemter and G. Ritchie, "Managing ambiguity in reference generation: The role of surface structure," Topics in cognitive science., 2012.
  10. A. Kilgarriff, P. Rychly, P. Smrz and D. Tugwell, "The sketch engine.," in Proceedings of EURALEX., 2004.
  11. T. Arts, Y. Belinkov, N. Habash, A. Kilgarriff and V. Suchomele, "arTenTen: Arabic corpus and word sketches.," Journal of King Saud University - computer and information sciences., vol. 26, no. 4, pp. 357-371, 2014.
  12. D. Hindle and M. Rooth, "Structural ambiguity and lexical relations.," Computational Linguistics, pp. 103-120, 1993.
  13. P. Nakov and M. Hearst, "Using the web as an implicit training set: application to structural ambiguity resolution.," in Proceedings of the conference on human language technology and empirical methods in natural language processing., 2005.
  14. A. Ratnaparkhi, J. Reynar and S. Roukos, "A maximum entropy model for prepositional phrase attachment.," in Proceedings of the ARPA human language technology workshop., 1994.
  15. M. Collins and J. Brooks, "Prepositional phrase attachment through a backed-off model.," in Proceedings of the third workshop on very large corpora., 1995.
  16. S. Zhao and D. Lin, "A nearest-neighbor method for resolving PP-attachment ambiguity.," in Proceedings of the first international joint conference on natural language processing (IJCNLP-04)., 2004.
  17. M. Olteanu and D. Moldovan, "PP-attachment disambiguation using large context.," in Proceedings of human language technology conference and conference on empirical methods in natural language processing (HLT/EMNLP)., 2005.
  18. R. Al-sabbagh and K. Elghamry, "A Web-based approach for Arabic PP-attachment.," in Proceedings of the 6th international conference on informatics and systems., Cairo, Egypt, 2008.
  19. E. Othman, K. Shaalan and A. Rafea, "Towards resolving ambiguity in understanding Arabic sentence.," in Proceedings of international conference on Arabic language resources and tools, 2004.
  20. E. Othman, K. Shaalan and A. Rafea, "A chart parser for analyzing modern standard Arabic sentence.," in MT summit IX workshop on machine translation for semitic languages: issues and approaches, New Orleans, Louisiana, USA, 2003.
  21. K. Daimi, "Identifying syntactic ambiguities in single-parse Arabic sentence.," Department of mathematics and computer science, University of Detroit Mercy, 2001.
  22. M. Hayadre, D. Kurzon, O. Peleg and E. Zohar, "Ambiguity resolution in lateralized Arabic.," Journal of reading and writing., vol. 28, no. 3, pp. 395-418, 2015.
  23. N. Ghezaie and K. Haddar, "Toward the resolution of Arabic lexical ambiguities with transduction on text automaton.," in Proceedings of first international conference on Arabic computational linguistics., 2015.
  24. M. A. Attia, "An ambiguity-controlled morphological analyzer for modern standard Arabic modelling finite state networks.," in Proceedings of challenges of Arabic for NLP/MT conference., 2008.
  25. A. T. Al-Taani, N. A. K. Al-Awad and H. Abu-Salem, "An adaptive parser for Arabic language processing.," International journal of computer processing of languages., vol. 23, no. 1, pp. 67-80, 2011.
  26. K. Shalaan, "Rule-based approach in Arabic natural language processing.," International journal on information and communication technologies., 2010.
  27. M. H. Hamdan and I. H. Khan, "An analysis of prepositional-phrase attachment disambiguation.," International Journal of Computational Linguistics Research., vol. 9, no. 2, pp. 60-80, 2018.
Index Terms

Computer Science
Information Sciences

Keywords

Arabic word sketches pp-attachment ambiguity ambiguity resolution arTenTen12 corpus sketch engine