Research Article

Sensitivity Analysis of Feature Set Employed for Anaphora Resolution

by Pardeep Singh, Kamlesh Dutta
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 128 - Number 14
Year of Publication: 2015
Authors: Pardeep Singh, Kamlesh Dutta
10.5120/ijca2015906732

Pardeep Singh, Kamlesh Dutta. Sensitivity Analysis of Feature Set Employed for Anaphora Resolution. International Journal of Computer Applications. 128, 14 (October 2015), 10-14. DOI=10.5120/ijca2015906732

@article{ 10.5120/ijca2015906732,
author = { Pardeep Singh and Kamlesh Dutta },
title = { Sensitivity Analysis of Feature Set Employed for Anaphora Resolution },
journal = { International Journal of Computer Applications },
issue_date = { October 2015 },
volume = { 128 },
number = { 14 },
month = { October },
year = { 2015 },
issn = { 0975-8887 },
pages = { 10-14 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume128/number14/22940-2015906732/ },
doi = { 10.5120/ijca2015906732 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:21:38.378836+05:30
%A Pardeep Singh
%A Kamlesh Dutta
%T Sensitivity Analysis of Feature Set Employed for Anaphora Resolution
%J International Journal of Computer Applications
%@ 0975-8887
%V 128
%N 14
%P 10-14
%D 2015
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Sensitivity analysis is a systematic review of a sequence of parameters, feature sets, and decisions that measures the impact of each on a study. It guides researchers in evaluating parameters and judging their relevance to the study. In this paper we consider two features out of the seven tags that were employed to resolve anaphora in Hindi. These tags and their values were analyzed empirically on the corpus. We analyzed 165 news items from Ranchi Express in the plain-text EMILLE corpus, comprising 1745 sentences. Eight dialogue files from the same corpus, containing 1521 sentences, were also analyzed. We used the tag sets, and their features, proposed by different authors.
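
As a rough illustration of what a feature sensitivity analysis can look like, the sketch below removes one feature at a time and records how much the accuracy of a simple classifier drops. This is not the authors' procedure: the feature names (`position`, `number`), the labels, the four toy rows, and the majority-vote lookup classifier are all hypothetical and chosen only to keep the example small.

```python
from collections import Counter, defaultdict


def accuracy(rows, features, label="label"):
    """Resubstitution accuracy of a majority-vote lookup table over `features`."""
    votes = defaultdict(Counter)
    for row in rows:
        key = tuple(row[f] for f in features)
        votes[key][row[label]] += 1
    # Within each feature-value combination, the majority label is predicted,
    # so the number of correct predictions is the largest label count.
    correct = sum(max(counts.values()) for counts in votes.values())
    return correct / len(rows)


def sensitivity(rows, features, label="label"):
    """Accuracy drop observed when each feature is left out in turn."""
    base = accuracy(rows, features, label)
    return {
        f: base - accuracy(rows, [g for g in features if g != f], label)
        for f in features
    }


# Hypothetical toy data: two invented tags and an invented anaphora label.
rows = [
    {"position": "near", "number": "sg", "label": "direct"},
    {"position": "near", "number": "pl", "label": "direct"},
    {"position": "far", "number": "sg", "label": "indirect"},
    {"position": "far", "number": "pl", "label": "indirect"},
]
features = ["position", "number"]

print(sensitivity(rows, features))  # {'position': 0.5, 'number': 0.0}
```

In this toy set, dropping `position` halves the accuracy while dropping `number` changes nothing, so `position` would be judged the more relevant tag. A real study would measure accuracy on held-out data rather than on the training rows.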

Index Terms

Computer Science
Information Sciences

Keywords

Coreference resolution, Sensitivity analysis, Anaphora resolution, Annotation