CFP last date
20 December 2024
Reseach Article

Evaluation of Punjabi Named Entity Recognition using Context Word Feature

by Amandeep Kaur, Gurpreet Singh Josan
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 96 - Number 20
Year of Publication: 2014
Authors: Amandeep Kaur, Gurpreet Singh Josan
10.5120/16913-7011

Amandeep Kaur, Gurpreet Singh Josan . Evaluation of Punjabi Named Entity Recognition using Context Word Feature. International Journal of Computer Applications. 96, 20 ( June 2014), 32-38. DOI=10.5120/16913-7011

@article{ 10.5120/16913-7011,
author = { Amandeep Kaur, Gurpreet Singh Josan },
title = { Evaluation of Punjabi Named Entity Recognition using Context Word Feature },
journal = { International Journal of Computer Applications },
issue_date = { June 2014 },
volume = { 96 },
number = { 20 },
month = { June },
year = { 2014 },
issn = { 0975-8887 },
pages = { 32-38 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume96/number20/16913-7011/ },
doi = { 10.5120/16913-7011 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:22:19.014028+05:30
%A Amandeep Kaur
%A Gurpreet Singh Josan
%T Evaluation of Punjabi Named Entity Recognition using Context Word Feature
%J International Journal of Computer Applications
%@ 0975-8887
%V 96
%N 20
%P 32-38
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Named Entity Recognition is the task of identifying and classifying Named Entities in the given text. In this paper evaluation of Named Entity Recognition in Punjabi language has been performed using context word feature. Words preceding and succeeding the target word are very helpful in determining its category. In this work context word feature of word window 7, 5 and 3 have been used. Experiments have been performed using different training and test sets. In this evaluation a Named Entity Tagset of 14 tags namely PERSON, ORGANIZATION, LOCATION, FACILITY, EVENT, RELATIONSHIP, TIME, DATE, DESIGNATION, TITLE-PERSON, NUMBER, MEASURE, ABBREVIATION and ARTIFACT has been used. It has been observed that word window 7 and 5 have given better results as compared to word window 3. Although F-scores and Precision values of word window 7 are slightly higher than that of word window 5 but recall of word window 7 was found to be lower than that word window 5.

References
  1. Borthwick, A. , 1999. Maximum Entropy Approach to Named Entity Recognition. Ph. D. dissertation, Comput. Sci. Dept. , New York Univ. , New York, USA.
  2. Chaudhuri, B. B. and Bhattacharya, S. , 2008. An Experiment on Automatic Detection of Named Entities in Bangla. In Proceedings of the IJCNLP-08 Workshop on NER for South and South East Asian Languages. 75-82.
  3. Ekbal, A. and Bandyopadhyay, S. , 2008. Bengali Named Entity Recognition using Support Vector Machine. In Proceedings of the IJCNLP-08 Workshop on NER for South and South East Asian Languages. 51–58.
  4. Ekbal, A. , Haque, R. , Das, A. , Poka V. and Bandyopadhyay, S. , 2008. Language Independent Named Entity Recognition in Indian Languages. In Proceedings of the IJCNLP-08 Workshop on NER for South and South East Asian Languages. 33–40.
  5. Gali, K. , Surana, H. , Vaidya, A. , Shishtla, P. and Sharma, D. M. , 2008. Aggregating Machine Learning and Rule Based Heuristics for Named Entity Recognition. In Proceedings of the IJCNLP-08 Workshop on NER for South and South East Asian Languages. 25-32.
  6. Grishman, R. and Sundheim B. , 1996. Message Understanding Conference - 6: A Brief History. In the Proceedings of the 16th International Conference on Computational Linguistics (COLING). 466 – 471.
  7. Kaur, A. and Josan, G. , 2014. Improved Named Entity Tagset for Punjabi Language. In the Proceedings of 2014 RAECS.
  8. Kaur, A. , Josan, G. and Kaur, J. , 2009. Named Entity Recognition For Punjabi: A Conditional Random Field Approach. In Proceedings of ICON-2009: 7th International Conference on Natural Language Processing. 277-282.
  9. Lafferty, J. D. , McCallum, A. and Pereira, F. C. N. , 2001. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data. In Proceedings of International Conference on Machine Learning. 282-289
  10. Mansouri, A. , Suriani Affendey, L. and Mamat, A. , 2008. Named Entity Recognition Approaches. International Journal of Computer Science and Network Security. 339-344.
  11. Saha, S. K. , Chatterji, S. , Dandapat, S. , Sarkar, S. and Mitra, P. , 2008. A Hybrid Approach for Named Entity Recognition in Indian Language. In Proceedings of the IJCNLP-08 Workshop on NER for South and South East Asian Languages. 17-24.
  12. Sang, E. F. T. K. and Meulder, F. D. , 2003. Introduction to the CoNLL-2003 shared task: Language-independent named entity recognition. In Proceedings of 7th Conference on Natural Language Learning CoNLL-2003.
  13. Sang, E. F. T. K. , 2002. Introduction to the CoNLL- 2002 shared task: Language-independent named entity recognition. In Proceedings of 6th Workshop on Computational Language Learning, CoNLL-2002.
  14. Sekine, S. and Ishara, H. , 2000. IREX: IR & IE evaluation project in Japanese. In Proceedings of the 2nd International Conference on Language Resources and Evaluation.
  15. Sekine, S. , Sudo, K. and Nobata, C. , 2002. Extended Named Entity Hierarchy. In Proceedings of the 3rd International Conference on Language Resources and Evaluation, LREC 2002.
  16. Shishtla, P. M. , Gali, K. , Pingali P. and Varma, V. , 2008. Experiments in Telugu NER: A Conditional Random Field Approach. In Proceedings of the IJCNLP-08 Workshop on NER for South and South East Asian Languages. 105-110.
  17. Singh, A. K. , 2008. Named Entity Recognition for South and South East Asian Languages: Taking Stock. In Proceedings of the IJCNLP-08 Workshop on NER for South and South East Asian Languages. 5–16.
  18. Srikanth, P. and Murthy, K. N. , 2008. Named Entity Recognition for Telugu. In Proceedings of the IJCNLP-08 Workshop on NER for South and South East Asian Languages. 41-50.
Index Terms

Computer Science
Information Sciences

Keywords

Named Entities Named Entity Recognition Punjabi Language Context word feature