CFP last date
20 January 2025
Reseach Article

Named Entity Recognition using Gazetteer Method and N-gram Technique for an Inflectional Language: A Hybrid Approach

by Arindam Dey, Bipul Syam Prukayastha
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 84 - Number 9
Year of Publication: 2013
Authors: Arindam Dey, Bipul Syam Prukayastha
10.5120/14607-2859

Arindam Dey, Bipul Syam Prukayastha . Named Entity Recognition using Gazetteer Method and N-gram Technique for an Inflectional Language: A Hybrid Approach. International Journal of Computer Applications. 84, 9 ( December 2013), 31-35. DOI=10.5120/14607-2859

@article{ 10.5120/14607-2859,
author = { Arindam Dey, Bipul Syam Prukayastha },
title = { Named Entity Recognition using Gazetteer Method and N-gram Technique for an Inflectional Language: A Hybrid Approach },
journal = { International Journal of Computer Applications },
issue_date = { December 2013 },
volume = { 84 },
number = { 9 },
month = { December },
year = { 2013 },
issn = { 0975-8887 },
pages = { 31-35 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume84/number9/14607-2859/ },
doi = { 10.5120/14607-2859 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:00:29.932996+05:30
%A Arindam Dey
%A Bipul Syam Prukayastha
%T Named Entity Recognition using Gazetteer Method and N-gram Technique for an Inflectional Language: A Hybrid Approach
%J International Journal of Computer Applications
%@ 0975-8887
%V 84
%N 9
%P 31-35
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Named Entity Recognition (NER) is a task to discover the Named Entities (NEs) in a document and then categorize these NEs into diverse Named Entity classes such as Name of Person, Location, River, Organization etc. Area of concentration is the performance of NER in the Indian languages (IL). Nepali is the target language. In this paper different technique of NER and a brief introduction of Gazetteer method and Hidden Markov Model especially n-gram technique has been described. Different types of problem faced in handling Nepali Grammar are also described.

References
  1. W. Li and A. McCallum, Sept 2003 "Rapid Development of Hindi Named Entity Recognition using Conditional Random Fields and Feature Induction(Short Paper)," ACM Transactions on Computational Logic.
  2. Suleiman H. Mustafa and Qasem A. Al-Radaideh 2004 "Using N-Grams for Arabic Text Searching" journal of the american society for information science and technology.
  3. Zubek, R. 2006. Introduction to Hidden Markov Models. In Rabin, S. (ed. ), AI Game Programming Wisdom 3. Charles River Media, Hingham, MA.
  4. A. Ekbal and S. Bandyopadhyay 2008, "Named Entity Recognition using Support Vector Machine: A Language Independent Approach," International Journal of Computer, Systems Sciences and Engg (IJCSSE.
  5. S. K. Saha, S. Sarkar, and P. Mitra January 2008, "A Hybrid Feature Set based Maximum Entropy Hindi Named Entity Recognition," in Proceedings of the 3rd International Joint Conference on NLP, Hyderabad , India.
  6. A. Goyal , "Named Entity Recognition for South Asian Languages Jan 2008," in Proceedings of the IJCNLP-08 Workshop on NER for South and South-East Asian Languages, Hyderabad, India.
  7. S. K. Saha, P. S. Ghosh, S. Sarkar, and P. Mitra 2008, "Named Entity Recognition in Hindi using Maximum Entropy and Transliteration," Research journal on Computer Science and Computer Engineering with Applications.
  8. Asif Ekbal, Rajewanul Hague, Amitava Das, Venkateswarlu Poka and Sivaji Bandyopadhyay 2008 "Language Independent Named Entity Recognition in Indian Languages" Proceedings of the IJNLP-08 Workshop on NER for South and South East Asian Languages, Hyderabad, India.
  9. M. Hasanuzzaman, A. Ekbal, and S. Bandyopadhyay, May 2009, "Maximum Entropy Approach for Named Entity Recognition in Bengali and Hindi, "International Journal of Recent Trends in Engineering, vol. 1.
  10. Anastasia Rita Widiarti, and Phalita Nari Wastu 2009, "Javanese Character Recognition Using Hidden Markov Model"World Academy of Science, Engineering and Technology 33.
  11. Padmaja Sharma, Utpal Sharma, Jugal Kalita May 2011, "Named Entity Recognition: A Survey for the Indian Languages".
  12. David Nadeau, Peter D. Turney and Stan Matwin March 11 , 2011, "Unsupervised Named-Entity Recognition: Generating Gazetteers and Resolving Ambiguity" National Research Council Canada.
  13. Nusrat Jahan, Sudha Morwal and dipti Chopra 12 Dec 2012,"Named Entity Recognition in Indian Languages Using Gazetteer Method and Hidden Markov Model: A Hybrid Approach".
  14. Deepti Chopra, Sudha Morwal Dec 12, 2012, "Named Entity Recognition in Punjabi Using Hidden Markov Model", "International Journal of Computer Science & Engineering Technology (IJCSET)".
  15. David Nadeau, Satoshi Sekine , "A survey of named entity recognition and classification" National Research Council Canada / New York University.
  16. Sujan Kumar Saha, Sudeshna Sarkar, Pabitra Mitra "Gazetteer Preparation for Named Entity Recognition in Indian Languages". Available at: http://www. aclweb. org/anthology-new/I/I08/I08-7002. pdf
  17. M. N. Karthik, Moshe Davis "Search Using N-gram Technique Based Statistical Analysis for Knowledge Extraction in Case Based Reasoning Systems" .
  18. B. Sasidhar#1, P. M. Yohan*2, Dr. A. Vinaya Babu3, Dr. A. Govardhan4," A Survey on Named Entity Recognition in Indian Languages with particular reference to Telugu", http://www. ijcsi. org/papers/IJCSI-8-2-438-443. pdf
  19. A. Ekbal, R. Hague, and S. Bandyopadhyay, "Named Entity Recognition in Bengali: A Conditional Random Field," in Proceedings of ICON, India, pp. 123–128.
Index Terms

Computer Science
Information Sciences

Keywords

Named Entities (NEs) Named Entity Recognition (NER) Indian Languages (ILs) and Hidden Markov Model (HMM).