We apologize for a recent technical issue with our email system, which temporarily affected account activations. Accounts have now been activated. Authors may proceed with paper submissions. PhDFocusTM
CFP last date
20 November 2024
Reseach Article

Named Entity Recognizer employing Multiclass Support Vector Machines for the Development of Question Answering Systems

by Bindu.M.S, Sumam Mary Idicula
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 25 - Number 10
Year of Publication: 2011
Authors: Bindu.M.S, Sumam Mary Idicula
10.5120/3146-4343

Bindu.M.S, Sumam Mary Idicula . Named Entity Recognizer employing Multiclass Support Vector Machines for the Development of Question Answering Systems. International Journal of Computer Applications. 25, 10 ( July 2011), 40-46. DOI=10.5120/3146-4343

@article{ 10.5120/3146-4343,
author = { Bindu.M.S, Sumam Mary Idicula },
title = { Named Entity Recognizer employing Multiclass Support Vector Machines for the Development of Question Answering Systems },
journal = { International Journal of Computer Applications },
issue_date = { July 2011 },
volume = { 25 },
number = { 10 },
month = { July },
year = { 2011 },
issn = { 0975-8887 },
pages = { 40-46 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume25/number10/3146-4343/ },
doi = { 10.5120/3146-4343 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:11:28.399006+05:30
%A Bindu.M.S
%A Sumam Mary Idicula
%T Named Entity Recognizer employing Multiclass Support Vector Machines for the Development of Question Answering Systems
%J International Journal of Computer Applications
%@ 0975-8887
%V 25
%N 10
%P 40-46
%D 2011
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Named Entity Recognition (NER) seeks to locate and classify atomic elements in text into predefined categories such as names of person, organization, location, Quantities, Percentage etc. Named entities tell us the roles of each meaning bearing word in a sentence and hence identification of these entities certainly helps us to extract the essence of the text which is very important in Question Answering(QA) , Information Extraction (IE) and Summarization. The system presented here is a Named Entity (NE) Classifier created using Multiclass Support Vector Machines based on linguistic grammar principles. Malayalam NER is a difficult task as each word of named entity has no specific feature such as Capitalization feature in English. NERs in other languages are not suitable for Malayalam language since its morphology, syntax and lexical semantics is different from them. Also there is no tagged corpus available for training. For testing this system, documents from well known Malayalam news papers and magazines containing passages from five different fields such as sports, health, politics, science and agriculture are selected. Experimental results show that the average precision recall and F-measure values are 89.12%, 89.15% and 89.13% respectively.

References
  1. Diego Moll ´a and Menno van Zaanen and Daniel Smith, “Named Entity Recognition for Question Answering”, Proceedings of the 2006 Australasian Language Technology Workshop (ALTW2006), pages 51–58
  2. Asif Ekbal, Rejwanul Haque, Amitava Das, Venkateswarlu Poka and Sivaji Bandyopadhyay, “Language Independent Named Entity Recognition in Indian Languages”, Proceedings of the IJCNLP-08 Workshop on NER for South and South East Asian Languages, pages 33–40,Hyderabad, India, January 2008.
  3. Lev Ratinov Dan Roth, “Design Challenges and Misconceptions in Named Entity Recognition”, Proceedings of the Thirteenth Conference on Computational Natural Language Learning (CoNLL), pages 147–155, Boulder, Colorado, June 2009
  4. A .R .Rajarajavarma,” Keralapanineeyam”, National Book Stall, Kottayam, 2000.
  5. Asif Ekbal and Sivaji Bandyopadhyay,” Named Entity Recognition using Support Vector Machine: A Language Independent Approach”, International Journal of Electrical and Electronics Engineering 4:2 2010
  6. Kashif Riaz , “Rule-based Named Entity Recognition in Urdu”, Proceedings of the 2010 Named Entities Workshop, ACL 2010, pages 126–135, Uppsala, Sweden, 16 July 2010.
  7. B. Sasidhar, P. M. Yohan,Dr. A. Vinaya Babu, Dr. A. Govardhan,” Named Entity Recognition in Telugu Language using Language Dependent Features and Rule based Approach”, International Journal of Computer Applications (0975 – 8887) Volume 22– No.8, May 2011
  8. Mohammad Hasanuzzaman1, Asif Ekbal2 and Sivaji Bandyopadhyay3,” Maximum Entropy Approach for Named Entity Recognition in Bengali and Hindi”, International Journal of Recent Trends in Engineering, Vol. 1,No.1, May 2009
  9. Kitoogo Fredrick Edward, Venansius Baryamureeba and Guy De Pauw; Towards Domain Independent Named Entity Recognition, International Journal of Computing and ICT Research, Vol. 2, No. 2, pp. 84 –95
  10. GuoDong Zhou Jian Su ,” Named Entity Recognition using an HMM-based Chunk Tagger”, “Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL), Philadelphia, July 2002, pp. 473-480.
  11. Asif Ekbal and Sivaji Bandyopadhyay, “Named Entity Recognition Using Appropriate Unlabeled Data, Post-processing and Voting”, Informatica 34 (2010) 55–76
  12. Burr Settles,” Biomedical Named Entity Recognition Using Conditional Random Fields and Rich Feature Sets”, Proceedings of the COLING 2004 International Joint Workshop on Natural Language Processing in Biomedicine and its Applications (NLPBA). Geneva, Switzerland. 2004.
  13. Sujan Kumar Saha *, Shashi Narayan, Sudeshna Sarkar, Pabitra Mitra,” A composite kernel for named entity recognition”, Pattern Recognition Letters (2010)
  14. Xiaofeng Yu,”Chinese Named Entity Recognition with Cascaded Hybrid model”,Proceedings of NAACL HLT 2007 Companion Volume, pp 197-200,April 2007
  15. Gjorgji Madzarov, Dejan Gjorgjevikj and Ivan Chorbev,” A Multi-class SVM Classifier Utilizing Binary Decision Tree”, Informatica 33 (2009) 233-241
  16. Samir AbdelRahman, Mohamed Elarnaoty, Marwa Magdy and Aly Fahmy,” Integrated Machine Learning Techniques for Arabic Named Entity Recognition”, IJCSI International Journal of Computer Science Issues, Vol. 7, Issue 4, No 3, July 2010
  17. Alireza Mansouri,Lilly Suriani Affendey,Ali Mamat,” Named Entity Recognitin Approaches”,International Journal of Computer Science and Network Security,Vol.8,No.2,Februart,2008.
Index Terms

Computer Science
Information Sciences

Keywords

Named Entity Recognition Parts-of-Speech Tagger Phrase chunker Compound word splitter