CFP last date
20 December 2024
Reseach Article

Comparative Study of POS Taggers

Published on January 2015 by Aastha Gupta, Rachna Rajput, Richa Gupta, Monika Arora
Emerging Paradigms of Information and Communication Technologies and its Impact on Society
Foundation of Computer Science USA
EPICTIS2014 - Number 1
January 2015
Authors: Aastha Gupta, Rachna Rajput, Richa Gupta, Monika Arora
4cb5bb3d-7283-436f-bf67-418fde4cee0c

Aastha Gupta, Rachna Rajput, Richa Gupta, Monika Arora . Comparative Study of POS Taggers. Emerging Paradigms of Information and Communication Technologies and its Impact on Society. EPICTIS2014, 1 (January 2015), 19-25.

@article{
author = { Aastha Gupta, Rachna Rajput, Richa Gupta, Monika Arora },
title = { Comparative Study of POS Taggers },
journal = { Emerging Paradigms of Information and Communication Technologies and its Impact on Society },
issue_date = { January 2015 },
volume = { EPICTIS2014 },
number = { 1 },
month = { January },
year = { 2015 },
issn = 0975-8887,
pages = { 19-25 },
numpages = 7,
url = { /proceedings/epictis2014/number1/19425-3012/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 Emerging Paradigms of Information and Communication Technologies and its Impact on Society
%A Aastha Gupta
%A Rachna Rajput
%A Richa Gupta
%A Monika Arora
%T Comparative Study of POS Taggers
%J Emerging Paradigms of Information and Communication Technologies and its Impact on Society
%@ 0975-8887
%V EPICTIS2014
%N 1
%P 19-25
%D 2015
%I International Journal of Computer Applications
Abstract

POS Tagging provides important grammatical as well as contextual information for each word in the corpus. POS Tagging enables various companies to be able to track user reviews and can even be used for Speech Synthesis. In this paper, different POS Tagging Algorithms, namely, Memory-Based Learning Algorithm, Multi-Domain Web Based Algorithm and the Hybrid Model, will be compared on the basis of their execution time as well as efficiency. In Memory-Based Learning algorithm, the word to be tagged is searched in the lexicon using weighted similarity matrix, if an exact match is found, its lexical representation is retrieved, but, if it is not found, the lexical representation of its nearest neighbor is retrieved. Thus, the algorithm will not work efficiently for sparse data. On the other hand, Multi-Domain Web Based Algorithm is used to tag unknown words. The word is searched over the web for its possible tags. Due to the web search, runtime overhead is induced for each word. The tag with highest occurring probability is assigned to the word. The Hybrid Model executes Memory-Based Learning algorithm for known words and Multi-Domain Web Based Algorithm for unknown words.

References
  1. Aastha Gupta, Rachna Rajput, Richa Gupta and Monika Arora "Improved POS Tagging for Unknown Words ", International Journal of Soft Computing and Engineering, ISSN:2231-2307 Vol. 4,Issue-ICCIN-2k14, March 2014.
  2. Aastha Gupta, Rachna Rajput, Richa Gupta & Monika Arora, 2014,'Hybrid Model to Improve Time Complexity of Words Search in POS Tagging'. Paper presented at International Conference on Data Mining and Intelligent Computing,IEEE, Delhi, India
  3. Amit S. Chavan, Kartik R. Nayak, Keval D. Vora, Manish D. Purohit and Pramila M. Chawan, "A Comparison of Page Replacement Algorithms", ACSIT International Journal of Engineering and Technology, Vol. 3, No. 2, April 2011
  4. Antony P J and Dr. Soman K P, "Parts Of Speech Tagging for Indian Languages: A Literature Survey", International Journal of Computer Applications (0975 – 8887) Vol. 3, No. 8, November 2011.
  5. Ari Rappoport, Roi Reichart and Shulamit Umansky-Pesin, "A Multi DomainWeb-Based Algorithm for POS Tagging of Unknown Words", Coling 2010: Poster Volume, pages 1274–1282,Beijing, August 2010
  6. Debnath Bhattacharyya, Susmita Biswas and Tai-hoon Kim, "A Review on Natural Language Processing in Opinion Mining", International Journal of Smart Home Vol. 4, No. 2, April, 2010.
  7. Erik Cambria, Robert Speer, Catherine Havasi and Amir Hussain," SenticNet: A Publicly Available Semantic Resource for Opinion Mining", Commonsense Knowledge: Papers from the AAAI Fall Symposium (FS-10-02).
  8. Guido Minnen, Francis Bond and Ann Copestake, "Memory-Based Learning for Article Generation", in Proceedings Of CoNLL-2000 and LLL-2000, pages 43-48, Lisbon, Portugal, 2000.
  9. Hejab M. Alfawareh and Shaidah Jusoh, "Resolving Ambiguous Entity through Context Knowledge and Fuzzy Approach", International Journal on Computer Science and Engineering (IJCSE) ISSN : 0975-3397, Vol. 3 No. 1 Jan 2011
  10. Jakub Zavrel & walter Daelemans,"Recent Advances in Memory Based Part of Speech Tagging. ", VI Simposio Internacional de Comunicacion Social, Santiago de Cuba pp. 590-597, 1999
  11. Lars Bungum, Bjorn Gamback, "Evolutionary Algorithms in Natural Language Processing", Norwegian Artificial Intelligence Symposium, Gjøvik, 22 November 2010
  12. Mahesh T R, Suresh M B, M Vinayababu. "Text Mining: Advancements, Challenges And Future directions", International Journal of Reviews in Computing, ISSN: 2076-332, © 2009-2010 IJRIC& LLS.
  13. Omae Al-Harbi, Shaidah Jusoh and Norita Md Norwawi, "Lexical Disambiguation in Natural Language Questions (NLQs), IJCSI International Journal of Computer Science Issues, Vol. 8, Issue 4, No 2, July 2011 ISSN (Online): 1694-0814
  14. Parag Bhalchandra, Nilesh Deshmukh, Sakharam Lokhande, and Santosh Phulari, "A Comprehensive Note on Complexity Issues in Sorting Algorithms", Advances in Computational Research, ISSN: 0975–3273, Volume 1, Issue 2, 2009, pp-1-09
  15. Parmar Mitixa R, Prof. Arpit Rana, "A Survey on Opinion and Sentiment Analysis With Applications and Issues", International Journal of Computational Linguistics and Natural Language Processing ISSN 2279 – 0756, Vol. 2, Issue 1, January 2013
  16. Shaidah Jusoh and Hejab M. Alfawareh, "Techniques, Applications and Challenging Issue in Text Mining", IJCSI International Journal of Computer Science Issues, Vol. 9, Issue 6, No 2, November 2012 ISSN: 1694-0814
  17. Shaidah Jusoh and Hejab M. Al Fawareh, "Semantic Extraction from Texts", 2009 International Conference on Computer Engineering and Applications IPCSIT vol. 2 (2011) © (2011) IACSIT Press, Singapore
  18. Yin Shaohong and Fan Guidan, "Research of POS Tagging Rules Mining Algorithm", Proceedings of the 2nd International Conference on Computer Science and Electronics Engineering (ICCSEE 2013).
Index Terms

Computer Science
Information Sciences

Keywords

Pos Tagging Multi-domain Web Based Algorithm Memory Based Learning Algorithm Hybrid Model