Research Article

Part of Speech Tagger for Marathi Language

by Sharvari Govilkar, Bakal J. W, Shubhangi Rathod
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 119 - Number 18
Year of Publication: 2015
Authors: Sharvari Govilkar, Bakal J. W, Shubhangi Rathod
10.5120/21169-4245

Sharvari Govilkar, Bakal J. W, Shubhangi Rathod. Part of Speech Tagger for Marathi Language. International Journal of Computer Applications. 119, 18 (June 2015), 29-32. DOI=10.5120/21169-4245

@article{ 10.5120/21169-4245,
author = { Sharvari Govilkar and Bakal J. W and Shubhangi Rathod },
title = { Part of Speech Tagger for Marathi Language },
journal = { International Journal of Computer Applications },
issue_date = { June 2015 },
volume = { 119 },
number = { 18 },
month = { June },
year = { 2015 },
issn = { 0975-8887 },
pages = { 29-32 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume119/number18/21169-4245/ },
doi = { 10.5120/21169-4245 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:04:24.491697+05:30
%A Sharvari Govilkar
%A Bakal J. W
%A Shubhangi Rathod
%T Part of Speech Tagger for Marathi Language
%J International Journal of Computer Applications
%@ 0975-8887
%V 119
%N 18
%P 29-32
%D 2015
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Part of speech (POS) tagging is one of the most well-studied problems in the field of Natural Language Processing (NLP). POS tagging is the process of assigning the correct tag, such as noun, adjective, verb, or adverb, to each word of the input sentence. Disambiguation rules and a tagset are vital parts of a POS tagger. POS tagging is difficult for the Marathi language due to the unavailability of a corpus for computational processing. In this paper, a POS tagger for the Marathi language using a rule-based technique is presented. The proposed system finds the root word using a morphological analyzer and compares the root word with the corpus to assign the appropriate tag. If a word has been assigned more than one tag, the ambiguity is removed using grammar rules. Meaningful rules are provided to improve the performance of the system.
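
The pipeline described in the abstract (find the root word, look it up, then resolve multiple candidate tags with a rule) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the lexicon entries, the suffix list, the tag names, and the single disambiguation rule are all hypothetical placeholders chosen for the example, and simple suffix stripping stands in for a real morphological analyzer.

```python
# Toy lexicon mapping a root word to its candidate tags.
# ASSUMPTION: these entries and tag names are illustrative, not the paper's tagset.
LEXICON = {
    "घर": {"NOUN"},
    "मुलगा": {"NOUN"},
    "चांगला": {"ADJ"},
    "जातो": {"VERB"},
    "वर": {"NOUN", "POSTPOSITION"},  # deliberately ambiguous entry
}

# ASSUMPTION: a tiny hypothetical suffix list standing in for morphological analysis.
SUFFIXES = ["ात", "ला"]


def find_root(word):
    """Return the word itself if it is in the lexicon, else try stripping one suffix."""
    if word in LEXICON:
        return word
    for suffix in SUFFIXES:
        if word.endswith(suffix):
            candidate = word[: -len(suffix)]
            if candidate in LEXICON:
                return candidate
    return word


def disambiguate(candidates, previous_tag):
    """Pick one tag from several candidates using a toy context rule."""
    if len(candidates) == 1:
        return next(iter(candidates))
    # Hypothetical rule: an ambiguous word directly after a noun is a postposition.
    if "POSTPOSITION" in candidates and previous_tag == "NOUN":
        return "POSTPOSITION"
    # Deterministic fallback when no rule applies.
    return sorted(candidates)[0]


def tag_sentence(sentence):
    """Tag each whitespace-separated token; unknown roots get the tag UNKNOWN."""
    tagged = []
    previous_tag = None
    for token in sentence.split():
        root = find_root(token)
        candidates = LEXICON.get(root, {"UNKNOWN"})
        tag = disambiguate(candidates, previous_tag)
        tagged.append((token, tag))
        previous_tag = tag
    return tagged


if __name__ == "__main__":
    for token, tag in tag_sentence("मुलगा घरात जातो"):
        print(token, tag)
```

A real system would replace the suffix list with a proper morphological analyzer and the single rule with a set of grammar rules; the sketch only shows how the three stages fit together.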

Index Terms

Computer Science
Information Sciences

Keywords

Part of Speech (POS), Tagset, Tokenizer, Stemmer, Morphological analyzer, Disambiguation