Research Article

Support Vector Machines based Part of Speech Tagging for Nepali Text

by Tej Bahadur Shahi, Tank Nath Dhamala, Bikash Balami
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 70 - Number 24
Year of Publication: 2013
Authors: Tej Bahadur Shahi, Tank Nath Dhamala, Bikash Balami
10.5120/12217-8374

Tej Bahadur Shahi, Tank Nath Dhamala, Bikash Balami. Support Vector Machines based Part of Speech Tagging for Nepali Text. International Journal of Computer Applications. 70, 24 (May 2013), 38-42. DOI=10.5120/12217-8374

@article{ 10.5120/12217-8374,
author = { Tej Bahadur Shahi, Tank Nath Dhamala, Bikash Balami },
title = { Support Vector Machines based Part of Speech Tagging for Nepali Text },
journal = { International Journal of Computer Applications },
issue_date = { May 2013 },
volume = { 70 },
number = { 24 },
month = { May },
year = { 2013 },
issn = { 0975-8887 },
pages = { 38-42 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume70/number24/12217-8374/ },
doi = { 10.5120/12217-8374 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:33:44.577916+05:30
%A Tej Bahadur Shahi
%A Tank Nath Dhamala
%A Bikash Balami
%T Support Vector Machines based Part of Speech Tagging for Nepali Text
%J International Journal of Computer Applications
%@ 0975-8887
%V 70
%N 24
%P 38-42
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Optimal part-of-speech tagging is of great importance in various fields of natural language processing, such as machine translation, information extraction, word sense disambiguation, and speech recognition. Owing to the special nature of the Nepali language, the tagset used, and the size of the corpus (training data), building an accurate part-of-speech tagger is a challenging task. This study aims to build an analytical machine learning model from which the attainable accuracy can be determined. To this end, a support vector machine based part-of-speech tagger has been developed and tested on various input instances to verify its accuracy. The SVM tagger constructs a feature vector for each word in the input and classifies the word into one of two classes (one vs. rest).
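
The one-vs-rest scheme mentioned in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the toy corpus (romanized Nepali with coarse tags), the feature set in `features`, and the hyperparameters are all made up for the example, and the binary SVMs are fitted with plain sub-gradient descent on the hinge loss rather than with any particular SVM package.

```python
from collections import defaultdict

# Made-up toy corpus (romanized Nepali, coarse tags); NOT the paper's data or tagset.
TRAIN = [
    (["ma", "ghar", "janchu"], ["PRON", "NOUN", "VERB"]),
    (["u", "bhat", "khancha"], ["PRON", "NOUN", "VERB"]),
    (["ma", "bhat", "khanchu"], ["PRON", "NOUN", "VERB"]),
]

def features(words, i):
    """Sparse binary features for the word at position i (hypothetical feature set)."""
    w = words[i]
    prev_w = words[i - 1] if i > 0 else "<s>"
    next_w = words[i + 1] if i + 1 < len(words) else "</s>"
    return ["bias", "w=" + w, "suf2=" + w[-2:], "prev=" + prev_w, "next=" + next_w]

def train_binary_svm(examples, epochs=50, eta=0.1, lam=0.01):
    """Linear SVM fitted by sub-gradient descent on the L2-regularized hinge loss."""
    weights = defaultdict(float)
    for _ in range(epochs):
        for feats, y in examples:  # y is +1 (this tag) or -1 (any other tag)
            margin = y * sum(weights[f] for f in feats)
            for f in weights:  # shrink all weights (L2 regularization step)
                weights[f] *= 1.0 - eta * lam
            if margin < 1.0:  # hinge loss is active: move toward the example
                for f in feats:
                    weights[f] += eta * y
    return weights

def train_one_vs_rest(corpus):
    """Train one binary SVM per tag: that tag is +1, every other tag is -1."""
    data = [(features(ws, i), t) for ws, ts in corpus for i, t in enumerate(ts)]
    tags = sorted({t for _, t in data})
    return {tag: train_binary_svm([(f, 1 if t == tag else -1) for f, t in data])
            for tag in tags}

def tag_sentence(models, words):
    """Give each word the tag whose binary classifier scores it highest."""
    result = []
    for i in range(len(words)):
        feats = features(words, i)
        result.append(max(models, key=lambda tag: sum(models[tag].get(f, 0.0) for f in feats)))
    return result

models = train_one_vs_rest(TRAIN)
print(tag_sentence(models, ["ma", "bhat", "khanchu"]))
```

Each tag gets its own binary classifier, and the final tag is the one whose classifier returns the largest score. A real tagger would use a much richer feature window and a large annotated corpus.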

Index Terms

Computer Science
Information Sciences

Keywords

Support Vector Machine, POS Tagging, HMM, Supervised Machine Learning