International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 37 - Number 11
Year of Publication: 2012
Authors: Nidhi, Vishal Gupta
DOI: 10.5120/4731-6925
Nidhi, Vishal Gupta. Algorithm for Punjabi Text Classification. International Journal of Computer Applications. 37, 11 (January 2012), 30-35. DOI=10.5120/4731-6925
Text mining extracts hidden, previously undiscovered, useful information from text documents in response to a user's query. Text classification is a text mining task that helps manage information efficiently by assigning documents to classes using classification algorithms. Any text classification method characterizes each document by a set of features, and these features should be relevant to the task at hand. Little work has been done on Punjabi text classification, and adequate annotated corpora are not yet available for Punjabi. This paper introduces preprocessing techniques and feature selection methods for Punjabi, together with a classification algorithm for Punjabi text documents.
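The general idea of characterizing each document by a set of features and then assigning it to a class can be sketched as follows. This is a minimal illustration only, not the algorithm proposed in the paper: it assumes whitespace tokenization, a hypothetical English stopword list, term-frequency features, and cosine similarity against one aggregated profile per class. The function names and the toy English training data are likewise assumptions made for the example; a Punjabi system would need language-specific preprocessing.

```python
from collections import Counter
import math

# Hypothetical stopword list, for illustration only.
STOPWORDS = {"the", "a", "is", "of", "in"}

def extract_features(text):
    """Lowercase, split on whitespace, drop stopwords, count the remaining terms."""
    tokens = [t for t in text.lower().split() if t not in STOPWORDS]
    return Counter(tokens)

def cosine(a, b):
    """Cosine similarity between two term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

def train(labelled_docs):
    """Build one profile per class by summing the term counts of its documents."""
    profiles = {}
    for text, label in labelled_docs:
        profiles.setdefault(label, Counter()).update(extract_features(text))
    return profiles

def classify(text, profiles):
    """Return the class whose profile is most similar to the document."""
    features = extract_features(text)
    return max(profiles, key=lambda label: cosine(features, profiles[label]))

# Toy training data (assumed, not taken from the paper).
TRAINING = [
    ("the team won the cricket match", "sports"),
    ("the batsman scored a century in the match", "sports"),
    ("the minister announced a new policy", "politics"),
    ("parliament passed the election bill", "politics"),
]

profiles = train(TRAINING)
label = classify("cricket match score", profiles)
```

Summing counts per class keeps the sketch short. A real system would typically weight or select features so that terms common to every class carry less influence.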