Research Article

Recent Trends in Text Classification Techniques

by Nidhi, Vishal Gupta
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 35 - Number 6
Year of Publication: 2011
Authors: Nidhi, Vishal Gupta
DOI: 10.5120/4408-6125

Nidhi, Vishal Gupta. Recent Trends in Text Classification Techniques. International Journal of Computer Applications. 35, 6 (December 2011), 45-51. DOI=10.5120/4408-6125

@article{ 10.5120/4408-6125,
author = { Nidhi, Vishal Gupta },
title = { Recent Trends in Text Classification Techniques },
journal = { International Journal of Computer Applications },
issue_date = { December 2011 },
volume = { 35 },
number = { 6 },
month = { December },
year = { 2011 },
issn = { 0975-8887 },
pages = { 45-51 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume35/number6/4408-6125/ },
doi = { 10.5120/4408-6125 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:21:19.217817+05:30
%A Nidhi
%A Vishal Gupta
%T Recent Trends in Text Classification Techniques
%J International Journal of Computer Applications
%@ 0975-8887
%V 35
%N 6
%P 45-51
%D 2011
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Text mining is the discovery of valuable, yet hidden, information from text documents. Text classification (also called text categorization) is one of the important research issues in the field of text mining. The dramatic increase in the amount of content available in digital form has made managing this online textual data a problem. As a result, it has become necessary to classify (categorize) large numbers of text documents into specific classes. Text classification assigns a text document to one of a set of predefined classes. This paper covers different text classification techniques, along with classifier architecture and text classification applications.
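
To make the task concrete, the following is a minimal sketch of assigning a document to one of a set of predefined classes. It uses a multinomial Naïve Bayes classifier with add-one smoothing, one of the techniques listed in the keywords. The training sentences, the class labels `sports` and `politics`, and the function names `tokenize`, `train`, and `classify` are illustrative assumptions and are not taken from the paper.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase a document and split it on whitespace."""
    return text.lower().split()


def train(labeled_docs):
    """Collect per-class document counts, per-class word counts, and the vocabulary."""
    class_doc_counts = Counter()
    word_counts = defaultdict(Counter)
    vocabulary = set()
    for text, label in labeled_docs:
        class_doc_counts[label] += 1
        for word in tokenize(text):
            word_counts[label][word] += 1
            vocabulary.add(word)
    return class_doc_counts, word_counts, vocabulary


def classify(text, model):
    """Return the predefined class with the highest log posterior score."""
    class_doc_counts, word_counts, vocabulary = model
    total_docs = sum(class_doc_counts.values())
    best_label, best_score = None, -math.inf
    for label, doc_count in class_doc_counts.items():
        # Log prior: fraction of training documents in this class.
        score = math.log(doc_count / total_docs)
        total_words = sum(word_counts[label].values())
        for word in tokenize(text):
            if word not in vocabulary:
                continue  # skip words never seen during training
            # Add-one (Laplace) smoothing avoids zero probabilities.
            count = word_counts[label][word]
            score += math.log((count + 1) / (total_words + len(vocabulary)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Toy training set, invented purely for illustration.
TRAINING_DOCS = [
    ("the team won the football match", "sports"),
    ("the striker scored a late goal", "sports"),
    ("the election results were announced", "politics"),
    ("parliament passed the new budget", "politics"),
]

model = train(TRAINING_DOCS)
print(classify("a late goal won the match", model))
print(classify("the budget vote in parliament", model))
```

With this toy data, the first query is assigned to `sports` and the second to `politics`. A real system would typically add preprocessing and feature selection before the classifier; those steps are omitted here to keep the sketch short.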

Index Terms

Computer Science
Information Sciences

Keywords

KNN, Naïve Bayes, Support Vector Machine, Term Graph Model, Association Based Classification, Decision Tree Induction, Centroid based classification, Classification using neural network