CFP last date
20 February 2025
Reseach Article

A Survey on Various Features and Techniques of Text Content Classification

by Vishal Sahu, Vivek Kumar
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 178 - Number 9
Year of Publication: 2019
Authors: Vishal Sahu, Vivek Kumar
10.5120/ijca2019918799

Vishal Sahu, Vivek Kumar . A Survey on Various Features and Techniques of Text Content Classification. International Journal of Computer Applications. 178, 9 ( May 2019), 13-15. DOI=10.5120/ijca2019918799

@article{ 10.5120/ijca2019918799,
author = { Vishal Sahu, Vivek Kumar },
title = { A Survey on Various Features and Techniques of Text Content Classification },
journal = { International Journal of Computer Applications },
issue_date = { May 2019 },
volume = { 178 },
number = { 9 },
month = { May },
year = { 2019 },
issn = { 0975-8887 },
pages = { 13-15 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume178/number9/30557-2019918799/ },
doi = { 10.5120/ijca2019918799 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T00:49:54.547554+05:30
%A Vishal Sahu
%A Vivek Kumar
%T A Survey on Various Features and Techniques of Text Content Classification
%J International Journal of Computer Applications
%@ 0975-8887
%V 178
%N 9
%P 13-15
%D 2019
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Traditional information retrieval methods become inadequate for increasing vast amount of data. Without knowing what could be in the documents; it is difficult to formulate effective queries for analyzing and extracting useful information from the data. This survey focused on some of the present strategies used for filtering documents. Starting with different types of text features this paper has discussed about recent developments in the field of classification of text documents. This paper gives a concise study of methods proposed by different researchers. Here various pre-processing steps were also discussed with a comprehensive and comparative understanding of existing literature.

References
  1. Brindha, S., Sukumaran, S., & Prabha, K. (2016). A survey on classification techniques for text mining. Proceedings of the 3rd International Conference on Advanced Computing and Communication Systems. IEEE. Coimbatore, India.
  2. Vasa, K. (2016). Text classification through statistical and machine learning methods: A survey. International Journal of Engineering Development and Research, 4, 655-658.
  3. Farman Alia, Kyung-Sup Kwaa,Yong-Gi Kimb,” Opinion mining based on fuzzy domain ontology and Support Vector Machine: A proposal to automate online review classification”, Applied Soft Computing-2016.
  4. Isidro Peñalver-Martinez, Francisco GarciaSanchez, Rafael Valencia-Garcia,” Featurebased opinion mining through ontologies”, Expert Systems with Applications-2014.
  5. RuiXia,FengXu,JianfeiYu,” Polarity shift detection, elimination and ensemble: A three stage model for document-level sentiment analysis” Information Processing and Management 52 (2016) 36– 45.
  6. Wanxiang Che, Yanyan Zhao, Honglei Guo, Zhong Su, and Ting Liu,” Sentence Compression for spect-Based Sentiment Analysis” IEEE/ACM transactions on audio, speech, and language processing, vol. 23, no. 12, December 2015.
  7. Selma Ayşe Özel. Esra Saraç “ Web Page Classification Using Firefly Optimization “, 978-1-4799-0661-1/13/$31.00 ©2013 Ieee.
  8. Gongde Guo, Hui Wang, David Bell, Yaxin Bi and Kieran Greer, “KNN Model-Based Approach in Classification”, Proc. ODBASE pp- 986 – 996, 2003
  9. Eiji Aramaki and Kengo Miyo, “Patient status classification by using rule based sentence extractionand bm25-knn based classifier”, Proc. of i2b2 AMIA workshop, 2006.
  10. SHI Yong-feng, ZHAO, “Comparison of text categorization algorithm”, Wuhan university Journal of natural sciences. 2004.
  11. Joachims, T. “Text categorization with support vector machines: learning with many relevant features”. In Proceedings of ECML-98, 10th European Conference on Machine Learning (Chemnitz, DE), pp. 137–142 1998.
  12. MIgual E .Ruiz, Padmini Srinivasn, “Automatic Text Categorization Using Neural networks”, Advaces in Classification Research, Volume VIII.
  13. Yiming Yang Christopher G. Chute “A Linear Least Squares Fit Mapping Method For Information Retrieval From Natural Language Texts” Acres De Coling-92 Nantes, 23-28 AOUT 1992
  14. B S Harish, D S Guru, S Manjunath ” Representation and Classification of Text Documents: A Brief Review” IJCA Special Issue on “Recent Trends in Image Processing and Pattern Recognition”RTIPPR, 2010.
  15. Seyyed Mohammad Hossein Dadgar et al “A Novel Text Mining Approach Based on TF-IDF and Support Vector Machine for News Classification” 2nd IEEE International Conference on Engineering and Technology (ICETECH), 17th& 18thMarch 2016.
  16. Adel Hamdan Mohammad et al “Arabic Text Categorization Using Support vector machine, Naïve Bayes and Neural Network” GSTF Journal on Computing (JOC) ,Volume 5, Issue 1; 2016 pp. 108-115.
  17. Omar Al-Momani, Tariq Alwada et al. “Arabic Text Categorization using k-nearest neighbour, Decision Trees (C4.5) and Rocchio Classifier: A Comparative Study” International Journal of Current Engineering and Technology 2016.
  18. E Jadon, R Sharma et al. “Data Mining: Document Classification using Naive Bayes Classifier” International Journal of Computer Applications (0975 – 8887) Volume 167 – No.6, June 2017.
  19. Alan Díaz-Manríquez , Ana Bertha Ríos-Alvarado, José Hugo Barrón-Zambrano, Tania Yukary Guerrero-Melendez, And Juan Carlos Elizondo-Leal. “An Automatic Document Classifier System Based on Genetic Algorithm and Taxonomy”. accepted March 9, 2018, date of publication March 15, 2018, date of current version May 9, 2018.
Index Terms

Computer Science
Information Sciences

Keywords

Content filtering Fake Profile Online Social Networks Spam Detection.