CFP last date
20 December 2024
Reseach Article

Sentiment Classification of Hotel Reviews in Social Media with Decision Tree Learning

by Stanimira Yordanova, Dorina Kabakchieva
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 158 - Number 5
Year of Publication: 2017
Authors: Stanimira Yordanova, Dorina Kabakchieva
10.5120/ijca2017912806

Stanimira Yordanova, Dorina Kabakchieva . Sentiment Classification of Hotel Reviews in Social Media with Decision Tree Learning. International Journal of Computer Applications. 158, 5 ( Jan 2017), 1-7. DOI=10.5120/ijca2017912806

@article{ 10.5120/ijca2017912806,
author = { Stanimira Yordanova, Dorina Kabakchieva },
title = { Sentiment Classification of Hotel Reviews in Social Media with Decision Tree Learning },
journal = { International Journal of Computer Applications },
issue_date = { Jan 2017 },
volume = { 158 },
number = { 5 },
month = { Jan },
year = { 2017 },
issn = { 0975-8887 },
pages = { 1-7 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume158/number5/26901-2017912806/ },
doi = { 10.5120/ijca2017912806 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T00:03:59.474629+05:30
%A Stanimira Yordanova
%A Dorina Kabakchieva
%T Sentiment Classification of Hotel Reviews in Social Media with Decision Tree Learning
%J International Journal of Computer Applications
%@ 0975-8887
%V 158
%N 5
%P 1-7
%D 2017
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The aim of this paper is to present an approach for prediction of customer opinion, using supervised machine learning approach and Decision tree method for classification of online hotel reviews as positive or negative. The preliminary extraction and preparation of the data used in the research are described. Three classification models are generated for three different data sets - balanced and unbalanced training sets with two schemes of filtering frequent and infrequent words in the attribute list. The results from the classifier evaluation are compared and discussed. The three classification models are also applied on new unseen data for predicting opinion of hotel guests. The achieved results reveal that the most accurate prediction is achieved when applying the model generated from the balanced training set with filtering rare words.

References
  1. Andreas M. Kaplan, Michael Haenlein, 2010. Users of the world, unite! The challenges and opportunities of Social Media. Business Horizons (2010) 53, 59—68.
  2. Bing Liu. 2012 Sentiment Analysis and Opinion Mining, Morgan & Claypool Publishers, May 2012.
  3. Huifeng Tang, Songbo Tan, Xueqi Cheng, 2009. A survey on sentiment detection of reviews, Expert System with Applications 36 (2009) 10760-10773
  4. Gautami Tripathi, Naganna S.2015. Feature Selection and Classification Approach for Sentiment Analysis, Machine Learning and Applications: An International Journal (MLAIJ) Vol.2, No.2, June 2015
  5. M. Bilal, H. Israr, M. Shahid, A. Khan, 2016. Sentiment classification of Roman-Urdu opinions using Naïve Bayesian, Decision Tree and KNN classification techniques, Journal of King Saud University - Computer and Information Sciences, Volume 28, Issue 3, July 2016, Pages 330–344
  6. Vikram Elango and Govindrajan Narayanan. 2014. Sentiment Analysis for Hotel Reviews, http://cs229.stanford.edu/projects2014.html
  7. P.Kalaivani, K.L.Shunmuganathan, 2013. Sentiment Classification of Movie Reviews by Supervised Machine Learning Approaches. Vol. 4 No.4 Aug-Sep 2013. Indian Journal of Computer Science and Engineering (IJCSE)
  8. A. Sharma, S. Dey. 2012. Performance Investigation of Feature Selection Methods and Sentiment Lexicons for Sentiment Analysis, Special Issue of International Journal of Computer Applications (0975 – 8887) on Advanced Computing and Communication Technologies for HPC Applications - ACCTHPCA, June 2012
  9. H. Sui, C. Khoo, S. Chan. 2003. Sentiment Classification of Product Reviews Using SVM and Decision Tree Induction, 14th ASIS SIG/CR Classification Research Workshop, 2003
  10. B. Pang, L. Lee, and S. Vaithyanathan, 2002. “Thumbs up? sentiment classification using machine learning techniques.” in Proceedings of the ACL-02 conference on Empirical methods in natural language processing-Volume 10. Association for Computational Linguistics, 2002, pp.79–86.
  11. Rüdiger Wirth, Jochen Hipp, CRISP-DM: Towards a Standard Process Model for Data Mining, http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.198.5133&rep=rep1&type=pdf
  12. Kotu V., Deshpande B., 2015. Predictive Analytics and Data Mining. Concepts and Practice with RapidMiner, ISBN 978-0-12-801460-8
  13. M.F. Porter, 1980, An algorithm for suffix stripping, Program, 14(3) pp 130−137.https://tartarus.org/martin/PorterStemmer/
  14. Ian H. Witten, Eibe Frank, Mark A. Hall, 2011. Data Mining. Practical Machine Learning Tools and Techniques, Third Edition.
  15. Christopher D. Manning, Prabhakar Raghavan and Hinrich Schütze, 2008. Introduction to Information Retrieval, Cambridge University Press. 2008. http://www-nlp.stanford.edu/IR-book/
  16. Kelly A. McGuire, 2011. Do Guest Reviews Really Matter? Linking Social Media and Operations Data, Paper 381-2011, SAS Global Forum 2011 Travel, Hospitality and Entertainment
  17. Diane Korte, Thilini Ariyachandra, and Mark Frolick 2013. Business Intelligence in the Hospitality Industry, International Journal of Innovation, Management and Technology, Vol. 4, No. 4, August 2013
Index Terms

Computer Science
Information Sciences

Keywords

Sentiment classification supervised machine learning decision tree