We apologize for a recent technical issue with our email system, which temporarily affected account activations. Accounts have now been activated. Authors may proceed with paper submissions. PhDFocusTM
CFP last date
20 December 2024
Reseach Article

An Improved Sentiment Classification using Lexicon into SVM

by S. S. K. Rastogi, Rohit Singhal, Anil Kumar
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 95 - Number 1
Year of Publication: 2014
Authors: S. S. K. Rastogi, Rohit Singhal, Anil Kumar
10.5120/16562-6226

S. S. K. Rastogi, Rohit Singhal, Anil Kumar . An Improved Sentiment Classification using Lexicon into SVM. International Journal of Computer Applications. 95, 1 ( June 2014), 37-42. DOI=10.5120/16562-6226

@article{ 10.5120/16562-6226,
author = { S. S. K. Rastogi, Rohit Singhal, Anil Kumar },
title = { An Improved Sentiment Classification using Lexicon into SVM },
journal = { International Journal of Computer Applications },
issue_date = { June 2014 },
volume = { 95 },
number = { 1 },
month = { June },
year = { 2014 },
issn = { 0975-8887 },
pages = { 37-42 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume95/number1/16562-6226/ },
doi = { 10.5120/16562-6226 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:18:20.697667+05:30
%A S. S. K. Rastogi
%A Rohit Singhal
%A Anil Kumar
%T An Improved Sentiment Classification using Lexicon into SVM
%J International Journal of Computer Applications
%@ 0975-8887
%V 95
%N 1
%P 37-42
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

With the emergence of web 2. 0 and availability of huge amount of digital data on the social web, people always want to discover unknown, to predict events that could occur, and the procedure on how it works and change over time. Similarly, sentiment analysis is related with the automatic extraction of sentiment information from textual data available at various social webs. While most sentiment analysis deals commercial jobs like fetching opinions from product reviews, there is significant growth in social web and it becomes a source to promote various products. This is actual reason why most of the commercial web support login through social web like facebook, twitter. There are two approaches to sentiment analysis. First one is based on lexicon and second is machine learning. It has been proved that machine learning approach performs better than lexicon based approaches but it ignores the knowledge encoded in sentiment lexicons. This paper describes a method that includes sentiment lexicons as prior information to SVM approach, a machine learning techniques, to improve the accuracy of sentiment analysis. It also describes a technique to automatically create domain specific sentiment lexicons for this learning purpose. A result shows that the domain specific lexicons lead to a significant accuracy improvement for sentiment analysis.

References
  1. G. Vural, B. B. Cambazoglu, P. Senkul, and O. Tokgoz. A framework for sentiment analysis in Turkish: Application to polarity detection of movie reviews in Turkish, Computer and Information Sciences III, pp 437-445, 2013.
  2. Cuneyd M. O. and Arzucan O. Word Polarity Detection Using a Multilingual Approach. Springer-Verlag Berlin Heidelberg, CICLing 2013, Part II, LNCS 7817, pp. 75–82, 2013.
  3. Thelwall, M. , & Buckley, K. Topic-based sentiment analysis for the Social Web: The role of mood and issue-related words. Journal of the American Society for Information Science and Technology, 64(8), 1608–1617, 2013.
  4. Erik T. , Mykola P. RBEM: A Rule Based Approach to Polarity Detection. In Proceedings of WISDOM'13, August 11, Chicago, USA, 2013.
  5. Rodrigo D. F. , Gaston L. and Juan D. V. Sentiment Polarity of Trends on the Web Using Opinion Mining and Topic Modeling. In the proceeding of web intelligence constorium chile research center, USA, 2013.
  6. E. Cambria, B. Schuller, Y. Xia, and C. Havasi. New avenues in opinion mining and sentiment analysis. Intelligent Systems, IEEE, 28(2):15-21, 2013.
  7. D. Bespalov, Y. Qi, B. Bai, and A. Shokoufandeh. Sentiment classification with supervised sequence encoder. In Proceedings of European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD), volume LNCS 7523, pages 159-174. Springer, 2012.
  8. Mike Thelwall, Kevan Buckley, and Georgios Paltoglou, Sentiment Strength Detection for the Social Web. In the Journal of the american society for information science and technology, 63(1):163–173, 2012.
  9. Thelwall, M. , Buckley, K. , & Paltoglou, G. Sentiment strength detection for the social Web, Journal of the American Society for Information Science and Technology, 63(1), 163-173, 2012.
  10. G. Paltoglou and M. Thelwall. Twitter, myspace, digg: Unsupervised sentiment analysis in social media. Volume 3, pages 66:1-66:19, New York, USA,ACM, 2012.
  11. E. Cambria, D. Olsher, and K. Kwok. Sentic activation: A two-level affective common sense reasoning framework. In Proceedings of AAAI, pages 186-192, 2012.
  12. X. Glorot, A. Bordes, and Y. Bengio. Domain adaptation for large-scale sentiment classification: A deep learning approach. In Proceedings of the 28th International Conference on Machine Learning, ICML2011, pages 513-520, 2011.
  13. Hassan, A. , Abu-Jbara, A. , Jha, R. , Radev, D. : Identifying the Semantic Orientation of Foreign Words. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, vol. 2, pp. 592–597, 2011.
  14. Thelwall, M. , Buckley, K. , & Paltoglou, G. Sentiment in Twitter events. Journal of the American Society for Information Science and Technology, 62(2), 406–418, 2011.
  15. K. Veselovská. Sentence-Level Polarity Detection in a Computer Corpus. In the Proceedings of WDS'11, Part I, 167–170, 2011.
  16. Taboada, M. , Brooke, J. ,Tofiloski, M. ,Voll, K. ,&Stede, M. Lexicon based methods for sentiment analysis. Computational Linguistics, 37(2), 267–307, 2011.
  17. E. Tromp and M. Pechenizkiy. Graph-based n-gram language identification on short texts. In Proceedings of the 20th Machine Learning conference of Belgium and The Netherlands, pages 27-34, 2011.
  18. E. Tromp and M. Pechenizkiy. Senticorr: Multilingual sentiment analysis of personal correspondence. In Proceedings of IEEE ICDM 2011 Workshops, pages 470-479. IEEE, 2011.
  19. Thelwall, M. , Buckley, K. , Paltoglou, G. , Cai, D. , & Kappas, A. Sentiment strength detection in short informal text. Journal of the American Society for Information Science and Technology, 61(12), 2544–2558, 2010.
  20. Taboada, M. , Lu,Y. ,Kong, X. , Quan, X. , Liu,W. ,& Xu,Y. Exploring the sentiment strength of user reviews. Lecture Notes in Computer Science, 6184/2010, 471–482, 2010.
  21. A. Pak and P. Paroubek. Twitter as a corpus for sentiment analysis and opinion mining. In Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10), pages 1320-1326, 2010.
  22. Haccianella, S. , Esuli, A. , & Sebastiani, F. SentiWordNet 3. 0: An enhanced lexical resource for sentiment analysis and opinion mining. In Proceedings of the Seventh conference on International Language Resources and Evaluation, 2010.
  23. M. Mathioudakis and N. Koudas, "Twittermonitor: trend detection over the twitter stream," in Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data, ser. SIGMOD '10. Indianapolis, Indiana, USA: ACM, pp. 1155–1158, 2010.
  24. Hassan, A. , Qazvinian, V. , Radev, D. : What's with the Attitude? Identifying Sentences with Attitude in Online Discussions. In: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pp. 1245–1255, 2010.
  25. Hassan, A. , Radev, D. : Identifying Text Polarity Using Random Walks. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp. 395–403, 2010.
  26. Balahur, A. , Kozareva, Z. , & Montoyo, A. Determining the polarity and source of opinions expressed in political debates. Lecture Notes in Computer Science, 5449, 468–480, 2009.
  27. V. Sindhwani and P. Melville. Document-word co-regularization for semi-supervised sentiment analysis. In Eighth IEEE International Conference on Data Mining (ICDM'08), pages 1025-1030, 2008.
  28. Ann Devitt and Khurshid Ahmad. Sentiment Polarity Identification in Financial News: A Cohesion-based Approach. In the Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pages 984–991, Prague, Czech Republic, June 2007.
  29. Turney, P. D. Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL '02) (pp. 417–424). Stroudsburg, PA: Association for Computational Linguistics. 2002.
Index Terms

Computer Science
Information Sciences

Keywords

Lexicon Approach Sentiment analysis Polarity Detection