CFP last date
20 January 2025
Reseach Article

JIBCA: Jaccard Index based Clustering Algorithm for Mining Online Review

by Nihalahmad R. Shikalgar, Arati M. Dixit
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 105 - Number 15
Year of Publication: 2014
Authors: Nihalahmad R. Shikalgar, Arati M. Dixit
10.5120/18454-9735

Nihalahmad R. Shikalgar, Arati M. Dixit . JIBCA: Jaccard Index based Clustering Algorithm for Mining Online Review. International Journal of Computer Applications. 105, 15 ( November 2014), 23-28. DOI=10.5120/18454-9735

@article{ 10.5120/18454-9735,
author = { Nihalahmad R. Shikalgar, Arati M. Dixit },
title = { JIBCA: Jaccard Index based Clustering Algorithm for Mining Online Review },
journal = { International Journal of Computer Applications },
issue_date = { November 2014 },
volume = { 105 },
number = { 15 },
month = { November },
year = { 2014 },
issn = { 0975-8887 },
pages = { 23-28 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume105/number15/18454-9735/ },
doi = { 10.5120/18454-9735 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:37:47.876410+05:30
%A Nihalahmad R. Shikalgar
%A Arati M. Dixit
%T JIBCA: Jaccard Index based Clustering Algorithm for Mining Online Review
%J International Journal of Computer Applications
%@ 0975-8887
%V 105
%N 15
%P 23-28
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Sentiment analysis, also known as opinion mining, is the analysis of the feelings (i. e. attitudes, emotions and opinions) behind the words. Sentiment analysis involves classifying the opinions as positive, negative, or neutral. Classification of textual objects in accordance with sentiment is considered to be a more difficult task than classification of textual objects in accordance with the content because opinions in natural language can be expressed in subtle and complex ways containing slang, ambiguity, sarcasm, irony, and idiom. This paper investigates the problem of sentiment analysis of online review. A Jaccard index based clustering algorithm (JIBCA) is proposed to support mining online reviews and predicting sales performance. The information gain is the change in information by considering number of datasets. The performance of information gain varies depending on the dataset. It is observed that the information gain performed better in JIBCA than existing methods for the movie review dataset. It is therefore recommended that JIBCA can be a good feature selection method for sentiment classification tasks. This paper also proposes a new approach for movie reviews classification based on extraction and analysis of appraisal groups such as action, thrill, comedy, and romantic.

References
  1. Xiaohui Yu, Yang Liu, Xiangji Huang, Aijun, "Mining Online Reviews for Predicting Sales Performance: A Case Study in the Movie Domain", A Knowledge and Data Engineering, IEEE Transactions on 24, 2012.
  2. Deng Bin, Peiji, Shao, Zhao Dan, "E-Commerce Reviews Management System Based on Online Customer Reviews Mining Innovative Computing & Communication", 2010Intl Conf on and Information Technology & Ocean Engineering, 2010 Asia-Pacific Conf on (CICC-ITOE)2010
  3. Soliman, T. H. A. Elmasry, M. A. Hedar, A. R. , Doss, M. M. , "Utilizing support vector machines in mining online customer reviews" Computer Theory and Applications (ICCTA), 2012 22nd International Conference on2012
  4. Peng Jiang, Chunxia Zhang, Hongping Fu, Zhen dong Niu, Qing Yang, "An Approach Based on Tree Kernels for Opinion Mining of Online Product Reviews" ,Data Mining (ICDM), 2010 IEEE 10th International Conference on2010
  5. Lai C. L. , Xu K. Q. , Lau R. Y. ,K. Yue feng Li, Dawei Song, "High-Order Concept Associations Mining and Inferential Language Modeling for Online Review Spam Detection", Data Mining Workshops (ICDMW), 2010 IEEE International Conference, on 2010
  6. Weishu Hu, Zhiguo Gong, Jingzhi Guoe ,"Mining Product Features from Online Reviews" , Business Engineering (ICEBE), 2010 IEEE 7th International Conference ,on 2010
  7. Samsudin N. , Puteh M. , Hamdan A. R. , "Bess or xbest: Mining the Malaysian online reviews" Data Mining and Optimization (DMO), 2011 3rd Conference ,on 2011
  8. XuXueke, Cheng Xueqi, Tan Songbo, Liu Yue, Shen Huawei, "Aspect-level opinion mining of online customer reviews Communications", China102013
  9. Lin E. , Shiaofen Fang, Jie Wang ,"Mining Online Book Reviews for Sentimental Clustering", Advanced Information Networking and Applications Workshops (WAINA), 2013 27th International Conference on 2013
  10. Anwer, N. Rashid, A. Hassan, "Feature based opinion mining of online free format customer reviews using frequency distribution and Bayesian statistics", Networked Computing and Advanced Information Management (NCM), 2010 Sixth International Conference on2010
  11. Algur S. P. , Patil A. P. , Hiremath P. S. , Shivashankar, "Conceptual level similarity measure based review spam detection", Signal and Image Processing (ICSIP), 2010 International Conference on 2010
  12. Ghose, A. Ipeirotis, P. G. ,"Estimating the Helpfulness and Economic Impact of Product Reviews: Mining Text and Reviewer Characteristics", Knowledge and Data Engineering, IEEE Transactions on 23, 2011
  13. Wenying Zheng, QiangYe, "Sentiment Classification of Chinese Traveler Reviews by Support Vector Machine Algorithm", Intelligent Information Technology Application, 2009. IITA 2009. Third International Symposium on32009
  14. Hai Wang, Shou hong Wang, "A Purchasing Sequences Data Mining Method for Customer Segmentation" Service Operations and Logistics, and Informatics, 2006. SOLI '06. IEEE International Conference on 2006
  15. Rahayu, D. A. , Krishnaswamy S. , Alahakoon O. , Labbe C. , "RnR: Extracting Rationale from Online Reviews and Ratings", Data Mining Workshops (ICDMW), 2010 IEEE International Conference on 2010
  16. S. Loster, M. Lofi, C. Balke, "Will I Like It? Providing Product Overviews Based on Opinion Excerpts Homoceanu", W-T Commerce and Enterprise Computing (CEC), 2011 IEEE 13th Conference on 2011.
  17. Li Shi, Luo Siqing ,"Improving the performance of features extraction from Chinese customer reviews" Communication Systems, Networks and Applications (ICCSNA), 2010 Second International Conference on 2010.
  18. Lingyan Ji, Hanxiao Shi, Mengli Li, Meng xia Cai, Peiqi Feng ,"Opinion mining of product reviews based on semantic role labeling", Computer Science and Education (ICCSE), 2010 5th International Conference on 2010.
  19. Bo Pang, Lillian Lee, "Opinion Mining and Sentiment Analysis", ACM Journal, Foundations and Trends in Information Retrieval, Volume 2 Issue 1-2, January, 2008.
  20. Minqing Hu, Bing Liu," Mining and summarizing customer reviews", Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, 2004
  21. Nihalahmad Shikalgar, Deepak Badgujar, "Online Review Mining for forecasting sales". In International Journal for research in Engineering & Technologies (IJRET), Volume 2 issue 12, December 2013.
  22. Nihalahmad Shikalgar, Arati Dixit, "Clustering & Regression based Sentiment Prediction for Online data. In Third post-graduation conference for Computer Engineering, held at MCOERC, Nashik, March 2014.
  23. Nihalahmad Shikalgar, Arati Dixit, "Prediction of Appraisal groups for movie review analysis". International Journal of Science & research (IJSR), Volume 3 issue 6, June 2014.
  24. http://www. rottentomatoes. com, accessed on dated 15 January, 2014.
Index Terms

Computer Science
Information Sciences

Keywords

Sentiment Analysis Review Classification Opinion Mining Review Mining Prediction.