Research Article

Clustering Technique for Feature Segregation in Opinion Analysis

by Tanvir Ahmad
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 76 - Number 17
Year of Publication: 2013
Authors: Tanvir Ahmad
10.5120/13343-0924

Tanvir Ahmad. Clustering Technique for Feature Segregation in Opinion Analysis. International Journal of Computer Applications. 76, 17 (August 2013), 43-49. DOI=10.5120/13343-0924

@article{ 10.5120/13343-0924,
author = { Tanvir Ahmad },
title = { Clustering Technique for Feature Segregation in Opinion Analysis },
journal = { International Journal of Computer Applications },
issue_date = { August 2013 },
volume = { 76 },
number = { 17 },
month = { August },
year = { 2013 },
issn = { 0975-8887 },
pages = { 43-49 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume76/number17/13343-0924/ },
doi = { 10.5120/13343-0924 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:48:43.162006+05:30
%A Tanvir Ahmad
%T Clustering Technique for Feature Segregation in Opinion Analysis
%J International Journal of Computer Applications
%@ 0975-8887
%V 76
%N 17
%P 43-49
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The World Wide Web (WWW) is a reservoir of an enormous amount of data, most of it embedded in unstructured text documents. E-commerce websites, social networking sites, and discussion forums have become common places for writing informal opinions about products and related information. A substantial amount of research has been directed towards mining these texts to determine the overall opinion of users and to assign a grade to the products under discussion. These grading systems often help users form an informed opinion about the products they want to buy. Developers of opinion websites have adopted different techniques to give end users an overall sense of the contents, such as numerical ratings on a predefined scale, star ratings, and the percentage of users who are satisfied or dissatisfied with a product. However, none of these methods segregates features on the basis of the opinions expressed about them, or clusters them into groups that give a general insight into the features grouped together. This paper presents a framework that first extracts features, modifiers, and opinions from the dataset and then uses a clustering mechanism to divide the features into discrete clusters on the basis of users' opinions, such that the intra-cluster similarity between features is high whereas the inter-cluster similarity is very low.
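The abstract does not specify the scoring or clustering algorithm, so the following is only a minimal sketch of the general idea, not the paper's method. It assumes that (feature, modifier, opinion) triples have already been extracted, scores each feature with a hypothetical polarity lexicon and hypothetical modifier weights, and groups features with a plain one-dimensional k-means. The names `OPINION_POLARITY`, `MODIFIER_WEIGHT`, `feature_scores`, and `kmeans_1d` are illustrative assumptions.

```python
from collections import defaultdict

# Hypothetical lexicon and modifier weights (assumptions, not from the paper).
OPINION_POLARITY = {"good": 1.0, "excellent": 1.0, "average": 0.0,
                    "poor": -1.0, "bad": -1.0}
MODIFIER_WEIGHT = {"": 1.0, "very": 1.5, "slightly": 0.5}


def feature_scores(triples):
    """Average the modifier-weighted opinion polarity for each feature."""
    per_feature = defaultdict(list)
    for feature, modifier, opinion in triples:
        weight = MODIFIER_WEIGHT.get(modifier, 1.0)
        polarity = OPINION_POLARITY.get(opinion, 0.0)
        per_feature[feature].append(weight * polarity)
    return {f: sum(v) / len(v) for f, v in per_feature.items()}


def kmeans_1d(scores, k, iterations=20):
    """Group features by score using a simple 1-D k-means."""
    values = sorted(scores.values())
    # Deterministic start: centroids spread evenly over the sorted scores.
    step = max(k - 1, 1)
    centroids = [values[i * (len(values) - 1) // step] for i in range(k)]
    clusters = [[] for _ in range(k)]
    for _ in range(iterations):
        clusters = [[] for _ in range(k)]
        for feature, score in scores.items():
            nearest = min(range(k), key=lambda c: abs(score - centroids[c]))
            clusters[nearest].append(feature)
        # Recompute each centroid; keep the old one if a cluster is empty.
        centroids = [
            sum(scores[f] for f in members) / len(members) if members
            else centroids[i]
            for i, members in enumerate(clusters)
        ]
    return clusters


if __name__ == "__main__":
    # Toy triples standing in for the output of the extraction step.
    triples = [
        ("battery", "very", "good"),
        ("battery", "", "excellent"),
        ("screen", "", "good"),
        ("camera", "", "average"),
        ("price", "very", "poor"),
        ("keypad", "", "bad"),
    ]
    for members in kmeans_1d(feature_scores(triples), k=3):
        print(sorted(members))
```

With three clusters, features that reviewers describe in similar terms end up together, which is the intra-cluster similarity the abstract refers to. A real system would likely cluster on richer feature vectors than a single averaged score.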

Index Terms

Computer Science
Information Sciences

Keywords

Pattern Recognition, Feature Extraction, Clustering Technique