CFP last date
20 January 2025
Reseach Article

Rough Set and Entropy based Feature Selection for Online Forums Hotspot Detection

by K. Nirmala Devi, V. Murali Bhaskaran
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 117 - Number 10
Year of Publication: 2015
Authors: K. Nirmala Devi, V. Murali Bhaskaran
10.5120/20593-3087

K. Nirmala Devi, V. Murali Bhaskaran . Rough Set and Entropy based Feature Selection for Online Forums Hotspot Detection. International Journal of Computer Applications. 117, 10 ( May 2015), 37-41. DOI=10.5120/20593-3087

@article{ 10.5120/20593-3087,
author = { K. Nirmala Devi, V. Murali Bhaskaran },
title = { Rough Set and Entropy based Feature Selection for Online Forums Hotspot Detection },
journal = { International Journal of Computer Applications },
issue_date = { May 2015 },
volume = { 117 },
number = { 10 },
month = { May },
year = { 2015 },
issn = { 0975-8887 },
pages = { 37-41 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume117/number10/20593-3087/ },
doi = { 10.5120/20593-3087 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:59:02.501163+05:30
%A K. Nirmala Devi
%A V. Murali Bhaskaran
%T Rough Set and Entropy based Feature Selection for Online Forums Hotspot Detection
%J International Journal of Computer Applications
%@ 0975-8887
%V 117
%N 10
%P 37-41
%D 2015
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The exponential growth of web arouses much attention on public opinion. The rapid progress of online forums, micro blogs and new reports are having large volume of public opinion information. These are proving to be extremely valuable resources in helping to anticipate, detect and forecast societal events. But most of the online data is unstructured or semi structured and that is difficult to decipher automatically. Therefore, it is very much essential to analyze in time and understands the trends of their opinion correctly. The hotspot detection is one of the promising research areas in web mining and it helps to make appropriate decision in timely manner. Feature selection is an essential component in text categorization to identify the relevant features and reduces the dimensionality of data to gain the improved higher accuracy. The proposed system integrates rough set approach with entropy for detecting the online forums hotspot. The experimental results demonstrate that the proposed hybrid feature selection method outperforms with Naïve Bayes and Support Vector Machine based hotspot detection models.

References
  1. Bo-Tsuen Chen, Mu-Yen Chen, Hsiu-Sen Chiang and Chia-Chen Chen. 2011. Forecasting Stock Price Based on Fuzzy Time-Series with Entropy-Based Discretization Partitioning, Springer-Verlag Berlin, pp. 382–39.
  2. Richard Jensen and Qiang Shen. 2009. New Approaches to Fuzzy-Rough Feature Selection, IEEE Transactions on Fuzzy Systems, vol. 17, no. 4, pp. 824-838.
  3. Hameed. A,Qaheri, Hassanien A. E, and Abraham. A. 2008. A Generic Scheme for Generating Prediction Rules Using Rough Set, Neural Network World, vol. 18, no. 3, pp. 181-198.
  4. Wang Ruizhong. 2012. Analyses the Financial Data of Stocks Based Rough Set Theory, In Proceedings of Eighth International Conference on Computational Intelligence and Security, pp. 387-390.
  5. Francis E. H. Tay, Lixiang Shen. 2002. Economic and financial prediction using rough sets model, European Journal of Operational Research, vol. 141, pp. 641–659.
  6. Chung-Ho Su,Tai-Liang Chen, Ching-Hsue Cheng and Ya-Ching Chen. 2010. Forecasting the Stock Market with Linguistic Rules Generated from the Minimize Entropy Principle and the Cumulative Probability Distribution Approaches, Entropy, vol. 12,pp. 2397-2417.
  7. Salim Lahmiri. 2014. Entropy-Based Technical Analysis Indicators Selection for International Stock Markets Fluctuations Prediction Using Support Vector Machines, Fluctuation and Noise Letters, vol. 13, no. 2, pp. 1450013-1 to 1450013-16.
  8. Liu, B. 2012. Sentiment Analysis and Opinion Mining, Morgan & Claypool publishers, San Rafael, USA.
  9. Nirmala Devi. K and Murali Bhaskaran. V. Text Sentiment Computation for Online Forums Hotspot Detection, International Journal of Information and Communication Technology – Inderscience, 2015, in press .
  10. Peng. W. 2012. Predicting Collective Sentiment Dynamica from Time Series Social Media, in Proceedings of the Confrenece WISDOM'12.
  11. Liu, H. 2010. Internet public opinion hotspot detection and analysis based on Kmeans and SVM algorithm, in Proceedings of the Conference of Information Science and Management Engineering – ISME – 2010, pp. 257-261.
  12. Bun, K. K and Ishizuka, M. 2002. Topic extraction from news using TF * PDF algorithm, in Proceedings of the 3rd International conference on Web Information Systems Engineering, pp. 73-82.
  13. Chen,K. , Luesukprasert,L. and Chou, S. 2007. Hot topic extraction based on timeline analysis and multidimensional senetence modeling, IEEE Transactions on Knowledge and Data Engineering, pp. 1016-1025.
  14. Zhang, D. and Li, F. 2011. QuestionHolic: hot topic discovery and trend analysis in community question answering systems, Expert Systems with Applications, vol. 38, no. 6, pp. 6949-6855.
  15. Khoza, M. and Marwala, T. 2011. A rough set theory based predictive model for stock prices, in Proceedings of CINTI 12th IEEE International Symposium on Computational Intelligence and Informatics.
  16. Nirmala Devi,K. and Murali Bhaskaran, V. 2015. Forecasting Indian Stock Market Using Particle Swarm Optimization and Support Vector Machine, International Journal of Applied Engineering Research, vol. 10, no. 1, pp. 1891-1900.
  17. Pang,B. Lee,L. and Vaithyanathan, S. 2002. Thumbs Up? Sentiment classification using machine learning techniques, in Proceedings of the Conference on mperical methods in Natural Language Processing, pp. 79-86.
  18. Pawlak. Z. 1991. Rough Sets, Theoretical Aspects of Reasoning about Data, Dordrecht: Kluwer Academic.
  19. Z. Pawlak. Z, Grzymala-Busse. J, Slowinski. R and Ziarko. W. 1995. Rough sets, Communications of the ACM, vol. 38, no. II , pp. 89-95.
  20. Nirmala Devi K and Murali Bhaskaran V. Sentiment Analysis for Online Forums Hotspot Detection, ICTACT Journal on Soft Computing, vol. 2, no. 2, pp. 280-284, 2012.
  21. Nirmala Devi K and Murali Bhaskaran V. Online Forums Hotspot Prediction Based on Sentiment Analysis, Journal of Computer Science, vol. 8, no. 8, pp. 1219-1224, 2012.
  22. Digital Point Forums http:// forums. digitalpoint. com
Index Terms

Computer Science
Information Sciences

Keywords

Hotspot Opinion Sentiment Analysis Rough Set Entropy Naïve Bayes Support Vector Machine