International Conference on Innovations in Computing Techniques (ICICT 2015) |
Foundation of Computer Science USA |
ICICT2015 - Number 2 |
July 2015 |
Authors: Raja Mohana S.p, Umamaheshwari K., Karthiga R. |
Raja Mohana S.p, Umamaheshwari K., Karthiga R. . Sentiment Classification based on Latent Dirichlet Allocation. International Conference on Innovations in Computing Techniques (ICICT 2015). ICICT2015, 2 (July 2015), 14-16.
Opinion miningrefers to the use of natural language processing, text analysis and computational linguistics to identify and extract the subjective information. Opinion Mining has become an indispensible part of online reviews which is in the present scenario. In the field of information retrieval, a various kinds of probabilistic topic modeling techniques have been used to analyze contents present in a document. A topic model is a generative technique for document. All topic models share the idea that documents are having mixture of topics, and the topic is a probability distribution over words. Recently topic modeling techniques have been used to identify the meaningful review aspects, but existing topic models like Latent Dirichlet Markov Allocation (LDMA), hierarchical aspect sentiment model (HASM) do not identify aspect specific opinion words and also not suitable for shared features. In the proposed system, movie review dataset is collected from the IMDB database and is preprocessed. TF-IDF is calculated for the preprocessed data and result is given to LDA model which is then used to discover both the aspects and aspect specific opinion words. After that CHI value has been determined, SVM classifier is used to classify the topics preferable to each and every document.