Opinion Mining of Twitter Data using Hadoop and Apache Pig

Anjali Barskar; Ajay Phulre

Call for Paper

August Edition

IJCA solicits high quality original research papers for the upcoming August edition of the journal. The last date of research paper submission is 21 July 2025

Submit your paper

Know more

The week's pick

FORENSIC ANALYSIS FRAMEWORKS FOR ENCRYPTED CLOUD STORAGE INVESTIGATIONS

Joy Awoleye Sarah Mavire Allan Munyira Kelvin Magora

Random Articles

Impact of using Snowflake Schema and Bitmap Index on Data Warehouse Querying

Jan

2018

Customer Complain Detection in E-commerce Platforms using NLP

Dec

2022

Comparative Analysis of Search Algorithms

Jun

2018

Enhanced HMM Speech Emotion Recognition using SVM and Neural Classifier

February

2014

Reseach Article

Opinion Mining of Twitter Data using Hadoop and Apache Pig

by Anjali Barskar, Ajay Phulre

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 158 - Number 9

Year of Publication: 2017

Authors: Anjali Barskar, Ajay Phulre

10.5120/ijca2017912854

Anjali Barskar, Ajay Phulre . Opinion Mining of Twitter Data using Hadoop and Apache Pig. International Journal of Computer Applications. 158, 9 ( Jan 2017), 1-6. DOI=10.5120/ijca2017912854

@article{ 10.5120/ijca2017912854,

author = { Anjali Barskar, Ajay Phulre },

title = { Opinion Mining of Twitter Data using Hadoop and Apache Pig },

journal = { International Journal of Computer Applications },

issue_date = { Jan 2017 },

volume = { 158 },

number = { 9 },

month = { Jan },

year = { 2017 },

issn = { 0975-8887 },

pages = { 1-6 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume158/number9/26933-2017912854/ },

doi = { 10.5120/ijca2017912854 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-07T00:04:21.426902+05:30

%A Anjali Barskar

%A Ajay Phulre

%T Opinion Mining of Twitter Data using Hadoop and Apache Pig

%J International Journal of Computer Applications

%@ 0975-8887

%V 158

%N 9

%P 1-6

%D 2017

%I Foundation of Computer Science (FCS), NY, USA

Abstract

Twitter, one of the largest and famous social media site receives millions of tweets every day on variety of important topic. This large amount of raw data can be used for industrial , Social, Economic, Government policies or business purpose by organizing according to our need and processing. Hadoop is one of the best tool options for twitter data analysis and hadoop works for distributed Big data , Streaming data , Time Stamped data , text data etc. This paper discuss how to use FLUME for extracting twitter data and store it into HDFS for opinion mining because twitter contains variety of opinions on various topics so we have to analyse these opinions using hadoop and its ecosystems to check every tweets polarity either tweets contains positive ,negative or neutral opinions on particular topic. This paper provides an efficient mechanism to perform opinion mining by coming up with a finish to finish pipeline with the assistance of Apache Flume ,Apache HDFS, and Apache Pig. Here we have used dictionary based approach for analysis for which we have implemented pig statements through which we can analysis these complex twitter data to check polarity of the tweets based on the polarity dictionary through which we can say that which tweets have negative opinion or positive opinion.

References

Marco Furini, Manuela Montangero, “TSentiment: On Gamifying Twitter Sentiment Analysis”, IEEE ISCC 2016 Workshop: DENVECT, IEEE 2016, ISSN: 978-1-5090-0679-3/16.
Rahul Kumar Chawda, Dr. Ghanshyam Thakur, “Big Data and Advanced Analytics Tools”, 2016 Symposium on Colossal Data Analysis and Networking (CDAN), IEEE 2016, ISSN: 978-1-5090-0669-4/16.
Mahalakshmi R, Suseela S , “Big-SoSA:Social Sentiment Analysis and Data Visualization on Big Data”, International Journal of Advanced Research in Computer and Communication Engineering, Vol. 4, Issue 4, April 2015 , pp 304-306, ISSN : 2278-1021.
Manoj Kumar Danthala, “Tweet Analysis: Twitter Data processing Using Apache Hadoop”, International Journal Of Core Engineering & Management (IJCEM) Volume 1, Issue 11, February 2015, pp 94-102.
Manoj Kumar Danthala, “Bigdata Analysis: Streaming Twitter Data with Apache Hadoop and Visualizing using BigInsights”, International Journal of Engineering Research & Technology, Volume. 4 - Issue. 05 , May – 2015.
Judith Sherin Tilsha S , Shobha M S, “A Survey on Twitter Data Analysis Techniques to Extract Public Opinion”, International Journal of Advanced Research in Computer Science and Software Engineering, Volume 5, Issue 11, November 2015, pp 536-540.
Mr. Sagar Nadagoud, Mr. Kotresh Naik.D, “Market Sentiment Analysis for Popularity of Flipkart ”, International Journal of Advanced Research in Computer Engineering & Technology (IJARCET), Volume4Issue5,May2015,pp 2117-2123.
Ramesh R, Divya G, Divya D, Merin K Kurian , “Big Data Sentiment Analysis using Hadoop “, (IJIRST )International Journal for Innovative Research in Science & Technology,Volume 1 , Issue 11 , April 2015 ISSN : 2349-6010
Sunil B. Mane , Sunil B. Mane, Yashwant Sawant, Saif Kazi, Vaibhav Shinde , “Real Time Sentiment Analysis of Twitter Data Using Hadoop”, (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 5 (3) , 2014, 3098 – 3100 , ISSN:0975-9646.
Praveen Kumar, Dr Vijay Singh Rathore,” Efficient Capabilities of Processing of Big Data using Hadoop Map Reduce”, International Journal of Advanced Research in Computer and Communication Engineering Vol. 3, Issue 6, June 2014, pp 7123-7126.
G.Vinodhini , RM.Chandrasekaran, “Sentiment Analysis and Opinion Mining: A Survey” , International Journal of Advanced Research in Computer Science and Software Engineering, Volume 2, Issue 6, June 2012 ISSN: 2277 128X.
Aditya B. Patel, Manashvi Birla, Ushma Nair, "Addressing Big Data Problem Using Hadoop and Map Reduce",6-8 Dec. 2012.
Michael G. Noll, Applied Research, Big Data, Distributed Systems, Open Source, "Running Hadoop on Ubuntu Linux (Single-Node Cluster)", [online], available at http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-single-node-cluster/

Index Terms

Computer Science

Information Sciences

Keywords

Hadoop twitter Flume opinion mining social analysis apache pig.