Call for Paper - October 2019 Edition
IJCA solicits original research papers for the October 2019 Edition. Last date of manuscript submission is September 20, 2019. Read More

Opinion Mining of Twitter Data using Hive

Print
PDF
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Year of Publication: 2016
Authors:
Pratyancha Kirar, Deepak Sain, S. K. Shrivastava
10.5120/ijca2016912351

Pratyancha Kirar, Deepak Sain and S K Shrivastava. Opinion Mining of Twitter Data using Hive. International Journal of Computer Applications 156(1):44-49, December 2016. BibTeX

@article{10.5120/ijca2016912351,
	author = {Pratyancha Kirar and Deepak Sain and S. K. Shrivastava},
	title = {Opinion Mining of Twitter Data using Hive},
	journal = {International Journal of Computer Applications},
	issue_date = {December 2016},
	volume = {156},
	number = {1},
	month = {Dec},
	year = {2016},
	issn = {0975-8887},
	pages = {44-49},
	numpages = {6},
	url = {http://www.ijcaonline.org/archives/volume156/number1/26675-2016912351},
	doi = {10.5120/ijca2016912351},
	publisher = {Foundation of Computer Science (FCS), NY, USA},
	address = {New York, USA}
}

Abstract

In todays extremely developed world, each minute, individuals round the globe specific themselves via numerous platforms on the net. And in every minute, an enormous quantity of unstructured information is generated. This information is within the style of text that is gathered from forums and social media websites. Such information is termed as massive information. User opinions square measure associated with a good vary of topics like politics, latest gadgets and merchandise. Social Networking sites provides tremendous impetus for large information in mining people’s opinion. Public API’s catered by sites like Twitter provides North American nation with helpful information for studying writer’s perspective in terms of of a specific topic, product etc. To distinguish people’s opinion, tweets square measure labeled into positive, negative or neutral indicators. This paper provides an efficient mechanism to perform opinion mining by coming up with a finish to finish pipeline with the assistance of Apache Flume ,Apache HDFS, and Apache Hive. Here we proposed to develop a opinion Analysis mechanism to analyze the various polarity of opinions of Twitter users through their tweets in order to extract what they think.

Here we have used dictionary based approach for analysis for which we have implemented hive queries through which we can analysis these complex twitter data to check polarity of the tweets based on the polarity dictionary through which we can say that which tweets have negative opinion or positive opinion.

References

  1. Aditya Bhardwaj, Vineet Kumar Singh,Vanraj, Yogendra Narayan, “Analyzing BigData with Hadoop Cluster in HDInsight Azure Cloud”,IEEE 2015, 978-1-4673-6540-6/15.
  2. Aditya Bhardwaj, Vanraj, Ankit Kumar, Yogendra Narayan , Pawan Kumar, “Big Data Emerging Technologies: A CaseStudy with Analyzing Twitter Data using Apache Hive”, in IEEE 2015, 978-1-4673-8253-3/15.
  3. Sunil B. Mane , Sunil B. Mane, Yashwant Sawant, Saif Kazi, Vaibhav Shinde , “Real Time Sentiment Analysis of Twitter Data Using Hadoop”, (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 5 (3) , 2014, 3098 – 3100 , ISSN:0975-9646.
  4. Mahalakshmi R, Suseela S , “Big-SoSA:Social Sentiment Analysis and Data Visualization on Big Data”, International Journal of Advanced Research in Computer and Communication Engineering, Vol. 4, Issue 4, April 2015 , pp 304-306, ISSN : 2278-1021.
  5. Praveen Kumar, Dr Vijay Singh Rathore,” Efficient Capabilities of Processing of Big Data using Hadoop Map Reduce”, International Journal of Advanced Research in Computer and Communication Engineering Vol. 3, Issue 6, June 2014, pp 7123-7126.
  6. Manoj Kumar Danthala, “Tweet Analysis: Twitter Data processing Using Apache Hadoop”, International Journal Of Core Engineering & Management (IJCEM) Volume 1, Issue 11, February 2015, pp 94-102.
  7. Manoj Kumar Danthala, “Bigdata Analysis: Streaming Twitter Data with Apache Hadoop and Visualizing using BigInsights”, International Journal of Engineering Research & Technology, Volume. 4 - Issue. 05 , May – 2015.
  8. Judith Sherin Tilsha S , Shobha M S, “A Survey on Twitter Data Analysis Techniques to Extract Public Opinion”, International Journal of Advanced Research in Computer Science and Software Engineering, Volume 5, Issue 11, November 2015, pp 536-540.
  9. Mr. Sagar Nadagoud, Mr. Kotresh Naik.D, “Market Sentiment Analysis for Popularity of Flipkart ”, International Journal of Advanced Research in Computer Engineering & Technology (IJARCET), Volume4Issue5,May2015,pp 2117-2123.
  10. Michael G. Noll, Applied Research, Big Data, Distributed Systems, Open Source, "Running Hadoop on Ubuntu Linux (Single-Node Cluster)", [online], available at http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-single-node-cluster/
  11. Aditya B. Patel, Manashvi Birla, Ushma Nair, "Addressing Big Data Problem Using Hadoop and Map Reduce",6-8 Dec. 2012.

Keywords

Opinion mining, hadoop, apache flume, hive, Dictionary based approach, bigdata.