Call for Paper - January 2024 Edition
IJCA solicits original research papers for the January 2024 Edition. Last date of manuscript submission is December 20, 2023. Read More

Location Wise Opinion Mining of Real-Time Twitter Data using Hadoop

International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Year of Publication: 2021
Farha Naaz, Rajesh Boghey, Sandeep Rai

Farha Naaz, Rajesh Boghey and Sandeep Rai. Location Wise Opinion Mining of Real-Time Twitter Data using Hadoop. International Journal of Computer Applications 183(28):24-30, September 2021. BibTeX

	author = {Farha Naaz and Rajesh Boghey and Sandeep Rai},
	title = {Location Wise Opinion Mining of Real-Time Twitter Data using Hadoop},
	journal = {International Journal of Computer Applications},
	issue_date = {September 2021},
	volume = {183},
	number = {28},
	month = {Sep},
	year = {2021},
	issn = {0975-8887},
	pages = {24-30},
	numpages = {7},
	url = {},
	doi = {10.5120/ijca2021921665},
	publisher = {Foundation of Computer Science (FCS), NY, USA},
	address = {New York, USA}


Opinion Mining is the process of detecting the contextual polarity of text. In other words, it reflects a piece of writing that is positive, negative, or neutral. The opinions of others seem to be crucial in decision making. Compressing out the usable content from these opinion sources becomes a perplexing task. Today social networking data is the best and accurate source for gathering public opinions. A large volume of data is generated everyday online which is not easy to handle and processed by traditional methods. In this research, a methodology is discussed which allows interpretation of real-time Twitter data in opinion mining. We take Twitter data because on Twitter huge opinions are shared. The analysis was done on tweets about iphone8. For this, we can fetch real-time Twitter data by using flume and storingit in HDFS. Hadoop is a best open-source solution for storing and processing a large amount of data. Hadoop has two separate components HDFS for storage and MapReduce for processing. We can integrate Apache Pig with Flume for analyzing the sentiment on the basis of location because opinions are changing from location to location. Apache Pig is used for analysis as it is best suited for both structured and unstructured data.


  1. Ankur Goel, Jyoti Gautam, Sitesh Kumar, “Real Time Sentiment Analysis of Tweets Using Naive Bayes”, 2016 2nd International Conference on Next Generation Computing Technologies (NGCT-2016) Dehradun, India 14-16 October 2016, IEEE.
  2. Mrunal Sogodekar, Shikha Pandey, Isha Tupkari, Amit Manekar, “Big Data Analytics: Hadoop And Tools”, 2016 IEEE Bombay Section Symposium (IBSS), IEEE 2016.
  3. Aditya Bhardwaj, Vanraj, Ankit Kumar, Yogendra Narayan, Pawan Kumar, “ Big Data Emerging Technologies: A case-study with Analyzing Twitter Data using Apache Hive”, IEEE 2015.
  4. Sagiroglu, S., & Sinanc, D., “Big data: A review”, IEEE International Conference on Collaboration Technologies and Systems (CTS), 2013.
  5. Can Uzunkayaa, Tolga Ensaria, Yusuf Kavurucu, “Hadoop Ecosystem and Its Analysis on Tweets”, World Conference on Technology, Innovation and Entrepreneurship, Procedia - Social and Behavioral Sciences 195 ( 2015 ) 1890 – 1897, Elsevier 2015.
  6. Manoj Kumar Danthala, “Tweet Analysis: Twitter Data processing Using Apache Hadoop”, International Journal Of Core Engineering & Management (IJCEM) Volume 1, Issue 11, February 2015.
  7. Sitaram Asur, Bernardo A. Huberman, “Predicting the future with social media”, International conference on Web intelligence and intelligent agent technology (WI-IAT), IEEE/WIC/ACM vol. 1, 2010.
  8. Judith Sherin Tilsha S, Shobha M.S, “A Survey on Twitter Data Analysis Techniques to Extract Public Opinion”, IJARCSE, Vol. 5, Issue 11, Nov 2015.
  9. M. Trupthi, Suresh Pabboju, G. Narasimha, “Sentiment Analysis on Twitter Using Streaming API”, 2017 IEEE 7th International Advance Computing Conference (IACC), 2017.
  10. M. Mazhar Rathore, Anand Paul, Awais Ahmad, Muhammad Imran, Mohsen Guizani, “Big Data Analytics of Geosocial Media for Planning and Real-Time Decisions”, SAC Symposium Big Data Networking Track, IEEE ICC 2017.
  11. Nikitha Johnsirani Venkatesan, Earl Kim, Dong Ryeol Shin, “PoN: Open Source solution for Real-time Data Analysis”, IEEE 2016.
  12. Divya Sehgal and Dr. Ambuj Kumar Agarwal, “Sentiment Analysis of Big Data Applications using Twitter Data with the Help of Hadoop Framework”, Proceedings of the SMART -2016, IEEE Conference 5th International Conference on System Modeling & Advancement in Research Trends.
  13. Nikitha Johnsirani Venkatesan, Dong Ryeol Shin, Earl Kim, “PoN: Open Source solution for Real-time Data Analysis”, IEEE, 2016.
  14. Syed Akib Anwar Hridoy, M. Tahmid Ekram, Mohammad Samiul Islam, Faysal Ahmed and Rashedur M. Rahman, “Localized twitter opinion mining using sentiment analysis”, Springer open journal, 2011.


Sentiment analysis, Hadoop, Apache Flume, Pig, Location-based, Big data.