Call for Paper - September 2020 Edition
IJCA solicits original research papers for the September 2020 Edition. Last date of manuscript submission is August 20, 2020. Read More

Analysis of Airport Data using Hadoop-Hive: A Case Study

IJCA Proceedings on National Conference on “Recent Trends in Information Technology”
© 2016 by IJCA Journal
NCRTIT 2016 - Number 2
Year of Publication: 2016
S. K. Pushpa
Manjunath T. N.

S K Pushpa, Manjunath T N. and Srividhya. Article: Analysis of Airport Data using Hadoop-Hive: A Case Study. IJCA Proceedings on National Conference on Recent Trends in Information Technology NCRTIT 2016(2):23-28, August 2016. Full text available. BibTeX

	author = {S. K. Pushpa and Manjunath T. N. and Srividhya},
	title = {Article: Analysis of Airport Data using Hadoop-Hive: A Case Study},
	journal = {IJCA Proceedings on National Conference on Recent Trends in Information Technology},
	year = {2016},
	volume = {NCRTIT 2016},
	number = {2},
	pages = {23-28},
	month = {August},
	note = {Full text available}


In the contemporary world, Data analysis is a challenge in the era of varied inters- disciplines though there is a specialization in the respective disciplines. In other words, effective data analytics helps in analyzing the data of any business system. But it is the big data which helps and axialrates the process of analysis of data paving way for a success of any business intelligence system. With the expansion of the industry, the data of the industry also expands. Then, it is increasingly difficult to handle huge amount of data that gets generated no matter what's the business is like, range of fields from social media to finance, flight data, environment and health. Big Data can be used to assess risk in the insurance industry and to track reactions to products in real time. Big Data is also used to monitor things as diverse as wave movements, flight data, traffic data, financial transactions, health and crime. The challenge of Big Data is how to use it to create something that is value to the user. How can it be gathered, stored, processed and analyzed it to turn the raw data information to support decision making. In this paper Big Data is depicted in a form of case study for Airline data based on hive tools.


  • Challenges and opportunities with Big Datahttp://cra. org/ccc/wpcontent/uploads/sites/2/2015/05/bigdatawhitepaper. pdf
  • Oracle: Big Data for Enterprise, June 201http://www. oracle. com/us/products/database/big-data-for-enterprise-519135. pdf
  • Marta C. González, César A. Hidalgo, and Albert-László Barabási. 5 June 2008 Understanding individual human mobility patterns. Nature 453, 779-782.
  • James Manyika, Michael Chui, Brad Brown, Jacques Bughin, Richard Dobbs, Charles Roxburgh, and Angela Hung Byers. May 2011 Big data: The next frontier for innovation, competition, and productivity. McKinsey Global Institute.
  • Yuki Noguchi. Nov. 30, 2011 The Search for Analysts to Make Sense of Big Data. . National Public Radio. http://www. npr. org/2011/11/30/142893065/thesearch-for-analysts-to-make-sense-of-big-data
  • Data set is taken from edureka http://www. edureka. co/my-course/big-data-and-hadoop
  • Manjunath T N et. al, Automated Data Validation for Data Migration Security, International Journal of Computer Applications (0975 – 8887), Volume 30– No. 6, September 2011. (Imp act Factor=0. 88)
  • Manjunath T N et. al, The Descriptive Study of Knowledge Discovery from Web Usage Mining, IJCSI International Journal of Computer Science Issues, Vol. 8, Issue 5, No 1, September 2011 ISSN
  • HiveQL Language Manual
  • Apache Tez
  • Working with Students to Improve Indexing in Apache Hive
  • Baru C. K. , Fecteau G. , Goyal A. , Hsiao H. , Jhingran A. , Padmanabhan S. , Copeland, To appear in OSDI 2006 13 G. P. , and Wilson W. G. DB2 parallel edition. IBM Systems Journal 34, 2 (1995), 292. 322.
  • ORACLE. COM. www. oracle. com/technology/products/-database/clustering/index. html.
  • Ratnasamy S. , Francis P. , Handley M. , Karp R. , and Shenker S. A scalable content-addressable network. In Proc. of SIGCOMM (Aug. 2001), pp. 161. 172.
  • Rowstron A. , and Druschel P. Pastry: Scalable, distributed object location and routing for largescale peer-to-peer systems. In Proc. of Middleware 2001 (Nov. 2001), pp. 329. 350.
  • Stoica I. , Morris R. , Karger D. , Kaashoek, M. F. , and Balakrishnan H. Chord: A scalable peer-to-peer lookup service for Internet applications. In Proc. of SIGCOMM (Aug. 2001), pp. 149. 160.
  • Stonebraker M. The case for shared nothing. Database Engineering Bulletin 9, 1 (Mar. 1986), 4. 9.
  • Zhao B. Y. , Kubiatowicz J. , and Joseph A. D. Tapestry: An infrastructure for fault-tolerant wide-area location and routing. Tech. Rep. UCB/CSD-01-1141, CS Division, UC Berkeley, Apr. 2001.