Call for Paper - January 2023 Edition
IJCA solicits original research papers for the January 2023 Edition. Last date of manuscript submission is December 20, 2022. Read More

Multidimensional and Non-Relational Data Models: A Comparison with a Big Volume of Data

Print
PDF
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Year of Publication: 2020
Authors:
Maria Camila S. De Lira, Ademir B. Santos Neto, Maria C. Moraes Batista, Roberta Macedo M. Gouveia, Tiago Alessandro E. Ferreira
10.5120/ijca2020920927

Maria Camila De S Lira, Ademir Santos B Neto, Maria Moraes C Batista, Roberta Macedo M Gouveia and Tiago Alessandro E Ferreira. Multidimensional and Non-Relational Data Models: A Comparison with a Big Volume of Data. International Journal of Computer Applications 175(36):1-7, December 2020. BibTeX

@article{10.5120/ijca2020920927,
	author = {Maria Camila S. De Lira and Ademir B. Santos Neto and Maria C. Moraes Batista and Roberta Macedo M. Gouveia and Tiago Alessandro E. Ferreira},
	title = {Multidimensional and Non-Relational Data Models: A Comparison with a Big Volume of Data},
	journal = {International Journal of Computer Applications},
	issue_date = {December 2020},
	volume = {175},
	number = {36},
	month = {Dec},
	year = {2020},
	issn = {0975-8887},
	pages = {1-7},
	numpages = {7},
	url = {http://www.ijcaonline.org/archives/volume175/number36/31682-2020920927},
	doi = {10.5120/ijca2020920927},
	publisher = {Foundation of Computer Science (FCS), NY, USA},
	address = {New York, USA}
}

Abstract

In the last decade, a huge volume of data has been created every day. There are many sources where this information comes from, among them, it is possible to cite: financial transnational records, internet banking, call centers, ATMs, sensors, among others. As the amount of data is increasingly really quickly and the importance of the information inside the big data is crucial to the world, new tools and technologies are appearing to support the use and the manipulation of big amounts of data. This paper presents a comparative analysis of two data models, a multidimensional data model (Data Warehouse) and a non-relational data model, also known as NoSQL, in order to show their efficiency about insert and query the data in a context of big volumes of data.

References

  1. ALLIANCE. Whats the deal big deal with data? Technical report, BSA, 20 F Street, NW, Washington, 2015.
  2. Andrew McAfee, Erik Brynjolfsson, et al. Big data: the management revolution. Harvard business review, 90(10):60–68, 2012.
  3. John Gantz and David Reinsel. Extracting value from chaos. IDC iview, 1142:1–12, 2011.
  4. Lei Wang, Jianfeng Zhan, Chunjie Luo, Yuqing Zhu, Qiang Yang, Yongqiang He, Wanling Gao, Zhen Jia, Yingjie Shi, Shujie Zhang, et al. Bigdatabench: A big data benchmark suite from internet services. In High Performance Computer Architecture (HPCA), 2014 IEEE 20th International Symposium on, pages 488–499. IEEE, 2014.
  5. Jason Richmond and Jinhua Guo. Pricing the internet for congestion control and social welfare. In Computer Communication and Networks (ICCCN), 2014 23rd International Conference on, pages 1–6. IEEE, 2014.
  6. Ibrahim Abaker Targio Hashem, Ibrar Yaqoob, Nor Badrul Anuar, Salimah Mokhtar, Abdullah Gani, and Samee Ullah Khan. The rise of big data on cloud computing: Review and open research issues. Information Systems, 47:98–115, 2015.
  7. Seref Sagiroglu and Duygu Sinanc. Big data: A review. In Collaboration Technologies and Systems (CTS), 2013 International Conference on, pages 42–47. IEEE, 2013.
  8. Josiane Rodrigues, Marco Cristo, Javier Zambrano Ferreira, David Fernandes, and Andr´e Carvalho. Multi-entity polarity analysis and detection of subjectivity in financial documents. Journal of Information and Data Management, 6(2):130, 2016.
  9. Sam Madden. From databases to big data. IEEE Internet Computing, 16(3):4–6, 2012.
  10. Jing Han, E Haihong, Guan Le, and Jian Du. Survey on nosql database. In Pervasive computing and applications (ICPCA), 6th international conference on, pages 363–366. IEEE, 2011.
  11. Hsinchun Chen, Roger HL Chiang, and Veda C Storey. Business intelligence and analytics: From big data to big impact. MIS quarterly, 36(4):1165–1188, 2012.
  12. Paul Lane and Oracle9i Data Warehousing Guide. Release 1 (9.0. 1). Oracle Corporation, 2001.
  13. Marcos Rodrigues Vieira, J Figueiredo, Gustavo Liberatti, and Alvaro Fellipe Mendes Viebrantz. Nosql databases: Concepts, tools, languages and case study in a big data context. Brasilian Database Congress, 2012.
  14. Ling Liu and M Tamer O¨ zsu. Encyclopedia of database systems, volume 6. Springer Berlin, Heidelberg, Germany, 2009.
  15. Manuel Serrano, Coral Calero, and Mario Piattini. Experimental validation of multidimensional data models metrics. In System Sciences, 2003. Proceedings of the 36th Annual Hawaii International Conference on, page 327. IEEE, 2003.
  16. Torben Bach Pedersen and Christian S Jensen. Multidimensional data modeling for complex data. In Data Engineering. Proceedings., 15th International Conference on, pages 336– 345. IEEE, 1999.
  17. Ralph Kimball and Margy Ross. The data warehouse toolkit: the complete guide to dimensional modeling. John Wiley & Sons, 2011.
  18. William Rowen, Il-Yeol Song, Carl Medsker, and Edward Ewen. An analysis of many-to-many relationships between fact and dimension tables in dimensional modeling. In Proceedings of the International Workshop on Design and Management of Data Warehouses (DMDW 2001), Interlaken, Switzerland, 2001.
  19. William H Inmon. What is a data warehouse? Prism Tech Topic, 1. (1995), 1995.
  20. William H Inmon. Building the data warehouse. John wiley & sons, 2005.
  21. John Palo and Newman Ray. Creation and adoption of online analytical process (olap) into the management decision support system aided by computers. Scholedge International Journal of Business Policy & Governance ISSN 2394-3351, 2(5):8–13, 2015.
  22. Mar´ia Carina Rold´an. Pentaho Data Integration Beginner’s Guide. Packt Publishing Ltd, 2013.
  23. DataOnFocus. Oltp vs olap: De nitions and comparison, 2015.
  24. Ricardo Brito. Nosql databases x relational databases: comparative analysis. Technical report, Fortaleza University, Av. Washington Soares, Fortaleza-CE, 2010.
  25. Rick Cattell. Scalable sql and nosql data stores. Acm Sigmod Record, 39(4):12–27, 2011.
  26. IBM. Bringing big data to the enterprise, 2016.
  27. Eric A Brewer. Towards robust distributed systems. In PODC, volume 7, 2000.
  28. C. Strauch, U.-L. S. Sites, and W. Kriha. Nosql databases, 2016.
  29. ABM Moniruzzaman and Syed Akhter Hossain. Nosql database: New era of databases for big data analyticsclassification, characteristics and comparison. International Journal of Database Theory and Application, 6(4):1–14, 2013.
  30. Berk Atikoglu, Yuehai Xu, Eitan Frachtenberg, Song Jiang, and Mike Paleczny. Workload analysis of a large-scale keyvalue store. In ACM SIGMETRICS Performance Evaluation Review, volume 40, pages 53–64. ACM, 2012.
  31. Daniel J Abadi, Peter A Boncz, and Stavros Harizopoulos. Column-oriented database systems. Proceedings of the VLDB Endowment, 2(2):1664–1665, 2009.
  32. Ameya Nayak, Anil Poriya, and Dikshay Poojary. Type of nosql databases and its comparison with relational databases. International Journal of Applied Information Systems, 5(4):16–19, 2013.
  33. Kristina Chodorow. MongoDB: the definitive guide. O’Reilly Media, Inc., 2013.
  34. Giuseppe DeCandia, Deniz Hastorun, Madan Jampani, Gunavardhan Kakulapati, Avinash Lakshman, Alex Pilchin, Swaminathan Sivasubramanian, Peter Vosshall, and Werner Vogels. Dynamo: amazon’s highly available key-value store. ACM SIGOPS operating systems review, 41(6):205–220, 2007.
  35. Ricardo Neves and Jorge Bernardino. Performance and scalability of voldemort nosql. In Information Systems and Technologies (CISTI), 2015 10th Iberian Conference on, pages 1– 6. IEEE, 2015.
  36. Avinash Lakshman and Prashant Malik. Cassandra: structured storage system on a p2p network. In Proceedings of the 28th ACM symposium on Principles of distributed computing, pages 5–5. ACM, 2009.
  37. Brad Fitzpatrick. Distributed caching with memcached. Linux journal, (124):5, 2004.
  38. Lars George. HBase: The Definitive Guide: Random Access to Your Planet-Size Data. O’Reilly Media, Inc., 2011.
  39. Konstantin Shvachko, Hairong Kuang, Sanjay Radia, and Robert Chansler. The hadoop distributed file system. In Mass storage systems and technologies (MSST), 2010 IEEE 26th symposium on, pages 1–10. IEEE, 2010.
  40. Ashish Thusoo, Joydeep Sen Sarma, Namit Jain, Zheng Shao, Prasad Chakka, Suresh Anthony, Hao Liu, Pete Wyckoff, and Raghotham Murthy. Hive: a warehousing solution over a map-reduce framework. Proceedings of the VLDB Endowment, 2(2):1626–1629, 2009.
  41. John M Zelle. Python programming: an introduction to computer science. Franklin, Beedle & Associates, Inc., 2004.
  42. Ralph Kimball and Margy Ross. The data warehouse toolkit: The definitive guide to dimensional modeling. John Wiley & Sons, 2013.

Keywords

Comparison, Data Warehouse, HBase, Hadoop, NoSQL