Research Article

Performance Analysis of NoSQL Databases with Large Volumes of Open Educational Data

by Felipe F. De Lima Melo, Roberta M. Marques Gouveia, Andrêza L. De Alencar, Maria da Conceição M. Batista, Ademir B. Santos Neto, Tiago A.E. Ferreira
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 174 - Number 29
Year of Publication: 2021
DOI: 10.5120/ijca2021921219

Felipe F. De Lima Melo, Roberta M. Marques Gouveia, Andrêza L. De Alencar, Maria da Conceição M. Batista, Ademir B. Santos Neto, Tiago A.E. Ferreira. Performance Analysis of NoSQL Databases with Large Volumes of Open Educational Data. International Journal of Computer Applications. 174, 29 (Apr 2021), 9-17. DOI=10.5120/ijca2021921219

@article{ 10.5120/ijca2021921219,
author = { Felipe F. De Lima Melo and Roberta M. Marques Gouveia and Andrêza L. De Alencar and Maria da Conceição M. Batista and Ademir B. Santos Neto and Tiago A.E. Ferreira },
title = { Performance Analysis of NoSQL Databases with Large Volumes of Open Educational Data },
journal = { International Journal of Computer Applications },
issue_date = { Apr 2021 },
volume = { 174 },
number = { 29 },
month = { Apr },
year = { 2021 },
issn = { 0975-8887 },
pages = { 9-17 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume174/number29/31859-2021921219/ },
doi = { 10.5120/ijca2021921219 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T00:23:23.954908+05:30
%A Felipe F. De Lima Melo
%A Roberta M. Marques Gouveia
%A Andrêza L. De Alencar
%A Maria da Conceição M. Batista
%A Ademir B. Santos Neto
%A Tiago A.E. Ferreira
%T Performance Analysis of NoSQL Databases with Large Volumes of Open Educational Data
%J International Journal of Computer Applications
%@ 0975-8887
%V 174
%N 29
%P 9-17
%D 2021
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Non-relational databases, also known as NoSQL (Not Only Structured Query Language), emerged in response to the new requirements of Web 2.0 applications. Relational databases, although established for decades as a model for data storage and manipulation, began to face performance limitations when dealing with large volumes of data. NoSQL databases have a flexible data structure and, when combined with distributed computing, provide good scalability, which makes them suitable for Big Data scenarios. In this context, this work evaluates three NoSQL databases in order to verify their performance on large volumes of educational data. The experiments were performed with school census data available in the repository of the Anísio Teixeira National Institute for Educational Studies and Research (INEP) in Brazil. For this case study, the following databases were adopted: DynamoDB (whose data model is key-value oriented), MongoDB (whose data model is document-oriented), and Cassandra (whose data model is column-oriented). Among the investigated databases, MongoDB was the most efficient, presenting lower processing times in insert/load, query, update, and removal operations on basic educational data.
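
The four operation types compared in the abstract (insert/load, query, update, removal) can be timed with a small harness like the one below. This is only a minimal sketch under stated assumptions, not the authors' benchmark code: `InMemoryStore` is a hypothetical stand-in for a real database client, the record fields (`id`, `state`, `enrolled`) are illustrative and are not the INEP school census schema, and the data is synthetic. To compare real systems, one would presumably replace the stand-in with an adapter around each database's driver that exposes the same four methods.

```python
import time
from typing import Callable, Dict, List


class InMemoryStore:
    """Hypothetical stand-in for a database client; not a real driver."""

    def __init__(self) -> None:
        self.rows: Dict[int, dict] = {}

    def insert(self, records: List[dict]) -> None:
        # Bulk load: store a copy of each record under its id.
        for rec in records:
            self.rows[rec["id"]] = dict(rec)

    def query(self, state: str) -> List[dict]:
        # Full scan filtered on one field.
        return [r for r in self.rows.values() if r["state"] == state]

    def update(self, state: str, field: str, value) -> int:
        # Set one field on every matching record; return the match count.
        hits = self.query(state)
        for r in hits:
            r[field] = value
        return len(hits)

    def remove(self, state: str) -> int:
        # Delete every matching record; return the number deleted.
        ids = [k for k, r in self.rows.items() if r["state"] == state]
        for k in ids:
            del self.rows[k]
        return len(ids)


def timed(operation: Callable[[], object]) -> float:
    """Return wall-clock seconds taken by one call to `operation`."""
    start = time.perf_counter()
    operation()
    return time.perf_counter() - start


def run_benchmark(store: InMemoryStore, records: List[dict]) -> Dict[str, float]:
    """Time the four operation types in order: insert, query, update, remove."""
    return {
        "insert": timed(lambda: store.insert(records)),
        "query": timed(lambda: store.query("PE")),
        "update": timed(lambda: store.update("PE", "enrolled", 0)),
        "remove": timed(lambda: store.remove("PE")),
    }


# Synthetic records; field names are illustrative, not the INEP schema.
records = [{"id": i, "state": "PE" if i % 2 else "SP", "enrolled": i % 500}
           for i in range(10_000)]
store = InMemoryStore()
timings = run_benchmark(store, records)
for name, seconds in timings.items():
    print(f"{name}: {seconds:.6f} s")
```

A single timed run like this is noisy; repeating each operation several times and reporting an average or median would give a fairer comparison between systems.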

Index Terms

Computer Science
Information Sciences

Keywords

Nonrelational Databases, Data Processing, Data Models, NoSQL, Evaluation, Performance