International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 174 - Number 29 |
Year of Publication: 2021 |
Authors: Felipe F. De Lima Melo, Roberta M. Marques Gouveia, Andrêza L. De Alencar, Maria da Conceição M. Batista, Ademir B. Santos Neto, Tiago A.E. Ferreira |
10.5120/ijca2021921219 |
Felipe F. De Lima Melo, Roberta M. Marques Gouveia, Andrêza L. De Alencar, Maria da Conceição M. Batista, Ademir B. Santos Neto, Tiago A.E. Ferreira . Performance Analysis of NoSQL Databases with Large Volumes of Open Educational Data. International Journal of Computer Applications. 174, 29 ( Apr 2021), 9-17. DOI=10.5120/ijca2021921219
Non-Relational Databases, also known as NoSQL (Not Only Structured Query Language), emerged in the face of new requirements of Web 2.0 computer applications. Relational databases, although consolidated as a data storage and manipulation model for decades, began to face performance limitations when dealing with large volumes of data. NoSQL databases have flexible data structure, and when associated with distributed computing provide a good scalability, being indicated in the Big Data scenario. In this context, this work evaluates the performance of three NoSQL databases, in order to verify their performance in large volumes of educational data. The experiments were performed with school census data, available in the repository of the Ansio Teixeira National Institute for Educational Studies and Research (INEP) in Brazil. For this case of study, the following databases were adopted: DynamoDB (whose data model is key-value oriented), MongoDB (whose data model is document-oriented), and Cassandra (whose data model is column-oriented). Therefore, among the investigated databases, MongoDB was more efficient, presenting lower processing times in the operations of inserts/loads, queries, updates, and removals of basic educational data.