Knowledge extraction from courses and online learning activities

Abstract

Dissertation presented as the partial requirement for obtaining a Master's degree in Data Science and Advanced Analytics, specialization in Business AnalyticsTechnological advancement has led to the increasing use of all types of electronic devices, which causes large volumes of data to be constantly generated and stored in repositories. This growth in data through Information Technology (IT) systems makes it necessary to continue its exploration and analysis to support institutions in the decision-making process. Due to the importance of education in society, this field has been the target of several studies over the years. Taking that into account, and knowing that association rules and regression analysis are among the most popular data mining algorithms for finding the hidden patterns in data, the purpose of this paper is to find exciting trends across courses considering the students’ grades, as well as study if, and to what extent, the student’s learning performance is related to their interaction in moodle. The data used were collected through the netp@ and moodle systems, consisting of all student learning data and activities/logs history. This data belongs to students of all masters who attended the academic years between 2012-2013 and 2020- 2021. We chose Sample, Explore, Modify, Model, and Assess (SEMMA) methodology for the applicability of its steps to accomplish the study’s goals. Through the Partial Least Squares Regression (PLSR) algorithm, it was shown that Gestão do Conhecimento, Metodologias de Investigação and Métodos Descritivos de Data Mining are the most importants courses that affect the grades of Dissertation/Work Project/Intership Report in the Business Intelligence specialization. In addition, according to the predictive model, Metodologias de Investigação was the most important variable for predicting the performance of the Dissertation/Work Project/Internship Report of Information Systems and Technologies Management specialization. Finally, the association rules algorithms used were the Apriori, FP-Growth and Eclat. From their results, it was found that courses with continuous assessment methods achieve better academic performance compared to others. Furthermore, higher levels of online interaction are associated with better achievement

    Similar works