Search CORE

2 research outputs found

An analysis of semantic data quality defiencies in a national data warehouse: a data mining approach

Author: Barth Kirstin
Publication venue
Publication date: 01/07/2019
Field of study

This research determines whether data quality mining can be used to describe, monitor and evaluate the scope and impact of semantic data quality problems in the learner enrolment data on the National Learners’ Records Database. Previous data quality mining work has focused on anomaly detection and has assumed that the data quality aspect being measured exists as a data value in the data set being mined. The method for this research is quantitative in that the data mining techniques and model that are best suited for semantic data quality deficiencies are identified and then applied to the data. The research determines that unsupervised data mining techniques that allow for weighted analysis of the data would be most suitable for the data mining of semantic data deficiencies. Further, the academic Knowledge Discovery in Databases model needs to be amended when applied to data mining semantic data quality deficiencies.School of ComputingM. Tech. (Information Technology

Unisa Institutional Repository

Weighted Clustering of Sparse Educational Data

Author: Kärkkäinen Tommi
Saarela Mirka
Publication venue: ESANN
Publication date: 21/08/2015
Field of study

Clustering as an unsupervised technique is predominantly used in unweighted settings. In this paper, we present an efficient version of a robust clustering algorithm for sparse educational data that takes the weights, aligning a sample with the corresponding population, into account. The algorithm is utilized to divide the Finnish student population of PISA 2012 (the latest data from the Programme for International Student Assessment) into groups, according to their attitudes and perceptions towards mathematics, for which one third of the data is missing. Furthermore, necessary modifications of three cluster indices to reveal an appropriate number of groups are proposed and demonstrated.peerReviewe

Jyväskylä University Digital Archive