Scalable Algorithms for Missing Value Imputation

Abdel-rahiem A. Hashem; Marghny H. Mohamed; Mohammed M. Abdelsamea

Publication date: 30 August 2014

Abstract

Statistical imputation techniques have been proposed mainly to predict the missing values in incomplete data sets, an essential step in any data analysis framework. K-means-based imputation, a representative statistical imputation method, has produced satisfactory results in terms of effectiveness and efficiency on popular, freely available data sets (e.g., Bupa, Breast Cancer, Pima). The main idea of K-means-based methods is to impute a missing value from the prototype of its representative class and the similarity of the data. However, such methods share the limitations of K-means as a data mining technique. Motivated by these drawbacks, this paper introduces simple and efficient imputation methods based on K-means to deal with missing data in various classes of data sets. The proposed methods give higher accuracy than that given by standard K-means.
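To make the baseline idea concrete, the following is a minimal sketch of plain K-means-based imputation: each missing entry is filled with the matching coordinate of the prototype (centroid) of the cluster its row is assigned to. It illustrates only the standard approach described in the abstract, not the methods proposed in the paper. The function name `kmeans_impute`, its parameters, the column-mean initialisation, and the squared Euclidean distance are all assumptions made for illustration.

```python
import numpy as np

def kmeans_impute(X, k=2, n_iter=20, seed=0):
    """Fill NaN entries of X with the centroid of each row's cluster.

    Hypothetical helper for illustration; assumes no column is entirely NaN
    and that X has at least k rows.
    """
    X = np.asarray(X, dtype=float)
    missing = np.isnan(X)

    # Start from column-mean imputation so every row is a complete vector.
    col_means = np.nanmean(X, axis=0)
    filled = np.where(missing, col_means, X)

    # Pick k distinct rows as the initial prototypes.
    rng = np.random.default_rng(seed)
    centroids = filled[rng.choice(len(filled), size=k, replace=False)].copy()

    for _ in range(n_iter):
        # Assign each row to its nearest prototype (squared Euclidean distance
        # on the currently filled data, an assumption of this sketch).
        dists = ((filled[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)

        # Move each prototype to the mean of its members; keep it if empty.
        for j in range(k):
            members = filled[labels == j]
            if len(members):
                centroids[j] = members.mean(axis=0)

        # Re-impute only the missing entries; observed values never change.
        filled = np.where(missing, centroids[labels], X)

    return filled, labels


if __name__ == "__main__":
    data = np.array([
        [1.0, 2.0],
        [1.2, np.nan],
        [0.8, 1.9],
        [9.0, 10.0],
        [9.1, 10.2],
        [np.nan, 9.8],
    ])
    completed, cluster_ids = kmeans_impute(data, k=2)
    print(completed)
    print(cluster_ids)
```

Because the prototypes depend on the random initial choice, such a sketch inherits the usual K-means sensitivities (initialisation, choice of k), which are the kind of limitation the abstract refers to.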

Available Versions

CiteSeerX

oai:CiteSeerX.psu:10.1.1.428.9...

Last time updated on 22/10/2014