Variance Ranking for Multi-Classed Imbalanced Datasets: A Case Study of One-Versus-All

Al-Bayatti, A. H.; Al-Bayatti, A. H.; Al-Nemrat, A.; Al-Nemrat, A.; Alalwan, N.; Alalwan, N.; Alfarraj, O.; Alfarraj, O.; Alzahrani, A. I.; Alzahrani, A. I.; Ebenuwa, S.; Ebenuwa, S.; Sharif, S.; Sharif, S.

Variance Ranking for Multi-Classed Imbalanced Datasets: A Case Study of One-Versus-All

Authors: A. H. Al-Bayatti
A. H. Al-Bayatti
A. Al-Nemrat
A. Al-Nemrat
N. Alalwan
N. Alalwan
O. Alfarraj
O. Alfarraj
A. I. Alzahrani
A. I. Alzahrani
S. Ebenuwa
S. Ebenuwa
S. Sharif
S. Sharif
Publication date: 1 January 2019
Publisher: 'MDPI AG'
Doi

Abstract

Imbalanced classes in multi-classed datasets is one of the most salient hindrances to the accuracy and dependable results of predictive modeling. In predictions, there are always majority and minority classes, and in most cases it is difficult to capture the members of item belonging to the minority classes. This anomaly is traceable to the designs of the predictive algorithms because most algorithms do not factor in the unequal numbers of classes into their designs and implementations. The accuracy of most modeling processes is subjective to the ever-present consequences of the imbalanced classes. This paper employs the variance ranking technique to deal with the real-world class imbalance problem. We augmented this technique using one-versus-all re-coding of the multi-classed datasets. The proof-of-concept experimentation shows that our technique performs better when compared with the previous work done on capturing small class members in multi-classed datasets