These authors contributed equally to this work Abstract—We propose a novel semi-supervised learning algorithm, called IMCC, designed for co-training classifiers on single-view datasets. Our method runs the co-training algorithm for a predefined number of times, each time using a different random split of features. Thus, a set of diverse cotraining classifiers is created. Each of these classifiers then labels each of the examples for which we want to determine the class label. In this way, each example for classification is assigned multiple labels. We then treat this as a problem of learning from inconsistent and unreliable annotators in a multi-annotator problem setting and estimate the single hidden true label for each example. In experimental results obtained on 25 benchmark datasets of various properties IMCC outperformed five considered alternative methods for cotraining on single-view datasets, and resulted in a statistical tie with a Naive Bayes classifier trained using a much larger set of labeled examples. Keywords- semi-supervised learning; ensemble methods; cotraining; multiple annotation I
To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.