Large-Scale Multi-Label Learning with Incomplete Label Assignments

Fan, Wei; Kong, Xiangnan; Li, Li-Jia; Wu, Hang; Wu, Zhaoming; Yu, Philip S.; Zhang, Ruofei

Publication date: 6 July 2014

Abstract

Multi-label learning deals with classification problems where each instance can be assigned multiple labels simultaneously. Conventional multi-label learning approaches mainly focus on exploiting label correlations. It is usually assumed, explicitly or implicitly, that the label sets of training instances are complete, with no missing labels. However, in many real-world multi-label datasets, the label assignments for training instances can be incomplete: some ground-truth labels can be missed by the labeler. This problem is especially typical when the number of instances is very large and the labeling cost is very high, which makes it almost impossible to obtain a fully labeled training set. In this paper, we study the problem of large-scale multi-label learning with incomplete label assignments. We propose an approach, called MPU, based on positive and unlabeled stochastic gradient descent and stacked models. Unlike prior work, our method can effectively and efficiently consider missing labels and label correlations simultaneously, and it is scalable, with time complexity linear in the size of the data. Extensive experiments on two real-world multi-label datasets show that our MPU model consistently outperforms other commonly used baselines.
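To make the positive-and-unlabeled idea concrete, the following is a minimal sketch and not the MPU algorithm itself, whose details the abstract does not give. It assumes a common PU heuristic: train a logistic regression for a single label with stochastic gradient descent, treating every instance without the label as "unlabeled" and down-weighting its loss, since it may be a missed positive. The names `train_pu_sgd`, `predict_proba`, and the parameter `neg_weight` are hypothetical, and the stacking step for label correlations is not shown.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def train_pu_sgd(X, y, neg_weight=0.3, lr=0.1, epochs=50, seed=0):
    """Weighted logistic regression for one label, trained by SGD.

    y[i] == 1 marks an observed positive; y[i] == 0 marks an unlabeled
    instance (possibly a missed positive). neg_weight < 1 is an assumed,
    hypothetical knob that shrinks the loss on unlabeled instances.
    """
    rng = random.Random(seed)
    w = [0.0] * len(X[0])
    b = 0.0
    order = list(range(len(X)))
    for _ in range(epochs):
        rng.shuffle(order)
        for i in order:  # one pass costs O(instances * features)
            p = sigmoid(sum(wj * xj for wj, xj in zip(w, X[i])) + b)
            weight = 1.0 if y[i] == 1 else neg_weight
            g = weight * (p - y[i])  # gradient of the weighted log loss
            for j in range(len(w)):
                w[j] -= lr * g * X[i][j]
            b -= lr * g
    return w, b


def predict_proba(w, b, x):
    # Estimated probability that instance x carries the label.
    return sigmoid(sum(wj * xj for wj, xj in zip(w, x)) + b)
```

Each epoch touches every instance once, so the cost of this sketch grows linearly with the number of instances, which is the kind of scaling behaviour the abstract claims for MPU. In a multi-label setting one such model would be trained per label.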

Available Versions

CiteSeerX

oai:CiteSeerX.psu:10.1.1.642.4...

Last time updated on 29/10/2017

Crossref

info:doi/10.1137%2F1.978161197...

Last time updated on 05/06/2019

CiteSeerX

oai:CiteSeerX.psu:10.1.1.765.9...

Last time updated on 30/10/2017