Adversarial Data Programming: Using GANs to Relax the Bottleneck of
  Curated Labeled Data

Balasubramanian, Vineeth N; Pal, Arghya

research

Adversarial Data Programming: Using GANs to Relax the Bottleneck of Curated Labeled Data

Authors: Vineeth N Balasubramanian
Arghya Pal
Publication date: 1 January 2018
Publisher
Doi

Abstract

Paucity of large curated hand-labeled training data for every domain-of-interest forms a major bottleneck in the deployment of machine learning models in computer vision and other fields. Recent work (Data Programming) has shown how distant supervision signals in the form of labeling functions can be used to obtain labels for given data in near-constant time. In this work, we present Adversarial Data Programming (ADP), which presents an adversarial methodology to generate data as well as a curated aggregated label has given a set of weak labeling functions. We validated our method on the MNIST, Fashion MNIST, CIFAR 10 and SVHN datasets, and it outperformed many state-of-the-art models. We conducted extensive experiments to study its usefulness, as well as showed how the proposed ADP framework can be used for transfer learning as well as multi-task learning, where data from two domains are generated simultaneously using the framework along with the label information. Our future work will involve understanding the theoretical implications of this new framework from a game-theoretic perspective, as well as explore the performance of the method on more complex datasets.Comment: CVPR 2018 main conference pape

Similar works

Full text

Available Versions

Crossref

Last time updated on 10/08/2021

Research Archive of Indian Institute of Technology Hyderabad

oai:raiith.iith.ac.in:4922

Last time updated on 18/04/2019