Asymptotic Analysis of Generative Semi-Supervised Learning

Balasubramanian, Krishnakumar; Dillon, Joshua V; Lebanon, Guy

Authors: Krishnakumar Balasubramanian, Joshua V Dillon, Guy Lebanon
Publication date: 1 January 2010

Abstract

Semi-supervised learning has emerged as a popular framework for improving modeling accuracy while controlling labeling cost. Based on an extension of stochastic composite likelihood, we quantify the asymptotic accuracy of generative semi-supervised learning. In doing so, we complement distribution-free analysis by providing an alternative framework for measuring the value of different labeling policies, and we resolve the fundamental question of how much data to label and in what manner. We demonstrate our approach with both simulation studies and real-world experiments, using naive Bayes for text classification and MRFs and CRFs for structured prediction in NLP.

Comment: 12 pages, 9 figures
