Contrastive Learning Using Spectral Methods

Adams, Ryan Prescott; Hsu, Daniel; Parkes, David C.; Zou, James Yang

Publication date: 9 June 2017
Publisher: Neural Information Processing Systems Foundation

Abstract

In many natural settings, the analysis goal is not to characterize a single data set in isolation, but rather to understand the difference between one set of observations and another. For example, given a background corpus of news articles together with writings of a particular author, one may want a topic model that explains word patterns and themes specific to the author. Another example comes from genomics, in which biological signals may be collected from different regions of a genome, and one wants a model that captures the differential statistics observed in these regions. This paper formalizes this notion of contrastive learning for mixture models, and develops spectral algorithms for inferring mixture components specific to a foreground data set when contrasted with a background data set. The method builds on recent moment-based estimators and tensor decompositions for latent variable models, and has the intuitive feature of using background data statistics to appropriately modify moments estimated from foreground data. A key advantage of the method is that the background data need only be coarsely modeled, which is important when the background is too complex, noisy, or not of interest. The method is demonstrated on applications in contrastive topic modeling and genomic sequence analysis.
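
The idea of using background statistics to modify foreground moments can be illustrated with a small sketch. It is not the paper's algorithm: it assumes uncentered second moments only, a hypothetical weight `gamma` on the background, and a plain eigendecomposition in place of the tensor decompositions the abstract mentions. The function names and the toy data are invented for illustration.

```python
import numpy as np

def second_moment(X):
    """Empirical uncentered second moment, mean of x x^T over the rows of X."""
    X = np.asarray(X, dtype=float)
    return X.T @ X / X.shape[0]

def contrastive_second_moment(foreground, background, gamma):
    """Foreground second moment with gamma times the background one subtracted.

    gamma is a hypothetical tuning weight (an assumption of this sketch).
    """
    return second_moment(foreground) - gamma * second_moment(background)

def top_directions(M, k):
    """Return the k largest eigenvalues of symmetric M and their eigenvectors."""
    eigvals, eigvecs = np.linalg.eigh(M)       # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1][:k]      # indices of the k largest
    return eigvals[order], eigvecs[:, order]

# Toy data: the foreground varies strongly along axis 0 (shared with the
# background) and weakly along axis 1 (specific to the foreground).
foreground = np.array([[2, 0, 0], [-2, 0, 0], [0, 1, 0], [0, -1, 0]], dtype=float)
background = np.array([[2, 0, 0], [-2, 0, 0]], dtype=float)

# Without contrast, the leading direction follows the shared axis 0.
_, plain = top_directions(second_moment(foreground), 1)

# With the background subtracted, the leading direction follows axis 1.
_, contrast = top_directions(
    contrastive_second_moment(foreground, background, gamma=0.5), 1
)

print("foreground only:", np.abs(plain[:, 0]))
print("contrastive:    ", np.abs(contrast[:, 0]))
```

In this toy setting the background needs no model beyond its second moment, which mirrors the abstract's point that the background need only be coarsely modeled.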
