Learning mixtures of structured distributions over discrete domains

Chan, Siu-on; Diakonikolas, Ilias; Servedio, Rocco A.; Sun, Xiaorui

research

Learning mixtures of structured distributions over discrete domains

Authors: Siu-on Chan
Ilias Diakonikolas
Rocco A. Servedio
Xiaorui Sun
Publication date: 2 October 2012
Publisher
Doi

Abstract

Let

\mathfrak{C}

be a class of probability distributions over the discrete domain

[n] = \{1,...,n\}.

We show that if

\mathfrak{C}

satisfies a rather general condition -- essentially, that each distribution in

\mathfrak{C}

can be well-approximated by a variable-width histogram with few bins -- then there is a highly efficient (both in terms of running time and sample complexity) algorithm that can learn any mixture of

k

unknown distributions from

\mathfrak{C}.

We analyze several natural types of distributions over

[n]

, including log-concave, monotone hazard rate and unimodal distributions, and show that they have the required structural property of being well-approximated by a histogram with few bins. Applying our general algorithm, we obtain near-optimally efficient algorithms for all these mixture learning problems.Comment: preliminary full version of soda'13 pape

Similar works

Full text

Available Versions

CiteSeerX

oai:CiteSeerX.psu:10.1.1.640.5...

Last time updated on 29/10/2017

Crossref

info:doi/10.1137%2F1.978161197...

Last time updated on 22/07/2021