3,872 research outputs found

    Poisson Latent Feature Calculus for Generalized Indian Buffet Processes

    Full text link
    The purpose of this work is to describe a unified, and indeed simple, mechanism for non-parametric Bayesian analysis, construction and generative sampling of a large class of latent feature models which one can describe as generalized notions of Indian Buffet Processes(IBP). This is done via the Poisson Process Calculus as it now relates to latent feature models. The IBP was ingeniously devised by Griffiths and Ghahramani in (2005) and its generative scheme is cast in terms of customers entering sequentially an Indian Buffet restaurant and selecting previously sampled dishes as well as new dishes. In this metaphor dishes corresponds to latent features, attributes, preferences shared by individuals. The IBP, and its generalizations, represent an exciting class of models well suited to handle high dimensional statistical problems now common in this information age. The IBP is based on the usage of conditionally independent Bernoulli random variables, coupled with completely random measures acting as Bayesian priors, that are used to create sparse binary matrices. This Bayesian non-parametric view was a key insight due to Thibaux and Jordan (2007). One way to think of generalizations is to to use more general random variables. Of note in the current literature are models employing Poisson and Negative-Binomial random variables. However, unlike their closely related counterparts, generalized Chinese restaurant processes, the ability to analyze IBP models in a systematic and general manner is not yet available. The limitations are both in terms of knowledge about the effects of different priors and in terms of models based on a wider choice of random variables. This work will not only provide a thorough description of the properties of existing models but also provide a simple template to devise and analyze new models.Comment: This version provides more details for the multivariate extensions in section 5. We highlight the case of a simple multinomial distribution and showcase a multivariate Levy process prior we call a stable-Beta Dirichlet process. Section 4.1.1 expande

    Poisson calculus for spatial neutral to the right processes

    Full text link
    Neutral to the right (NTR) processes were introduced by Doksum in 1974 as Bayesian priors on the class of distributions on the real line. Since that time there have been numerous applications to models that arise in survival analysis subject to possible right censoring. However, unlike the Dirichlet process, the larger class of NTR processes has not been used in a wider range of more complex statistical applications. Here, to circumvent some of these limitations, we describe a natural extension of NTR processes to arbitrary Polish spaces, which we call spatial neutral to the right processes. Our construction also leads to a new rich class of random probability measures, which we call NTR species sampling models. We show that this class contains the important two parameter extension of the Dirichlet process. We provide a posterior analysis, which yields tractable NTR analogues of the Blackwell--MacQueen distribution. Our analysis turns out to be closely related to the study of regenerative composition structures. A new computational scheme, which is an ordered variant of the general Chinese restaurant processes, is developed. This can be used to approximate complex posterior quantities. We also discuss some relationships to results that appear outside of Bayesian nonparametrics.Comment: Published at http://dx.doi.org/10.1214/009053605000000732 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org
    • …
    corecore