22,974 research outputs found
Generalized Species Sampling Priors with Latent Beta reinforcements
Many popular Bayesian nonparametric priors can be characterized in terms of
exchangeable species sampling sequences. However, in some applications,
exchangeability may not be appropriate. We introduce a {novel and
probabilistically coherent family of non-exchangeable species sampling
sequences characterized by a tractable predictive probability function with
weights driven by a sequence of independent Beta random variables. We compare
their theoretical clustering properties with those of the Dirichlet Process and
the two parameters Poisson-Dirichlet process. The proposed construction
provides a complete characterization of the joint process, differently from
existing work. We then propose the use of such process as prior distribution in
a hierarchical Bayes modeling framework, and we describe a Markov Chain Monte
Carlo sampler for posterior inference. We evaluate the performance of the prior
and the robustness of the resulting inference in a simulation study, providing
a comparison with popular Dirichlet Processes mixtures and Hidden Markov
Models. Finally, we develop an application to the detection of chromosomal
aberrations in breast cancer by leveraging array CGH data.Comment: For correspondence purposes, Edoardo M. Airoldi's email is
[email protected]; Federico Bassetti's email is
[email protected]; Michele Guindani's email is
[email protected] ; Fabrizo Leisen's email is
[email protected]. To appear in the Journal of the American
Statistical Associatio
Modeling dependent gene expression
In this paper we propose a Bayesian approach for inference about dependence
of high throughput gene expression. Our goals are to use prior knowledge about
pathways to anchor inference about dependence among genes; to account for this
dependence while making inferences about differences in mean expression across
phenotypes; and to explore differences in the dependence itself across
phenotypes. Useful features of the proposed approach are a model-based
parsimonious representation of expression as an ordinal outcome, a novel and
flexible representation of prior information on the nature of dependencies, and
the use of a coherent probability model over both the structure and strength of
the dependencies of interest. We evaluate our approach through simulations and
in the analysis of data on expression of genes in the Complement and
Coagulation Cascade pathway in ovarian cancer.Comment: Published in at http://dx.doi.org/10.1214/11-AOAS525 the Annals of
Applied Statistics (http://www.imstat.org/aoas/) by the Institute of
Mathematical Statistics (http://www.imstat.org
Multiple Testing for Neuroimaging via Hidden Markov Random Field
Traditional voxel-level multiple testing procedures in neuroimaging, mostly
-value based, often ignore the spatial correlations among neighboring voxels
and thus suffer from substantial loss of power. We extend the
local-significance-index based procedure originally developed for the hidden
Markov chain models, which aims to minimize the false nondiscovery rate subject
to a constraint on the false discovery rate, to three-dimensional neuroimaging
data using a hidden Markov random field model. A generalized
expectation-maximization algorithm for maximizing the penalized likelihood is
proposed for estimating the model parameters. Extensive simulations show that
the proposed approach is more powerful than conventional false discovery rate
procedures. We apply the method to the comparison between mild cognitive
impairment, a disease status with increased risk of developing Alzheimer's or
another dementia, and normal controls in the FDG-PET imaging study of the
Alzheimer's Disease Neuroimaging Initiative.Comment: A MATLAB package implementing the proposed FDR procedure is available
with this paper at the Biometrics website on Wiley Online Librar
Bayesian computational methods
In this chapter, we will first present the most standard computational
challenges met in Bayesian Statistics, focussing primarily on mixture
estimation and on model choice issues, and then relate these problems with
computational solutions. Of course, this chapter is only a terse introduction
to the problems and solutions related to Bayesian computations. For more
complete references, see Robert and Casella (2004, 2009), or Marin and Robert
(2007), among others. We also restrain from providing an introduction to
Bayesian Statistics per se and for comprehensive coverage, address the reader
to Robert (2007), (again) among others.Comment: This is a revised version of a chapter written for the Handbook of
Computational Statistics, edited by J. Gentle, W. Hardle and Y. Mori in 2003,
in preparation for the second editio
- …