20 research outputs found

Bayesian ranking and selection methods using hierarchical mixture models in microarray studies.

Author: Matsui Shigeyuki
Noma Hisashi
Omori Takashi
Sato Tosiya
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/04/2010

The main purpose of microarray studies is screening to identify differentially expressed genes as candidates for further investigation. Because resources are limited at this stage, prioritizing or ranking genes is a relevant statistical task in microarray studies. In this article, we develop 3 empirical Bayes methods for gene ranking on the basis of differential expression, using hierarchical mixture models. These methods are based on (i) minimizing mean squared errors of estimation for parameters, (ii) minimizing mean squared errors of estimation for ranks of parameters, and (iii) maximizing sensitivity in selecting prespecified numbers of differential genes with the largest effects. Our methods incorporate the mixture structures of differential and nondifferential components in empirical Bayes models to allow information borrowing across differential genes, with separation from nuisance, nondifferential genes. The accuracy of our ranking methods is compared with that of conventional methods through simulation studies. An application to a clinical study for breast cancer is provided.
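
The idea behind criterion (i), ranking genes by an estimate that minimizes mean squared error, can be illustrated with a small sketch. This is not the authors' implementation. It assumes a simple two-component model in which each gene's z-score is N(delta, 1), with delta = 0 for nondifferential genes and delta drawn from N(mu, tau2) for differential genes. The function names and the hyperparameter values (pi0, mu, tau2) are hypothetical and are fixed by hand here, whereas an empirical Bayes analysis would estimate them from the data.

```python
from statistics import NormalDist


def posterior_effects(z_scores, pi0, mu, tau2):
    """Return (P(differential | z), posterior mean effect) for each z-score.

    Assumed model (illustrative only): z ~ N(delta, 1); delta = 0 with
    probability pi0, otherwise delta ~ N(mu, tau2).
    """
    null = NormalDist(0.0, 1.0)
    # Marginal density of z for differential genes: N(mu, 1 + tau2).
    alt = NormalDist(mu, (1.0 + tau2) ** 0.5)
    results = []
    for z in z_scores:
        f0 = pi0 * null.pdf(z)
        f1 = (1.0 - pi0) * alt.pdf(z)
        p_diff = f1 / (f0 + f1)
        # Conjugate-normal posterior mean of delta given "differential".
        shrunk = (mu + tau2 * z) / (1.0 + tau2)
        # The overall posterior mean mixes this with the null effect of 0.
        results.append((p_diff, p_diff * shrunk))
    return results


def rank_genes(names, z_scores, pi0=0.9, mu=2.0, tau2=1.0):
    """Rank genes by posterior mean effect, largest first."""
    post = posterior_effects(z_scores, pi0, mu, tau2)
    order = sorted(range(len(names)), key=lambda i: post[i][1], reverse=True)
    return [(names[i], post[i][0], post[i][1]) for i in order]


# Hypothetical z-scores for four genes.
ranking = rank_genes(["g1", "g2", "g3", "g4"], [0.3, 3.1, 1.2, 4.0])
for name, p_diff, effect in ranking:
    print(f"{name}: P(diff)={p_diff:.3f}, posterior mean={effect:.3f}")
```

The posterior mean is the minimum mean squared error estimate under the assumed model, and it pulls small z-scores strongly toward zero because they are most likely nondifferential. Criteria (ii) and (iii) in the abstract use different loss functions and are not covered by this sketch.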

Kyoto University Research Information Repository

Incorporating predicted functions of nonsynonymous variants into gene-based analysis of exome sequencing data: a comparative study

Author: AL Price
B Li
BE Madsen
ET Cirulli
GJ McLachlan
IA Adzhubei
JM Schwarz
LA Almasy
P Kumar
P Wei
Peng Wei
SE Flanagan
W Sun
Xiaoming Liu
Y Bromberg
Yun-Xin Fu
Publication venue: BioMed Central
Publication date: 01/01/2011

Next-generation sequencing has opened up new avenues for the genetic study of complex traits. However, because of the small number of observations for any given rare allele and high sequencing error, it is a challenge to identify functional rare variants associated with the phenotype of interest. Recent research shows that grouping variants by gene and incorporating computationally predicted functions of variants may provide higher statistical power. On the other hand, many algorithms are available for predicting the damaging effects of nonsynonymous variants. Here, we use the simulated mini-exome data of Genetic Analysis Workshop 17 to study and compare the effects of incorporating the functional predictions of single-nucleotide polymorphisms using two popular algorithms, SIFT and PolyPhen-2, into a gene-based association test. We also propose a simple mixture model that can effectively combine test results based on different functional prediction algorithms.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

SpeCond: a method to detect condition-specific gene expression

Author: Bourgon Richard
Cavalli Florence MG
Huber Wolfgang
Luscombe Nicholas M
Vaquerizas Juan M
Publication venue: BioMed Central
Publication date: 01/01/2011

Transcriptomic studies routinely measure expression levels across numerous conditions. These datasets allow identification of genes that are specifically expressed in a small number of conditions. However, there are currently no statistically robust methods for identifying such genes. Here we present SpeCond, a method to detect condition-specific genes that outperforms alternative approaches. We apply the method to a dataset of 32 human tissues to determine 2,673 specifically expressed genes. An implementation of SpeCond is freely available as a Bioconductor package at http://www.bioconductor.org/packages/release/bioc/html/SpeCond.html

Crossref

Springer - Publisher Connector

PubMed Central

UCL Discovery

Using a 3D virtual muscle model to link gene expression changes during myogenesis to protein spatial location in muscle

Author: A Reverter
AI Su
Antonio Reverter
Ashley J Waardenberg
B Efron
Brian P Dalrymple
Christine A Wells
CS Mermelstein
D Pette
DJ Duggan
EJ Morris
GA Dabiri
GD Bader
GJ McLachlan
GY Koh
H Sorimachi
IH Chen
J Li
J Wang
JC Wu
JJ Lin
JM Ervasti
K Tagawa
KA Clark
KE Davies
KJ McCullagh
KK Tomczak
M Berendse
M Brancaccio
M Kaariainen
ML Golson
P Gunning
RG Whalen
RH Gomer
S Oota
S Peri
T Barrett
W Bains
W Scott
Publication venue: BioMed Central
Publication date: 01/01/2008

Background: Myogenesis is an ordered process whereby mononucleated muscle precursor cells (myoblasts) fuse into multinucleated myotubes that eventually differentiate into myofibres, involving substantial changes in gene expression and the organisation of structural components of the cells. To gain further insight into the orchestration of these structural changes, we have overlaid the spatial organisation of the protein components of a muscle cell with their gene expression changes during differentiation using a new 3D visualisation tool: the Virtual Muscle 3D (VMus3D).

ResearchOnline@JCU

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

ResearchOnline at James Cook University

PubMed Central

University of Melbourne Institutional Repository

University of Queensland eSpace

A statistical framework for integrating two microarray data sets in differential expression analysis

Author: D Lockhart
D Singh
EM Conlon
F Hong
GJ McLachlan
I Borozan
JD Storey
Jin-Xiong She
JK Choi
KHS Wilson
L Ein-Dor
L Xu
M Miron
M Schena
M Zhang
P Cahan
PT Spellman
S Dudoit
Sarah E Eckenrode
SE Eckenrode
TR Golub
VK Mootha
X Cui
Y Benjamini
Y Lai
Yinglei Lai
Publication venue: BioMed Central
Publication date: 01/01/2009

Background: Different microarray data sets can be collected for studying the same or similar diseases. We expect to achieve a more efficient analysis of differential expression if an efficient statistical method can be developed for integrating different microarray data sets. Although many statistical methods have been proposed for data integration, the genome-wide concordance of different data sets has not been well considered in the analysis. Results: Before considering data integration, it is necessary to evaluate the genome-wide concordance so that misleading results can be avoided. Based on the test results, different subsequent actions are suggested. The evaluation of genome-wide concordance and the data integration can both be achieved with normal-distribution-based mixture models. Conclusion: The results from our simulation study suggest that misleading results can be generated if the genome-wide concordance issue is not appropriately considered. Our method provides a rigorous parametric solution. The results also show that our method is robust to certain model misspecification and is practically useful for the integrative analysis of differential expression.

Crossref

Directory of Open Access Journals

PubMed Central

George Washington University: Health Sciences Research Commons (HSRC)

Evaluation of fecal mRNA reproducibility via a marginal transformed mixture modeling approach

Author: Chapkin Robert S
Davidson Laurie A
George Nysia I
Lupton Joanne R
Turner Nancy D
Wang Naisyin
Publication venue: BioMed Central
Publication date: 01/01/2010

Background: Developing and evaluating new technology that enables researchers to recover gene-expression levels of colonic cells from fecal samples could be key to a non-invasive screening tool for early detection of colon cancer. The current study, to the best of our knowledge, is the first to investigate and report the reproducibility of fecal microarray data. Using the intraclass correlation coefficient (ICC) as a measure of reproducibility and the preliminary analysis of fecal and mucosal data, we assessed the reliability of mixture density estimation and the reproducibility of fecal microarray data. Using Monte Carlo-based methods, we explored whether ICC values should be modeled as a beta-mixture or transformed first and fitted with a normal-mixture. We used outcomes from bootstrapped goodness-of-fit tests to determine which approach is less sensitive toward potential violation of distributional assumptions. Results: The graphical examination of the distributions of both ICC and probit-transformed ICC (PT-ICC) clearly shows that there are two components in the distributions. For ICC measurements, which are between 0 and 1, the practice in the literature has been to assume that the data points are from a beta-mixture distribution. Nevertheless, in our study we show that the use of a normal-mixture modeling approach on PT-ICC could provide superior performance. Conclusions: When modeling ICC values of gene expression levels, using a mixture of normals on the probit-transformed (PT) scale is less sensitive toward model mis-specification than using a mixture of betas. We show that a biased conclusion could be made if we follow the traditional approach and model the two sets of ICC values using the mixture of betas directly. The problematic estimation arises from the sensitivity of beta-mixtures toward model mis-specification, particularly when there are observations in the neighborhood of the boundary points, 0 or 1. Since beta-mixture modeling is commonly used in approximating the distribution of measurements between 0 and 1, our findings have important implications beyond the findings of the current study. By using the normal-mixture approach on PT-ICC, we observed the quality of reproducible genes in fecal array data to be comparable to those in mucosal arrays.
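
As a rough illustration of the normal-mixture approach on PT-ICC described in this abstract, the sketch below probit-transforms values in (0, 1) and fits a two-component normal mixture by plain EM. It is not the authors' code. The ICC values are synthetic (drawn from two beta distributions with a fixed seed), and the clipping constant, the starting values, and the function names `probit` and `fit_two_normals` are assumptions made for this example.

```python
import random
from statistics import NormalDist, fmean


def probit(icc, eps=1e-6):
    """Map a value in [0, 1] to the real line with the standard normal quantile.

    Values are clipped away from 0 and 1 (eps is an assumed constant) so the
    transform stays finite at the boundary points.
    """
    clipped = min(max(icc, eps), 1.0 - eps)
    return NormalDist().inv_cdf(clipped)


def fit_two_normals(xs, n_iter=100):
    """Fit a two-component normal mixture by EM; returns (weights, means, sds)."""
    xs = sorted(xs)
    half = len(xs) // 2
    # Start the two means from the lower and upper halves of the data.
    mu = [fmean(xs[:half]), fmean(xs[half:])]
    sd = [1.0, 1.0]
    w = [0.5, 0.5]
    for _ in range(n_iter):
        comps = [NormalDist(mu[k], sd[k]) for k in range(2)]
        # E-step: responsibility of each component for each observation.
        resp = []
        for x in xs:
            d = [w[k] * comps[k].pdf(x) for k in range(2)]
            total = max(d[0] + d[1], 1e-300)
            resp.append((d[0] / total, d[1] / total))
        # M-step: update weights, means and standard deviations.
        for k in range(2):
            nk = sum(r[k] for r in resp)
            w[k] = nk / len(xs)
            mu[k] = sum(r[k] * x for r, x in zip(resp, xs)) / nk
            var = sum(r[k] * (x - mu[k]) ** 2 for r, x in zip(resp, xs)) / nk
            sd[k] = max(var ** 0.5, 1e-3)  # floor keeps the component proper
    return w, mu, sd


# Synthetic ICC values: one low-reproducibility and one high-reproducibility group.
rng = random.Random(0)
icc = [rng.betavariate(2, 8) for _ in range(150)]
icc += [rng.betavariate(8, 2) for _ in range(150)]

weights, means, sds = fit_two_normals([probit(v) for v in icc])
print("weights:", weights)
print("means:", means)
print("sds:", sds)
```

The probit scale removes the hard boundaries at 0 and 1, which is the region the abstract identifies as problematic for beta-mixtures. The bootstrapped goodness-of-fit comparison the authors describe is not reproduced here.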

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Texas A&M Repository

Transcription profiling provides insights into gene pathways involved in horn and scurs development in cattle

Author: Barris Wes
Dalrymple Brian
Lehnert Sigrid A
Mariasegaram Maxy
Prayaga Kishore
Reverter Antonio
Publication venue: BioMed Central
Publication date: 01/01/2010

Background: Two types of horns are evident in cattle: fixed horns attached to the skull and a variation called scurs, which refers to small, loosely attached horns. Cattle lacking horns are referred to as polled. Although the Poll and Scurs loci have been mapped to BTA1 and BTA19, respectively, the underlying genetic basis of these phenotypes is unknown, and so far, no candidate genes regulating these developmental processes have been described. This study is the first reported attempt at transcript profiling to identify genes and pathways contributing to horn and scurs development in Brahman cattle, relative to polled counterparts. Results: Expression patterns in polled, horned and scurs tissues were obtained using the Agilent 44 k bovine array. The most notable feature when comparing transcriptional profiles of developing horn tissues against polled was the down regulation of genes coding for elements of the cadherin junction as well as those involved in epidermal development. We hypothesize that this is a key event involved in keratinocyte migration and subsequent horn development. In the polled-scurs comparison, the most prevalent differentially expressed transcripts code for genes involved in extracellular matrix remodelling, which were up regulated in scurs tissues relative to polled. Conclusion: For the first time, we describe networks of genes involved in horn and scurs development. Interestingly, we did not observe differential expression in any of the genes present on the fine mapped region of BTA1 known to contain the Poll locus.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central