Search CORE

946 research outputs found

Accurate modeling of confounding variation in eQTL studies leads to a great increase in power to detect trans-regulatory effects

Author: Neil Lawrence
Nicolo Fusi
Oliver Stegle
Publication venue
Publication date: 02/06/2011
Field of study

Expression quantitative trait loci (eQTL) studies are an integral tool to investigate the genetic component of gene expression variation. A major challenge in the analysis of such studies are hidden confounding factors, such as unobserved covariates or unknown environmental influences. These factors can induce a pronounced artifactual correlation structure in the expression profiles, which may create spurious false associations or mask real genetic association signals. 

Here, we report PANAMA (Probabilistic ANAlysis of genoMic dAta), a novel probabilistic model to account for confounding factors within an
eQTL analysis. In contrast to previous methods, PANAMA learns hidden factors jointly with the effect of prominent genetic regulators. As a result, PANAMA can more accurately distinguish between true genetic association signals and confounding variation. 

We applied our model and compared it to existing methods on a variety of datasets and biological systems. PANAMA consistently performs better than alternative methods, and finds in particular substantially more trans regulators. Importantly, PANAMA not only identified a greater number of associations, but also yields hits that are biologically more plausible and can be better reproduced between independent studies

Nature Precedings

Statistical Tests for Detecting Differential RNA-Transcript Expression from Read Counts

Author: Gunnar Rätsch
Karsten Borgwardt
Oliver Stegle
Philipp Drewe
Philipp Drewe
Regina Bohnert
Publication venue
Publication date: 01/01/2010
Field of study

As a fruit of the current revolution in sequencing technology, transcriptomes can now be analyzed at an unprecedented level of detail. These advances have been exploited for detecting differential expressed genes across biological samples and for quantifying the abundances of various RNA transcripts within one gene. However, explicit strategies for detecting the hidden differential abundances of RNA transcripts in biological samples have not been defined. In this work, we present two novel statistical tests to address this issue: a 'gene structure sensitive' Poisson test for detecting differential expression when the transcript structure of the gene is known, and a kernel-based test called Maximum Mean Discrepancy when it is unknown. We analyzed the proposed approaches on simulated read data for two artificial samples as well as on factual reads generated by the Illumina Genome Analyzer for two _C. elegans_ samples. Our analysis shows that the Poisson test identifies genes with differential transcript expression considerably better that previously proposed RNA transcript quantification approaches for this task. The MMD test is able to detect a large fraction (75%) of such differential cases without the knowledge of the annotated transcripts. It is therefore well-suited to analyze RNA-Seq experiments when the genome annotations are incomplete or not available, where other approaches have to fail

Crossref

Nature Precedings

MPG.PuRe

A Whole-Class Support Model for Early Literacy: The Anna Plan

Author: Henk Bill
Hubbs Karen G
Mallette Marla H
Miles Pamela A
Stegle Kathy W
Publication venue: e-Publications@Marquette
Publication date: 01/12/2004
Field of study

The Anna Plan is a unique delivery model for enhancing schoolwide literacy instruction in the primary grades. Based on the principles of Reading Recovery and Four Blocks literacy instruction, it provides supplementary reading instruction through the distinctive use of teaching staff. Over six years, it has resulted in sweeping changes in the way literacy instruction occurs as well as noteworthy increases in children\u27s reading abilities. This article gives a brief history of the authors\u27 work within the Anna Plan, explains each of the model\u27s seven tenets, and describes the research base that drives it. The focal point of the article is the detailed description of the organization and components of the five-day framework used to augment classroom reading and writing instruction. Finally, the authors recount how the Anna Plan has been embraced by two elementary schools and offer some conclusions about what contributes to the success of whole-class support models for early literacy

epublications@Marquette

Recommended from our members

SpatialDE: identification of spatially variable genes.

Author: Stegle Oliver
Svensson Valentine
Teichmann Sarah A
Publication venue: Nature Methods
Publication date: 01/05/2018
Field of study

Technological advances have made it possible to measure spatially resolved gene expression at high throughput. However, methods to analyze these data are not established. Here we describe SpatialDE, a statistical test to identify genes with spatial patterns of expression variation from multiplexed imaging or spatial RNA-sequencing data. SpatialDE also implements 'automatic expression histology', a spatial gene-clustering approach that enables expression-based tissue histology

Apollo (Cambridge)

LIMIX: genetic analysis of multiple traits

Author: Casale F.P.
Lippert C.
Rakitsch B.
Stegle O.
Publication venue: 'Cold Spring Harbor Laboratory'
Publication date: 22/05/2014
Field of study

Multi-trait mixed models have emerged as a promising approach for joint analyses of multiple traits. In principle, the mixed model framework is remarkably general. However, current methods implement only a very specific range of tasks to optimize the necessary computations. Here, we present a multi-trait modeling framework that is versatile and fast: LIMIX enables to exibly adapt mixed models for a broad range of applications with different observed and hidden covariates, and variable study designs. To highlight the novel modeling aspects of LIMIX we performed three vastly different genetic studies: joint GWAS of correlated blood lipid phenotypes, joint analysis of the expression levels of the multiple transcript-isoforms of a gene, and pathway-based modeling of molecular traits across environments. In these applications we show that LIMIX increases GWAS power and phenotype prediction accuracy, in particular when integrating stepwise multi-locus regression into multi-trait models, and when analyzing large numbers of traits. An open source implementation of LIMIX is freely available at: https://github.com/PMBio/limix

MDC Repository

Warped linear mixed models for the genetic analysis of transformed phenotypes.

Author: Fusi Nicolo
Lawrence Neil D
Lippert Christoph
Stegle Oliver
Publication venue: Nat Commun
Publication date: 19/09/2014
Field of study

Linear mixed models (LMMs) are a powerful and established tool for studying genotype-phenotype relationships. A limitation of the LMM is that the model assumes Gaussian distributed residuals, a requirement that rarely holds in practice. Violations of this assumption can lead to false conclusions and loss in power. To mitigate this problem, it is common practice to pre-process the phenotypic values to make them as Gaussian as possible, for instance by applying logarithmic or other nonlinear transformations. Unfortunately, different phenotypes require different transformations, and choosing an appropriate transformation is challenging and subjective. Here we present an extension of the LMM that estimates an optimal transformation from the observed data. In simulations and applications to real data from human, mouse and yeast, we show that using transformations inferred by our model increases power in genome-wide association studies and increases the accuracy of heritability estimation and phenotype prediction

White Rose Research Online

Using the past to estimate sensory uncertainty

Author: Beierholm U.
Ferrari A.
Noppeney U.
Rohe T.
Stegle O.
Publication venue: eLife Sciences Publications
Publication date: 01/01/2020
Field of study

To form a more reliable percept of the environment, the brain needs to estimate its own sensory uncertainty. Current theories of perceptual inference assume that the brain computes sensory uncertainty instantaneously and independently for each stimulus. We evaluated this assumption in four psychophysical experiments, in which human observers localized auditory signals that were presented synchronously with spatially disparate visual signals. Critically, the visual noise changed dynamically over time continuously or with intermittent jumps. Our results show that observers integrate audiovisual inputs weighted by sensory uncertainty estimates that combine information from past and current signals consistent with an optimal Bayesian learner that can be approximated by exponential discounting. Our results challenge leading models of perceptual inference where sensory uncertainty estimates depend only on the current stimulus. They demonstrate that the brain capitalizes on the temporal dynamics of the external world and estimates sensory uncertainty by combining past experiences with new incoming sensory signals

Durham Research Online

Radboud Repository

MPG.PuRe