37 research outputs found

Testing for association between RNA-Seq and high-dimensional data

Author: Jonker Marianne A.
Menezes Renée X.
Rauschenberger Armin
van de Wiel Mark A.
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2016

Background: Testing for association between RNA-Seq and other genomic data is challenging due to high variability of the former and high dimensionality of the latter. Results: Using the negative binomial distribution and a random-effects model, we develop an omnibus test that overcomes both difficulties. It may be conceptualised as a test of overall significance in regression analysis, where the response variable is overdispersed and the number of explanatory variables exceeds the sample size. Conclusions: The proposed test can detect genetic and epigenetic alterations that affect gene expression. It can examine complex regulatory mechanisms of gene expression. The R package globalSeq is available from Bioconductor.

Crossref

Springer - Publisher Connector

PubMed Central

Radboud Repository

Open Repository and Bibliography - Luxembourg

Sparse classification with paired covariates

Author: Ciocănea-Teodorescu Iuliana
Jonker Marianne A
Menezes Renée X
Rauschenberger Armin
van de Wiel Mark A
Publication venue: Advances in Data Analysis and Classification
Publication date: 01/01/2020

Funder: Department of Epidemiology and Biostatistics, Amsterdam UMC, VU University Amsterdam

Abstract: This paper introduces the paired lasso: a generalisation of the lasso for paired covariate settings. Our aim is to predict a single response from two high-dimensional covariate sets. We assume a one-to-one correspondence between the covariate sets, with each covariate in one set forming a pair with a covariate in the other set. Paired covariates arise, for example, when two transformations of the same data are available. It is often unknown which of the two covariate sets leads to better predictions, or whether the two covariate sets complement each other. The paired lasso addresses this problem by weighting the covariates to improve the selection from the covariate sets and the covariate pairs. It thereby combines information from both covariate sets and accounts for the paired structure. We tested the paired lasso on more than 2000 classification problems with experimental genomics data, and found that for estimating sparse but predictive models, the paired lasso outperforms the standard and the adaptive lasso. The R package palasso is available from CRAN.

Radboud Repository

Apollo (Cambridge)

Open Repository and Bibliography - Luxembourg

Can subtle changes in gene expression be consistently detected with different microarray platforms?

Author: 't Hoen Peter AC
Ariyurek Yavuz
Boer Judith M
de Hollander Mattias
de Menezes Renée X
den Dunnen Johan T
Kuiper Rowan
Pedotti Paola
Schenk Geert J
van Ommen Gertjan JB
Vossen Rolf HAM
Vreugdenhil Erno
Publication venue: BioMed Central
Publication date: 01/01/2008

Background: The comparability of gene expression data generated with different microarray platforms is still a matter of concern. Here we address the performance and the overlap in the detection of differentially expressed genes for five different microarray platforms in a challenging biological context where differences in gene expression are few and subtle. Results: Gene expression profiles in the hippocampus of five wild-type and five transgenic δC-doublecortin-like kinase mice were evaluated with five microarray platforms: Applied Biosystems, Affymetrix, Agilent, Illumina, LGTC home-spotted arrays. Using a fixed false discovery rate of 10% we detected surprising differences between the number of differentially expressed genes per platform. Four genes were selected by ABI, 130 by Affymetrix, 3,051 by Agilent, 54 by Illumina, and 13 by LGTC. Two genes were found significantly differentially expressed by all platforms and the four genes identified by the ABI platform were found by at least three other platforms. Quantitative RT-PCR analysis confirmed 20 out of 28 of the genes detected by two or more platforms and 8 out of 15 of the genes detected by Agilent only. We observed improved correlations between platforms when ranking the genes based on the significance level than with a fixed statistical cut-off. We demonstrate significant overlap in the affected gene sets identified by the different platforms, although biological processes were represented by only partially overlapping sets of genes. Aberrances in GABA-ergic signalling in the transgenic mice were consistently found by all platforms. Conclusion: The different microarray platforms give partially complementary views on biological processes affected. Our data indicate that when analyzing samples with only subtle differences in gene expression the use of two different platforms might be more attractive than increasing the number of replicates. 
Commercial two-color platforms seem to have higher power for finding differentially expressed genes between groups with small differences in expression.

Crossref

AIR Universita degli studi di Milano

Springer - Publisher Connector

PubMed Central

Erasmus University Digital Repository

Deep sequencing-based expression analysis shows major advances in robustness, resolution and inter-lab portability over five microarray platforms

Author: Erno Vreugdenhil
Gert-Jan B. van Ommen
Helene H. Thygesen
Johan T. den Dunnen
Judith M. Boer
Peter A. C. 't Hoen
Renée X. de Menezes
Rolf H. A. M. Vossen
Yavuz Ariyurek
Publication venue: Oxford University Press

The hippocampal expression profiles of wild-type mice and mice transgenic for δC-doublecortin-like kinase were compared with Solexa/Illumina deep sequencing technology and five different microarray platforms. With Illumina's digital gene expression assay, we obtained ∼2.4 million sequence tags per sample, their abundance spanning four orders of magnitude. Results were highly reproducible, even across laboratories. With a dedicated Bayesian model, we found differential expression of 3179 transcripts with an estimated false-discovery rate of 8.5%. This is a much higher figure than found for microarrays. The overlap in differentially expressed transcripts found with deep sequencing and microarrays was most significant for Affymetrix. The changes in expression observed by deep sequencing were larger than observed by microarrays or quantitative PCR. Relevant processes such as calmodulin-dependent protein kinase activity and vesicle transport along microtubules were found affected by deep sequencing but not by microarrays. While undetectable by microarrays, antisense transcription was found for 51% of all genes and alternative polyadenylation for 47%. We conclude that deep sequencing provides a major advance in robustness, comparability and richness of expression profiling data and is expected to boost collaborative, comparative and integrative genomics studies

Crossref

PubMed Central

Testing for association between RNA-Seq and high-dimensional data

Author: Armin Rauschenberger
Marianne A. Jonker
Mark A. van de Wiel
Renée X. Menezes
Publication venue: Springer Science and Business Media LLC

Crossref

Quasi-variances

Author: De Menezes Renée X.
Firth David
Publication venue: Oxford University Press (OUP)
Publication date: 01/03/2004

In statistical models of dependence, the effect of a categorical variable is typically described by contrasts among parameters. For reporting such effects, quasi‐variances provide an economical and intuitive method which permits approximate inference on any contrast by subsequent readers. Applications include generalised linear models, generalised additive models and hazard models. The present paper exposes the generality of quasi‐variances, emphasises the need to control relative errors of approximation, gives simple methods for obtaining quasi‐variances and bounds on the approximation error involved, and explores the domain of accuracy of the method. Conditions are identified under which the quasi‐variance approximation is exact, and numerical work indicates high accuracy in a variety of settings

Warwick Research Archives Portal Repository

Sparse classification with paired covariates

Author: Ciocănea-Teodorescu Iuliana
Jonker Marianne A.
Menezes Renée X.
Rauschenberger Armin
van de Wiel Mark A.
Publication venue: Advances in Data Analysis and Classification
Publication date: 15/09/2020

Funder: Department of Epidemiology and Biostatistics, Amsterdam UMC, VU University Amsterdam

Abstract: This paper introduces the paired lasso: a generalisation of the lasso for paired covariate settings. Our aim is to predict a single response from two high-dimensional covariate sets. We assume a one-to-one correspondence between the covariate sets, with each covariate in one set forming a pair with a covariate in the other set. Paired covariates arise, for example, when two transformations of the same data are available. It is often unknown which of the two covariate sets leads to better predictions, or whether the two covariate sets complement each other. The paired lasso addresses this problem by weighting the covariates to improve the selection from the covariate sets and the covariate pairs. It thereby combines information from both covariate sets and accounts for the paired structure. We tested the paired lasso on more than 2000 classification problems with experimental genomics data, and found that for estimating sparse but predictive models, the paired lasso outperforms the standard and the adaptive lasso. The R package palasso is available from CRAN.

Apollo (Cambridge)

A test for detecting differential indirect trans effects between two groups of samples

Author: Chaturvedi Nimisha
De Menezes Renée X.
Goeman Jelle J.
Wieringen Wessel Van
Publication venue: Walter de Gruyter GmbH
Publication date: 31/07/2018

Integrative analysis of copy number and gene expression data can help in understanding the cis and trans effect of copy number aberrations on transcription levels of genes involved in a pathway. To analyse how these copy number mediated gene-gene interactions differ between groups of samples we propose a new method, named dNET. Our method uses ridge regression to model the network topology involving one gene's expression level, its gene dosage and the expression levels of other genes in the network. The interaction parameters are estimated by fitting the model per gene for all samples together. However, instead of testing for differential network topology per gene, dNET tests for an overall difference in estimated parameters between two groups of samples and produces a single p-value. With the help of several simulation studies, we show that dNET can detect differential network nodes with high accuracy and low rate of false positives even in the presence of differential cis effects. We also apply dNET to publicly available TCGA cancer datasets and identify pathways where copy number mediated gene-gene interactions differ between samples with cancer stage lower than stage 3 and samples with cancer stage 3 or above

VU Research Portal

Leiden University Scholary Publications