Consensus and meta-analysis regulatory networks for combining multiple microarray gene expression datasets
Microarray data is a key source of experimental data for modelling gene regulatory interactions from expression levels. With the rapid increase of publicly available microarray data comes the opportunity to produce regulatory network models based on multiple datasets. Such models are potentially more robust, carry greater confidence, and place less reliance on a single dataset. However, combining datasets directly can be difficult, as experiments are often conducted on different microarray platforms and in different laboratories, leading to inherent biases in the data that are not always removed through pre-processing such as normalisation. In this paper we compare two frameworks for combining microarray datasets to model regulatory networks: pre- and post-learning aggregation. In pre-learning approaches, such as applying simple scale-normalisation prior to the concatenation of datasets, a model is learnt from a combined dataset, whilst in post-learning aggregation individual models are learnt from each dataset and the models are combined. We present two novel approaches for post-learning aggregation, each based on aggregating high-level features of Bayesian network models that have been generated from different microarray expression datasets. Meta-analysis Bayesian networks are based on combining statistical confidences attached to network edges, whilst Consensus Bayesian networks identify consistent network features across all datasets. We apply both approaches to multiple datasets from synthetic and real (Escherichia coli and yeast) networks and demonstrate that both methods can improve on networks learnt from a single dataset or from an aggregated dataset formed using a standard scale-normalisation.
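The two post-learning aggregation schemes described above can be sketched as follows. The edge-confidence matrices, the threshold, and the combination rule (a simple mean) are illustrative assumptions for the sketch, not the paper's exact implementation.

```python
import numpy as np

# Hypothetical edge-confidence matrices (genes x genes), one per dataset,
# as produced by learning a Bayesian network from each dataset separately.
conf_per_dataset = [
    np.array([[0.0, 0.9], [0.1, 0.0]]),
    np.array([[0.0, 0.8], [0.6, 0.0]]),
    np.array([[0.0, 0.7], [0.2, 0.0]]),
]

def meta_analysis_network(confs, threshold=0.5):
    """Meta-analysis style: combine per-dataset edge confidences (here, a
    simple mean) and keep edges whose combined confidence clears the bar."""
    return np.mean(confs, axis=0) >= threshold

def consensus_network(confs, threshold=0.5):
    """Consensus style: keep only edges that clear the bar in every dataset."""
    return np.logical_and.reduce([c >= threshold for c in confs])

meta = meta_analysis_network(conf_per_dataset)
cons = consensus_network(conf_per_dataset)
```

Here the edge 0→1 survives both rules (confidences 0.9, 0.8, 0.7), while 1→0 survives neither; on real networks the consensus rule is the stricter of the two, since a single weak dataset vetoes an edge.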
Study of meta-analysis strategies for network inference using information-theoretic approaches
© 2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Reverse engineering of gene regulatory networks (GRNs) from gene expression data is a classical challenge in systems biology. Thanks to high-throughput technologies, a massive amount of gene-expression data has accumulated in public repositories. Modelling GRNs from multiple experiments (also called integrative analysis) has therefore naturally become a standard procedure in modern computational biology. Indeed, such analysis is usually more robust than traditional approaches focused on individual datasets, which typically suffer from experimental bias and a small number of samples.
To date, there are mainly two strategies for the problem of interest: the first ("data merging") merges all datasets together and then infers a GRN, whereas the other ("networks ensemble") infers GRNs from every dataset separately and then aggregates them using some ensemble rule (such as rank-sum or weight-sum). Unfortunately, a thorough comparison of these two approaches is lacking.
In this paper, we evaluate the performance of the meta-analysis approaches mentioned above with a systematic set of experiments based on in silico benchmarks. Furthermore, we present a new meta-analysis approach for inferring GRNs from multiple studies. Our proposed approach, adapted to methods based on pairwise measures such as correlation or mutual information, consists of two steps: aggregating the matrices of pairwise measures from every dataset, followed by extracting the network from the meta-matrix.
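A minimal sketch of the two-step idea above (aggregate the pairwise-measure matrices, then extract the network from the meta-matrix), using absolute Pearson correlation as the pairwise measure and a mean plus a fixed threshold as illustrative choices rather than the authors' exact procedure:

```python
import numpy as np

rng = np.random.default_rng(0)
# Three stand-in expression datasets (samples x genes) over the same genes.
datasets = [rng.normal(size=(20, 5)) for _ in range(3)]

# Step 1: one pairwise-measure matrix per dataset (absolute correlation here;
# mutual information would fit the same scheme), averaged into a meta-matrix.
measures = [np.abs(np.corrcoef(d, rowvar=False)) for d in datasets]
meta_matrix = np.mean(measures, axis=0)

# Step 2: extract the network from the meta-matrix, e.g. by thresholding.
np.fill_diagonal(meta_matrix, 0.0)
edges = meta_matrix >= 0.5
```

Because the aggregation happens on the measure matrices rather than on the final networks, each dataset contributes graded evidence for every gene pair before any hard edge decision is made.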
Combining heterogeneous sources of data for the reverse-engineering of gene regulatory networks
This thesis was submitted for the degree of Doctor of Philosophy and awarded by Brunel University.
Gene Regulatory Networks (GRNs) represent how genes interact in various cellular processes by describing how the expression level, or activity, of genes can affect the expression of other genes. Reverse-engineering GRN models can help biologists understand and gain insight into genetic conditions and diseases. Recently, the increasingly widespread use of DNA microarrays, a high-throughput technology that allows the expression of thousands of genes to be measured simultaneously in biological experiments, has led to many datasets of gene expression measurements becoming publicly available and a subsequent explosion of research in the reverse-engineering of GRN models. However, microarray technology has a number of limitations as a data source for modelling GRNs, due to concerns over its reliability and the reproducibility of experimental results. The underlying theme of the research presented in this thesis is the incorporation of multiple sources and different types of data into techniques for reverse-engineering, or learning, GRNs from data. By drawing on many data sources, the resulting network models should be more robust, accurate and reliable than models learnt from a single data source. This is achieved by focusing on two main strands of research. First, the thesis presents some of the earliest work on incorporating prior knowledge generated from a large body of scientific papers into Bayesian network based GRN models. Second, novel methods for using multiple microarray datasets to produce Bayesian network based GRN models are introduced.
Empirical evaluations show that incorporating literature-based prior knowledge and combining multiple microarray datasets each provide an improvement over the use of a single microarray dataset for the reverse-engineering of Bayesian network based GRN models.
Bioinformatics tools in predictive ecology: Applications to fisheries
This article is made available through the Brunel Open Access Publishing Fund. Copyright © 2012 Tucker et al.
There has been a huge effort in the advancement of analytical techniques for molecular biological data over the past decade. This has led to many novel algorithms that are specialized to deal with data associated with biological phenomena, such as gene expression and protein interactions. In contrast, ecological data analysis has remained focused to some degree on off-the-shelf statistical techniques, though this is starting to change with the adoption of state-of-the-art methods where few assumptions can be made about the data and a more explorative approach is required, for example through the use of Bayesian networks. In this paper, some novel bioinformatics tools for microarray data are discussed along with their "crossover potential", with an application to fisheries data. In particular, a focus is made on the development of models that identify functionally equivalent species in different fish communities with the aim of predicting functional collapse.
Discovering study-specific gene regulatory networks
This article has been made available through the Brunel Open Access Publishing Fund.
Microarrays are commonly used in biology because of their ability to simultaneously measure thousands of genes under different conditions. Due to their structure, typically containing a high number of variables but far fewer samples, scalable network analysis techniques are often employed. In particular, consensus approaches have recently been used that combine multiple microarray studies in order to find networks that are more robust. The purpose of this paper, however, is to combine multiple microarray studies to automatically identify subnetworks that are distinctive to specific experimental conditions rather than common to them all. To better understand key regulatory mechanisms and how they change under different conditions, we derive unique networks from multiple independent networks built using glasso, which goes beyond standard correlations. This involves calculating cluster prediction accuracies to detect the most predictive genes for a specific set of conditions. We differentiate between accuracies calculated using cross-validation within a selected cluster of studies (the intra prediction accuracy) and those calculated on a set of independent studies belonging to different study clusters (the inter prediction accuracy). Finally, we compare our method's results to related state-of-the-art techniques. We explore how the proposed pipeline performs on both synthetic data and real data (wheat and Fusarium). Our results show that subnetworks can be identified reliably that are specific to subsets of studies, and that these networks reflect key mechanisms that are fundamental to the experimental conditions in each of those subsets.
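As an illustration of the contrast drawn above between consensus networks and study-specific subnetworks, the following sketch learns one glasso network per cluster of studies and keeps the edges unique to a single cluster. The data, the `alpha` penalty, and the edge threshold are invented for the example; this is not the paper's pipeline, which additionally uses intra/inter prediction accuracies.

```python
import numpy as np
from sklearn.covariance import GraphicalLasso

rng = np.random.default_rng(1)
# Stand-ins for expression data pooled within two clusters of studies.
cluster_a = rng.normal(size=(60, 6))
cluster_b = rng.normal(size=(60, 6))

def glasso_edges(X, alpha=0.2, tol=1e-4):
    """Edges = non-zero off-diagonal entries of the glasso precision matrix."""
    prec = GraphicalLasso(alpha=alpha).fit(X).precision_
    edges = np.abs(prec) > tol
    np.fill_diagonal(edges, False)
    return edges

edges_a = glasso_edges(cluster_a)
edges_b = glasso_edges(cluster_b)
# A subnetwork distinctive to cluster A: present there but absent in cluster B.
unique_to_a = edges_a & ~edges_b
```

Working with the precision (inverse covariance) matrix is what lets glasso "go beyond standard correlations": its non-zero entries indicate conditional dependence between two genes given all the others, rather than raw pairwise correlation.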
Integrated genomics and proteomics define huntingtin CAG length-dependent networks in mice.
To gain insight into how mutant huntingtin (mHtt) CAG repeat length modifies Huntington's disease (HD) pathogenesis, we profiled mRNA in over 600 brain and peripheral tissue samples from HD knock-in mice with increasing CAG repeat lengths. We found repeat length-dependent transcriptional signatures to be prominent in the striatum, less so in cortex, and minimal in the liver. Coexpression network analyses revealed 13 striatal and 5 cortical modules that correlated highly with CAG length and age, and that were preserved in HD models and sometimes in patients. Top striatal modules implicated mHtt CAG length and age in graded impairment in the expression of identity genes for striatal medium spiny neurons and in dysregulation of cyclic AMP signaling, cell death and protocadherin genes. We used proteomics to confirm 790 genes and 5 striatal modules with CAG length-dependent dysregulation at the protein level, and validated 22 striatal module genes as modifiers of mHtt toxicities in vivo.
Wisdom of crowds for robust gene network inference
Reconstructing gene regulatory networks from high-throughput data is a long-standing challenge. Through the Dialogue on Reverse Engineering Assessment and Methods (DREAM) project, we performed a comprehensive blind assessment of over 30 network inference methods on Escherichia coli, Staphylococcus aureus, Saccharomyces cerevisiae and in silico microarray data. We characterize the performance, data requirements and inherent biases of different inference approaches, and we provide guidelines for algorithm application and development. We observed that no single inference method performs optimally across all data sets. In contrast, integration of predictions from multiple inference methods shows robust and high performance across diverse data sets. We thereby constructed high-confidence networks for E. coli and S. aureus, each comprising ~1,700 transcriptional interactions at a precision of ~50%. We experimentally tested 53 previously unobserved regulatory interactions in E. coli, of which 23 (43%) were supported. Our results establish community-based methods as a powerful and robust tool for the inference of transcriptional gene regulatory networks.
Funding: National Institutes of Health (U.S.); National Centers for Biomedical Computing (U.S.) (Roadmap Initiative U54CA121852); Howard Hughes Medical Institute; National Institutes of Health (U.S.) (Director's Pioneer Award DP1 OD003644); Swiss National Science Foundation (Fellowship).
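The integration step that gives the "wisdom of crowds" its robustness is, at heart, rank aggregation across methods. A minimal sketch, with made-up scores and a simple mean-rank rule standing in for the community integration used in the study:

```python
import numpy as np

# Hypothetical confidence scores: rows are inference methods, columns are
# candidate regulatory edges (values invented for illustration).
scores = np.array([
    [0.9, 0.2, 0.5, 0.1],
    [0.7, 0.3, 0.8, 0.2],
    [0.6, 0.1, 0.9, 0.4],
])

# Rank edges within each method (0 = that method's most confident edge),
# then average the ranks across methods.
ranks = np.argsort(np.argsort(-scores, axis=1), axis=1)
mean_rank = ranks.mean(axis=0)

# Community prediction: edges ordered by average rank, best supported first.
community_order = np.argsort(mean_rank)  # -> [2, 0, 1, 3]
```

Edge 2 tops the ensemble even though one of the three methods prefers edge 0, which is exactly the behaviour that makes the aggregate robust to any single method's biases.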